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Preface 


A course in discrete structures (discrete mathematics) played an important role 
in Curriculum 68, the very first ACM Computer Science Curriculum Guide: 
“This course introduces the student to those fundamental algebraic, logical, and 
combinatoric concepts from mathematics needed in the subsequent computer 
science courses and shows the applications of these concepts to various areas 
of computer science"! Fast forward 45 years or so (through mobile comput- 
ing, wireless networks, robotics, virtual reality, 3-D graphics, the Internet ...) 
to the joint ACM/IEEE-CS Computer Science Curricula 2013, where—still— 
discrete structures are of fundamental importance. “The material in discrete 
structures is pervasive in the areas of data structures and algorithms but ap- 
pears elsewhere in computer science as well. For example, an ability to create 
and understand a proof—either a formal symbolic proof or a less formal but 
still mathematically rigorous argument—is important in virtually every area of 
computer science, including (to name just a few) formal specification, verifica- 
tion, databases, and cryptography. Graph theory concepts are used in networks, 
operating systems, and compilers. Set theory concepts are used in software 
engineering and in databases. Probability theory is used in intelligent systems, 
networking, and a number of computing applications.”* 

This Seventh Edition was guided by Curricula 2013, and virtually all of the 
Core Tier | and Tier 2 topics for discrete structures from that document are includ- 
ed. Covering all those topics can fill a one-semester course, but there is certainly 
enough material in this edition to make for a very respectable two-semester course. 

However much we as instructors may see the value in this foundational 
course, it is a difficult experience for many students, who often view it as a series 
of unconnected topics with little or no application to the rest of their chosen field 
of study. In the big picture, these topics are bound together by themes of 


* importance of logical thinking 
* power of mathematical notation 
* usefulness of abstractions 


but such themes are best appreciated in hindsight. Telling students, “You will 
need ideas from this course in many of your future computer science courses," 
is also of little motivation. That's why it is important to carve out time in your 
course syllabus (for either a one-semester or two-semester course) for some of 
the applications of this material. Here are topics in this edition that you may 


‘Communications of the ACM, Vol. 11, Issue 3 (March 1968), pp. 151-197. 
2Сотршег Science Curricula 2013, Pre-release version, http://cs2013.com 


| xiv Preface 


choose from, according to your interests and the interests of your students. Yes, 
students will probably see most of these topics in more detail in later computer 
science courses, but a quick introduction now can keep their interest and make 
your claim of relevance more credible. 


Section 1.5 Logic programming 
Sections 1.6 and 2.3 Proof of correctness 
Section 3.3 Analysis of algorithms 
Section 5.3 Relations and databases 
Section 5.6 The mighty mod function 
Section 6.4 Huffman codes 

Section 8.2 Logic networks 

Section 9.2 Coding theory 


In addition, there is a Special Interest Page in each chapter that highlights 
interesting applications culled from “the real world.” 


NEW IN THE SEVENTH EDITION 


* The former Chapters 2 and 3 have been reorganized as Chapters 2, 3, and 
4 for better clarity and sequencing 


* New sections or subsections have been added: 


Probability 
* Bayes’ Theorem 
* Binomial Distribution 


Order of Magnitude (new section) 
* The Master Theorem 
* Proof of the Master Theorem 


Matrices 
* Gaussian Elimination 


Coding Theory (new section) 
* Introduction 
* Background: Homomorphisms and Cosets 
* Generating Group Codes 
* Decoding Group Codes 


e “Special interest pages"—one per chapter—have been introduced to add 
relevance and interest to the material being presented. 


* Answers to all odd-numbered exercises, as opposed to answers to fewer, 
selected exercises, appear in the back of the book. When an exercise asks 


Ргеїасе ху 


for a proof, the complete proof is given. Otherwise, the answer is just 
the answer, not necessarily the solution. A Student Solutions Manual with 
solutions for odd-numbered exercises from the book is available from the 
Web site at www.whfreeman.com/gersting. The student manual also in- 
cludes two sample tests per chapter. A complete Solutions Manual is avail- 
able to instructors from the publisher. 


* Many new exercises have been added, particularly with an eye toward 
pairing odd-numbered exercises with similar even-numbered exercises. 


e Of course, student learning aids such as chapter objectives, practice prob- 
lems, reminders, section reviews, and chapter reviews remain. 


WEB SITE 


Online Study Guide 


A Web site for the book may be found at www.whfreeman.com/gersting. The Web 
pages contain representative new example problems (not contained in the text) for 
many of the end-of-section Techniques. Each Technique that has a corresponding 
Web page example is marked with the icon D. 

Each example on the Web first states the problem. Then succeeding pages 
develop the solution, much as the student would be expected to write it. As the stu- 
dent navigates the pages, the solution unfolds step-by-step. A compressed audio 
file is also part of each Web page after the initial problem statement. The audio file 
contains a first-person stream-of-consciousness thought process about that step of 
the solution—why it occurred to the narrator to try this, why it looked promising, 
what knowledge was being called on to suggest that this step should come next, 
and so on. The point is, students see perfect and complete worked-out proofs in 
the textbook and often see them performed by the instructor. Yet when a student 
goes home and tries to produce such a solution by himself or herself, he or she is 
unsure where to start or how to think about the problem or how to see any pat- 
tern to enable a guess as to what to do next. Consequently the student gives up in 
frustration. The purpose of the audio narration is to share the “secret picture” that 
mathematicians use to solve problems. 

To access the problems, after you go to www.whfreeman.com/gersting, select 
a chapter section, then select a sample problem and follow its step-by-step process 
with the “Next” button. 


PowerPoint Slides 


Instructors who visit the web site will also have access to PowerPoint slides 
accompanying each section of the text. 
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NOTE TO THE STUDENT 


As you go through this book, you'll encounter many new terms and new ideas. 
Try reading with pencil and paper at hand and work the Practice problems as you 
encounter them. They are intended to reinforce or clarify some new terminology 
or method just introduced; answers are given at the back of the book. Pay atten- 
tion also to the Reminders that point out common pitfalls or provide helpful hints. 

Be sure to visit the Web site at www.whfreeman.com/gersting for detailed, 
worked-out solutions to additional example problems tied to the Techniques in 
each section. The Web site solutions are accompanied by audio files that explain 
each step. A Student Solutions Manual with solutions for odd-numbered exercises 
from the book is available from the Web site. The student manual also includes two 
sample tests per chapter. 

You may find at first that the thought processes required to solve the exercises 
in the book are new and difficult. Your biggest attribute for success will be persever- 
ance. Here's what I tell my students: “If you do not see at first how to solve a problem, 
don’t give up, think about it some more; be sure you understand all the terminology 
used in the problem, play with some ideas. If no approach presents itself, let it be 
and think about it again later. Repeat this process for days on end. When you finally 
wake up in the middle of the night with an idea, you'll know you are putting in the 
right amount of effort for this course.” Mathematical results don't spring fully formed 
from the foreheads of mathematical geniuses; well, maybe from mathematical ge- 
niuses, but for the rest of us, it takes work, patience, false starts, and perseverance. 

Enjoy the experience! 


Chapter 


Formal Logic 


CHAPTER OBJECTIVES | ° 


Question: 


After studying this chapter, you will be able to: 


* Use the formal symbols of propositional logic. 

* Find the truth value of an expression in propositional logic. 

* Construct formal proofs in propositional logic, and use such proofs to deter- 
mine the validity of English language arguments. 

e Use the formal symbols of predicate logic. 

* Find the truth value in some interpretation of an expression in predicate logic. 

* Use predicate logic to represent English language sentences. 

* Construct formal proofs in predicate logic, and use such proofs to determine 
the validity of English language arguments. 

* Understand how the programming language Prolog is built on predicate logic. 

* Mathematically prove the correctness of programs that use assignment state- 
ments and conditional statements. 


You have been selected to serve on jury duty for a criminal case. The attorney for the 
defense argues as follows: 


If my client is guilty, then the knife was in the drawer. Either the knife was not in the 
drawer or Jason Pritchard saw the knife. If the knife was not there on October 10, 
it follows that Jason Pritchard did not see the knife. Furthermore, if the knife was 
there on October 10, then the knife was in the drawer and also the hammer was in 
the barn. But we all know that the hammer was not in the barn. Therefore, ladies 
and gentlemen of the jury, my client is innocent. 


Is the attorney's argument sound? How should you vote? 


It's much easier to answer this question if the argument is recast in the notation 
of formal logic. Formal logic strips away confusing verbiage and allows us to 
concentrate on the underlying reasoning being applied. In fact, formal logic—the 
subject ofthis chapter— provides the foundation for the organized, careful method 
of thinking that characterizes any reasoned activity—a criminal investigation, a 
scientific experiment, a sociological study. In addition, formal logic has direct 
applications in computer science. The last two sections of this chapter explore 
a programming language based on logic and the use of formal logic to verify 
the correctness of computer programs. Also, circuit logic (the logic governing 


| 2 Formal Logic 


computer circuitry) is a direct analog of the statement logic of this chapter. We 
will study circuit logic in Chapter 8. 


SECTION 1.1 | STATEMENTS, SYMBOLIC REPRESENTATION, 


AND TAUTOLOGIES 


Formal logic can represent the statements we use in English to communicate facts 
or information. A statement (or proposition) is a sentence that is either true or 
false. 


ИШЕ Consider the following: 


a. Ten is less than seven. 

b. Cheyenne is the capital of Wyoming. 

c. She is very talented. 

d. There are life forms on other planets in the universe. 


Sentence (a) is a statement because it is false. Sentence (b) is a statement because 
it is true. Sentence (c) is neither true nor false because “she” is not specified; there- 
fore (c) is not a statement. Sentence (d) is a statement because it is either true or 
false; we do not have to be able to decide which. e 


Connectives and Truth Values 


In English, simple statements are combined with connecting words like and to 
make more interesting compound statements. The truth value of a compound 
statement depends on the truth values of its components and which connecting 
words are used. If we combine the two true statement, “Elephants are big,” and, 
“Baseballs are round,” we would consider the resulting statement, “Elephants are 
big and baseballs are round,” to be true. In this book, as in many logic books, 
capital letters near the beginning of the alphabet, such as А, B, and C, are used to 
represent statements and are called statement letters; the symbol ^ is a logical 
connective representing and. We agree, then, that if A is true and B is true, A /\ B 
(read “А and B") should be considered true. 


PRACTICE 1 | ! 


a. If A is true and B is false, what truth value would you assign to А ^ В? 
b. If A is false and В is true, what truth value would you assign to А ^ В? 
c. If A and B are both false, what truth value would you assign to А ^ В? ш 


The expression А ^ В is called the conjunction of А and В, and A and В are 
called the conjuncts of this expression. Table 1.1 summarizes the truth value of 
A ^ B for all possible truth values of the conjuncts A and В. Each row of the table 


'Answers to practice problems are in the back of the text. 


ТАВГЕ 1.1 


Section 1.1 Statements, Symbolic Representation, and Tautologies 3 


TABLE 1.2 


represents a particular truth value assignment to the statement let- 


A 


B 


ters, and the resulting truth value for the compound expression is 
AVB | shown. 
T Another connective is the word or, denoted by the symbol V. The 


expression A V B (read “А or B”) is called the disjunction of А and B, 


and А and В are called the disjuncts of this expression. ТЁ А and B аге 
both true, then A V B would be considered true, giving the first line 


= 
= 
F 
F 


njal = ч 


of the truth table for disjunction (Table 1.2). 


PRACTICE 2 


Use your understanding of the word or to complete the truth table for disjunction, 
Table 1.2. m 


PRACTICE 3 


Statements may be combined in the form “if statement 1, then statement 2.” If 
A denotes statement 1 and B denotes statement 2, the compound statement would 
be denoted by А — B (read “4 implies B”). The logical connective here is implica- 
tion, and it conveys the meaning that the truth of A implies or leads to the truth of 
B. In the implication A — B, A stands for the antecedent statement and B stands 
for the consequent statement. 

The truth table for implication is less obvious than that for conjunction or 
disjunction. To understand its definition, let’s suppose your friend remarks, “If I 
pass my economics test, then ГІЇ go to the movie Friday.” If your friend passes the 
test and goes to the movie, the remark was true. If your friend passes the test but 
doesn’t go to the movie, the remark was false. If your friend doesn’t pass the test, 
then—whether he or she goes to the movie or not—you could not claim that the 
remark was false. You would probably want to give the benefit of the doubt and say 
that the statement was true. By convention, A — В is considered true if A is false, 
regardless of the truth value of B. 


Summarize this discussion by writing the truth table for А — В. и 


The equivalence connective is symbolized by «». Unlike conjunction, 
disjunction, and implication, the equivalence connective is not really a funda- 
mental connective but a convenient shortcut. The expression А <> В stands for 
(A — B) ^ (B > A). We can write the truth table for equivalence by constructing, 
one piece at a time, a table for (4 — B) ^ (B — A), as in Table 1.3. From this truth 
table, A < B is true exactly when А and В have the same truth value. 


TABLE 1.3 

A B AB ВА (A > В) ^ (В – A) 
T T T T T 

T F F T F 

F T T F F 

F F T T T 


The connectives we've seen so far are called binary connectives because 
they join two expressions together to produce a third expression. Now let's consider 
a unary connective, a connective acting on one expression to produce a second 
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expression. Negation is a unary connective. The negation of A—symbolized by 
A'—is read “not A.” 


PRACTICE 4 | Write the truth table for A’. (It will require only two rows.) n 


Table 1.4 summarizes the truth values for all of the logical connectives. This 
information is critical to an understanding of logical reasoning. 


TABLE 1.4 
A B A^B | AVB | А-В | А-В А' 
T T T y T T F 
т Е Е T F F 
F T F T T F T 
F F F F T T 


Because of the richness of the English language, words that have differ- 
ent shades of meaning are nonetheless represented by the same logical connec- 
tive. Table 1.5 shows the common English words associated with various logical 
connectives. 


TABLE 1.5 


English Word Logical Connective Logical Expression 


and; but; also; in addition; Conjunction АЛВ 
moreover 


or Disjunction AVB 


If A, then B. Implication AB 
A implies B. 

A, therefore B. 

A only if B. 

B follows from A. 

A is a sufficient condition 
for B. 

B is a necessary condition 
for A. 


A if and only if B. Equivalence AB 
A is necessary and 
sufficient for B. 


REMINDER 


not A Negation А! 
A only if B means It is false that А... 
AB It is not true that А... 


Suppose that А — B is true. Then, according to the truth table for implication, 
the consequent, В, can be true even though the antecedent, А, is false. So while 
the truth of A leads to (implies) the truth of В, the truth of B does not imply the 
truth of A. The phrase “В is a necessary condition for A” to describe A — В simply 


PRACTICE 5 
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means that if A is true, then В is necessarily true, as well. “А only if В” describes 
the same thing, that A implies B. 


The statement, “Fire is a necessary condition for smoke,” can be restated, “If there 
is smoke, then there is fire.” The antecedent is “there is smoke,” and the conse- 


quent is “there is fire.” 


a. If the rain continues, then the river will flood. 


Name the antecedent and consequent in each of the following statements. (Hint: Rewrite 
each statement in if-then form.) 


b. A sufficient condition for network failure is that the central switch goes down. 
c. The avocados are ripe only if they are dark and soft. 
d. A good diet is a necessary condition for a healthy cat. 


Expressing the negation of a statement must be done with care, especially for a 
compound statement. Table 1.6 gives some examples. 


TABLE 1.6 


Statement 


Correct Negation 


Incorrect Negation 


It will rain tomorrow. 


Peter is tall and thin. 


It is false that it will rain 
tomorrow. 


It will not rain tomorrow. 


It is false that Peter is tall 
and thin. 


Peter is not tall or he is 
not thin. 


Peter is short or fat. 


Peter is short and fat. 


Too strong a statement. 
Peter fails to have both 
properties (tallness and 
thinness) but may still 
have one property. 


The river is shallow or 
polluted. 


It is false that the river is 
shallow or polluted. 


The river is neither shallow 
nor polluted. 


The river is deep and 
unpolluted. 


The river is not shallow or 
not polluted. 


Too weak a statement. 
The river fails to have 
either property, not just 
fails to have one property. 


PRACTICE 6 | Which of the following represents A’ if А is the statement “Julie likes butter but hates 


a. Julie hates butter and cream. 

b. Julie does not like butter or cream. 
c. Julie dislikes butter but loves cream. 
d. Julie hates butter or likes cream. 


6 Formal Logic 


We can string statement letters, connectives, and parentheses (or brackets) 
together to form new expressions, as in 


(4 > B) Л(В A) 


Of course, just as in a computer programming language, certain syntax rules 
(rules on which strings are legitimate) prevail; for example, 


A) AA C 


would not be considered a legitimate string. An expression that is a legitimate 
string is called a well-formed formula, or wff. To reduce the number of parenthe- 
ses required in a wff, we stipulate an order in which connectives are applied. This 
order of precedence is 


. connectives within parentheses, innermost parentheses first 


А 


1 
2; 
3. A, V 
4. > 
5. e 
This means that the expression A V B’ stands for A V (B^), not (А V B)'. Similarly, 
A V B — C means (A V B) > C, not A V (B > C). However, we often use paren- 
theses anyway, just to be sure that there is no confusion. 

In a wff with a number of connectives, the connective to be applied last is the 
main connective. In 


A ^(B—C) 
the main connective is ^. In 
(AV B)AC)> (BV C!) 


the main connective is —. Capital letters near the end of the alphabet, such as Р, О, 
R, and S, are used to represent wffs. Thus P could represent a single statement let- 
ter, which is the simplest kind of wff, or a more complex wff. We might represent 


(AV B)AC)>(BVC') 
as 
P—Q 


if we want to hide some of the details for the moment and only concentrate on the 
main connective. 

Wffs composed of statement letters and connectives have truth values that 
depend on the truth values assigned to their statement letters. We write the 
truth table for any wff by building up the component parts, just as we did for 
(A > B) ^ (B > A). The main connective is addressed in the last column of the 
table. 
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The truth table for the wff A V B' — (A V B)' is given in Table 1.7. The main con- 


nective, according to the rules of precedence, is implication. e 
TABLE 1.7 
A B |B' | AVB' | AVB | (АУВ) | AVB'eS(AV BJ 
T | T F T Т F F 
ТЕ Т T Т F F 
F/T] F F T F T 
F Е T T F T T 


If we are making a truth table for a wff that contains n different statement 
letters, how many rows will the truth table have? From truth tables done so far, 
we know that a wff with only one statement letter has two rows in its truth table, 
and a wff with two statement letters has four rows. The number of rows equals the 
number of true-false combinations possible among the statement letters. The first 
statement letter has two possibilities, T and F. For each of these possibilities, the 
second statement letter has two possible values. Figure 1.1a pictures this as a two- 
level “tree” with four branches showing the four possible combinations of T and Е 
for two statement letters. For n statement letters, we extend the tree to л levels, as 
in Figure 1.16. The total number of branches then equals 2". The total number of 
rows in a truth table for n statement letters is also 2”. 


Statement letters Choices 
1 2 = 2! branches 
2 4 = 2? branches 


(a) 


Statement letters Choices 
1 2 = 2! branches 
2 4 = 2? branches 
3 8 = 23 branches 
n 2” branches 


Figure 1.1 (b) 
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This tree structure also tells us how to enumerate all the T-F com- 


TABEED binations among the л statement letters when setting up a truth table. If 
A B e we read each level of the tree from bottom to top, it says that the T-F 
T T T values for statement letter л (which will compose the last column of the 
T T F truth table) alternate, those for statement letter n — 1 alternate every 

two values, those for statement letter n — 2 alternate every four values, 
Ш E t and so forth. Thus a truth table for three statement letters would begin 
T F F as shown in Table 1.8. The values for statement letter C alternate, those 
F T T for statement letter В alternate in groups of two, and those for state- 
F T F ment letter А alternate in groups of four, resulting in something like a 
F F T sideways version of the tree. (Reading the rows from the bottom up and 

using 1 for T and 0 for F shows that we are simply counting up from zero 
д E p in binary numbers.) 


PRACTICE 7 | Construct truth tables for the following wffs. 


a. (A В) < (B — A) (Remember that C < D is true precisely when C and D have the same 
truth value.) 

b. (АУ A) 2 (B^ B?) 

c. [(MAB)C'] 

d. (4 — B) e (В' А!) н 


Tautologies 


A wff-like item (d) of Practice 7, whose truth values are always true, is called a 
tautology. A tautology is “intrinsically true” by its very structure; it is true no 
matter what truth values are assigned to its statement letters. A simpler example 
of a tautology is А V A’; consider, for example, the statement “Today the sun will 
shine or today the sun will not shine,’ which must always be true because one 
or the other of these must happen. A wff like item (b) of Practice 7, whose truth 
values are always false, is called a contradiction. A contradiction is “intrinsically 


REMINDER false" by its very structure. A simpler example of a contradiction is A ^ A’; con- 
A, B, C stand for single sider “Today is Tuesday and today is not Tuesday,” which is false no matter what 
statement letters; P, Q, day of the week it is. 

R, S stand for wffs. Suppose that P and Q represent two wffs, and it happens that the wff 


P «» Q is a tautology. If we did a truth table using the statement letters in P 
and Q, then the truth values of the wffs P and Q would agree for every row of 
the truth table. In this case, P and Q are said to be equivalent wffs, denoted by 
P <= О. Thus P < О states a fact, namely, that the particular wff Р < Q isa 
tautology. Practice 7(d) has the form P <> О , where P is the wff (A — B) and 
О is the wff (B' — A’), and P < О was shown to be a tautology. Therefore, 
(A > В) <= (B' > A^). 

We will list some basic equivalences, prove one or two of them by construct- 
ing truth tables, and leave the rest as exercises. We represent any contradiction by 
0 and any tautology by 1. 


1а. 
2а. 
За. 


4а. 
5а. 


AVBSBVA Ib. 
(4VB)VC«eAV(BVC) 2b. 
А\М (ВЛА С)= 3b. 
(AV B)A(AVC) 

AVO0SA 4b. 
AVA' 1 5b. 


| Section 1.1 Statements, Symbolic Representation, and Tautologies 9 


Some Tautological Equivalences 


АЛВ = ВЛА 


(A^B)AC«A^(B^C) 


A^GVC)e 
(A^B)V(A^C) 


AALSA 
АЛА &0 


(commutative properties) 
(associative properties) 


(distributive properties) 


(identity properties) 


(complement properties) 


Note that 2a allows us to write А V B V C with no need for parentheses because 
the grouping doesn't matter; similarly, 2b allows us to write 4 ^ B ^ C. 


The truth table in Table 1.9a verifies equivalence 1a, the commutative property for 
disjunction, and that in Table 1.9b verifies 4b, the identity property for conjunc- 
tion. Note that only two rows are needed for Table 1.9b because 1 (a tautology) 


cannot take on false truth values. e 
TABLE 1.9 
e| A | B | дув | BV AVBoBVA || A 1 | AA1| AA1GA 
T T T T T T T JT T 
T F T T T F i F T 
F T T T TD 
FF F F T 
PRACTICE 8 | Verify equivalence 5a. 
E 


The equivalences in the list are grouped into five pairs. In each pair, one 
equivalence can be obtained from the other by replacing ^ with V, V with ^, 0 
with 1, or 1 with 0. Each equivalence in a pair is called the dual of the other. Thus, 
la and 1b (commutativity of disjunction and commutativity of conjunction) are 
duals of each other. This list of equivalences appears in a more general setting in 


Chapter 8. 


Two additional equivalences that are very useful are De Morgan's laws, 
named for the nineteenth-century British mathematician Augustus De Morgan, 
who first stated them. This theorem is easy to prove (see Exercises 26e and 26f). 
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© THEOREM DE MORGAN'S LAWS 
(АУ В) &&A'^B' аі | (AA B)' &A'VB' 


Each is the dual of the other. De Morgan's laws help in expressing the negation of 
a compound statement, as in Practice 6. 

Think of these tautological equivalences as patterns; in order to use one 
of them, you must match its pattern exactly. For example, you can't say that 
(A ^ B) V C & A ^ (B V C) by either of the associative properties because nei- 
ther of these properties uses both conjunction and disjunction. 

Suppose that P and Q are equivalent wffs. Then in any wff where P appears, P 
can be replaced by Q with no change in the overall truth values. It's like replacing 
a $20 bill in your wallet with two $10 bills—the total value of your money hasn't 
changed. 


From Practice 7(d), А — В is equivalent to B’ > A’. The wff (А — B) > B should 
therefore be equivalent to (B' — А”) — B. This equivalence is verified by Tables 
1.10a and 1.10b. ө 


ТАВГЕ 1.10 


@| A В |А В (AB) B) (b 
T T F T T 

T 

F 

T 


F T 
T T 
T F 


4/4/> 


т| 14| ғ 
тт ғ 
т | п 

njalin | Ф 


Logical Connectives in the Real World 


Web search engines allow exploration of the vast resources available on the Web, 
but a little care in your search query can help focus the results more quickly. For 
example, if you enter 


used cars 


in a Web search engine, you may get back references to any Web site containing 
either the word used or the word cars; this could include sites for antique dealers 
and sites for the latest auto racing results. Entering the phrase 


“used cars" 


in quotes restricts the search, on most search engines, to Web sites containing this 
exact phrase. Most search engines also allow you to enter an expression using log- 
ical connectives as your search query, which can help make the query even more 
specific. To further narrow your used car search, for example, you could enter 
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“used cars” AND (Ford OR Buick) 


This would tend to limit your search to sites that mention only particular brands 
of used cars, although you could still wind up with a link to Jim Bob Ford’s Loan 
Shark Agency, which will lend you money for any used car. The query 


“used cars” AND (Ford OR Buick) AND NOT trucks 


would eliminate sites mentioning trucks. Many search engines use + (a plus sign) 
in place of AND, and — (a minus sign) in place of AND NOT. 

The logical connectives AND, OR, and NOT are also available in many pro- 
gramming languages, as well as on programmable graphing calculators. These 
connectives, in accordance with the truth tables we have defined, act on com- 
binations of true or false expressions to produce an overall truth value. Such 
truth values provide the decision-making capabilities fundamental to the flow of 
control in computer programs. Thus, at a conditional branch in a program, if the 
truth value of the conditional expression is true, the program will next execute 
one section of code; if the value is false, the program will next execute a different 
section of code. If the conditional expression is replaced by a simpler, equivalent 
expression, the truth value of the expression and hence the flow of control of 
the program is not affected, but the new code is easier to understand and may 
execute faster. 


| EXAMPLE7 | Consider a statement in a computer program that has the form 


if ((outflow > inflow) and not ((outflow > inflow) and (pressure < 1000))) 
do something; 

else 
do something else; 


Here the conditional expression has the form 
A ЛМ(А Л B) 


where А is “outflow > inflow” апа В is “pressure <1000.” This expression can be 
simplified by replacing some wffs with equivalent wffs. 


A ^ (A ^ B)' && A ^(A' V B^) (De Morgan's laws) 
«» (АЛА) V (A ^ B!) (tautology 3b) 
«» 0 V (A ^ B!) (tautology 5b) 
= (A^B)VO0 (tautology 1a) 
<=! ААЛ” (tautology 4а) 


The statement form can therefore be written 


if ((outflow > inflow) and not (pressure < 1000)) 
do something; 
else 
do something else; e 


€ DEFINITION 
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Finally, the truth tables for conjunction, disjunction, and negation are implemented 
by electronic devices called “gates” (AND gate, OR gate, inverter, respectively) 
that are fundamental building blocks in computer circuitry. We’ll see in Chapter 
8 (Boolean Algebra and Computer Logic) how to combine these gates into more 
complex logic networks to carry out specific tasks. 


An Algorithm 


To test whether a wff is a tautology, we can always write its truth table. For n 
statement letters, 2” rows will be needed for the truth table. Suppose, however, 
that the wff has implication as its main connective, so that it has the form P > О 
where P and Q are themselves wffs. Then we can use a quicker procedure than 
constructing a truth table to determine whether P > О is a tautology. We assume 
that P > О is not a tautology, and we see whether this leads to some impossible 
situation. If it does, then the assumption that P — О is not a tautology is also im- 
possible, and P — О must be a tautology after all. 

To assume that P — Q is not a tautology is to say that it can take on false 
values, and, by the truth table for implication, P — О is false only when P is 
true and Q false. By assigning P true and Q false, we determine possible truth 
values for the wffs making up P and Q. We continue assigning the truth values so 
determined until all occurrences of statement letters have a truth value. If some 
statement letter is assigned both true and false values by this process, we have an 
impossible situation, so the wff P — Q must be a tautology. Otherwise, we have 
found a way to make P > О false, and it is not a tautology. 

What we have described is a set of instructions—a procedure—for carry- 
ing out the task of determining whether P — Q is a tautology. This procedure 
can be executed by mechanically following the instructions; in a finite amount 
of time, we will have the answer. In computer science terms, the procedure is an 
algorithm. 


ALGORITHM 
An algorithm is a set of instructions that can be mechanically executed in a finite 
amount of time in order to solve some problem. 


Algorithms constitute the very heart of computer science, and we will have 
much to say about them throughout this book. You are probably already aware that 
the major task in writing a computer program for solving a problem consists of 
devising an algorithm (a procedure) to produce the problem solution. 

Algorithms are often described in a form that is a middle ground between 
a purely verbal description in paragraph form (as we gave for deciding whether 
P — О 15 a tautology) and a computer program (that, if executed, would actu- 
ally carry out the steps of the algorithm) written in a programming language. 
This compromise form to describe algorithms is called pseudocode. An algorithm 
written in pseudocode should not be hard to understand even if you know nothing 
about computer programming. The only thing to note about the pseudocode used 
in this book is that lines preceded by double slashes (//) are explanatory com- 
ments, not part of the algorithm itself. 

Following is a pseudocode form ofthe algorithm to determine whether P — О 
is a tautology. 
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ALGORITHM 


TAUTOLOGYTEST 


TautologyTest (м P; wtf О) 
//Given wffs P and О, decides whether the wff P — О is a tautology. 


//Assume P — Q is not a tautology 
P — true //assign T to P 
О = false //assign F to О 


repeat 
for each compound wff already assigned a truth value, 
assign the truth values determined for its components 
until all occurrences of statements letters have truth values 


if some letter has two truth values 
then //contradiction, assumption false 
write (“Р — О is a tautology.") 
else //found a way to make P — О false 
write (“Р — О is not a tautology.") 
end if 
end ZautologyTest 


EXAMPLE 8 


The algorithm first assigns the truth values “true” to P and "false" to О, con- 
sistent with the assumption that P — Q is not a tautology. The algorithm then 
enters a /oop, where a sequence of steps is repeated until some condition is met. 
Within the loop, truth assignments continue to be made to smaller and smaller 
components of the original P and Q until all occurrences of individual statement 
letters have truth values. Then the algorithm tests whether a contradiction has 
occurred, and writes out the information about whether P — Q is a tautology. 


Consider ће wff (А — B) > (B' — А”). This matches the pattern needed in or- 
der to use algorithm 7autologyTest, namely P + О, where P is A > B and О is 
B' — A'. Following the algorithm, we first assign truth values 


A — B true and В’ — A’ false 


Moving on to the loop, the assignment of false to the compound statement B' — A' 
determines the further assignments 


B' true and A’ false 
or 
B false and A true 
Now working with P, A true and А — B true determines the assignment 


B true 


REMINDER 


Algorithm TautologyTest 
applies only when the 
main connective is >. 
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At this point all occurrences of statement letters have truth values, as follows: 


A-T B-T B-F A-T 
(A > B) — (B' > A') 


Se ee 


TD 18 


This terminates the loop. In the final step of the algorithm, В now has ап аѕ- 
signment of both T and F, so the algorithm decides that (4 — B) > (B' > A^) 
is a tautology. Actually, we learned this earlier (1n Practice 7(d)) by building a 
truth table. ө 


Algorithm TautologyTest decides whether wffs of a certain form, namely, 
those where the main logical connective is —, are tautologies. However, the pro- 
cess of building a truth table and then examining all the truth values in the final 
column constitutes an algorithm to decide whether an arbitrary wff is a tautology. 
This second algorithm is therefore more powerful because it solves a more general 
problem, but algorithm 7autologyTest is usually faster for those wffs to which it 
applies. 


SPECIAL INTEREST PAGE 


Can “And” Ever Be “Ог”? 


In 2003, OfficeMax sued the United States government 
(the Internal Revenue Service) for a return of excise 
tax that OfficeMax had paid for telephone service. 
This was not a trivial financial matter-OfficeMax had 
paid over $380,000.00 in telephone excise tax. To un- 
derstand the nature of the argument, we need a brief 
history of the federal tax on telephone service. 

The first telephone tax was enacted by Congress in 
1898 (22 years after the invention of the telephone by 
Alexander Graham Bell). The tax was intended to help 
pay the federal debt incurred by the Spanish-American 
War, and it was repealed, as planned, in 1902. Over sub- 
sequent years, the telephone tax came and went with 
fluctuating rates as the government incurred debt. Res- 
urrected again in 1932, the tax, in one form or another, 
has been in effect ever since. In 1965, Congress defined 
local phone service and “toll telephone service" (long- 
distance calls) as two categories of taxable service, and it 
set the tax rate at 3%. Of interest to this discussion is the 
definition Congress gave at this time of “toll telephone 
service,” which states in part that it is “a telephonic qual- 
ity communication for which there is a toll charge which 
varies in amount with the distance and elapsed transmis- 
sion time of each individual communication.” Keep in 
mind that in 1965, there was essentially only a single 
telephone service provider in the United States, namely 
AT&T, and at that time AT&T charges were based on both 
the duration and the distance of each call. By the 1990s, 
AT&T had been broken up and there were a number of 
competitive telephone companies. In addition, telephone 
companies began to charge a flat rate per minute for na- 
tionwide long-distance calls. The phone companies col- 
lected the federal excise tax from their customers and 
passed the tax on to the federal government. 

OfficeMax used MCI as its phone service provider 
from 1999 to 2002, during which time MCI collected the 
excise tax from OfficeMax. In 2003, OfficeMax sued the 
federal government for a refund of the excise taxes MCI 
had collected on the basis that MCI was not providing 
“toll telephone service” as defined by Congress in 1965 
because MCI was charging a rate based not on time and 
distance but only on time. Here is the issue: What exact- 
ly is the meaning of the word “and” in “varies in amount 
with the distance and elapsed transmission time”? 

OfficeMax argument: “And” means the conjunc- 
tive “and,” as the truth table for “and” is defined in 


ө Chapter 


formal logic. For the tax to apply, the phone company 
had to charge its customers a rate based on both time 
and distance. 

Internal Revenue Service argument: Elsewhere 
in this same legislation, Congress did use “and” in a 
disjunctive sense when it defined “communication ser- 
vices” as “local telephone service, toll telephone ser- 
vice, and teletypewriter exchange service.” Because 
these three are mutually exclusive, “апа” here could 
not have a conjunctive meaning. 

The majority opinion of the United States Court 
of Appeals for the Sixth Circuit, in 2005, agreed with 
OfficeMax. Its reasoning was (1) dictionary defini- 
tions, legal usage guides, and case law assert that “and” 
is generally conjunctive, (2) the conjunctive usage is 
consistent with the billing mechanism used by the only 
telephone company in existence at the time the law was 
written, (3) the disjunctive interpretation would allow 
the possibility of a telephone charge based solely on dis- 
tance, which is a ridiculous idea that Congress surely 
did not intend, and (4) lower courts had found in favor of 
OfficeMax. In short, the IRS lost this case and a num- 
ber of similar cases, and in 2006 it announced that 
phone service that is charged on time and not distance 
is not taxable. (The 3% excise tax on local telephone 
service is still in effect.) 

One must, however, appreciate the humor of the 
dissenting opinion in the OfficeMax case: “A host sep- 
arately asked two prospective guests what they liked to 
drink. One said, “I like bourbon and water.” The other 
said, “I like beer and wine.” When the second guest 
arrived at the event, the host served the guest a glass of 
beer mixed with wine. “What's that awful drink?” said 
the guest, to which the host answered, “You said you 
liked beer and wine.” Sometimes we apparently do use 
“and” in a disjunctive sense. So here is a legal—and 
financial—case that hinged on the truth table for the 
logical connective AND. How cool is that?! 


OFFICEMAX, INC., Plaintiff-Appellee, у. UNITED 
STATES of America, Defendant-Appellant, 
No. 04-4009, United States Court of Appeals, 
Sixth Circuit, Argued: July 29, 2005, Decided and 
Filed: November 2, 2005, 428 F.3d 583. Online at 
http://law.justia.com/cases/federal/appellate-courts/ 
F3/428/583/565375/ 


| 16 


Formal Logic 


SECTION 1.1 REVIEW 


TECHNIQUES 


Q) Construct truth tables for compound wffs. 
ÙD Recognize tautologies and contradictions. 


MAIN IDEAS 


e Wffs are symbolic representations of statements. 


EXERCISES 1.1 


* Truth values for compound wffs depend on the 
truth values of their components and the types of 
connectives used. 

* Tautologies are “intrinsically true" wffs—true for 
all truth values. 


1. Which of the following sentences are statements? 


a. The moon is made of green cheese. 
. He is certainly a tall man. 
. Two is a prime number. 


. Next year interest rates will rise. 
Next year interest rates will fall. 
.x:— 4-20 
. What is the truth value of each of the following statements? 


b 
c 
d. The game will be over by 4:00. 
e 
f. 
g 


a. 8 is even or 6 is odd. 

b. 8 is even and 6 is odd. 

c. 8is odd or 6 is odd. 

d. 8 is odd and 6 is odd. 

e. If 8 is odd, then 6 1s odd. 

f. If 8 is even, then 6 is odd. 

g. If 8 is odd, then 6 is even. 

h. If 8 15 odd and 6 is even, then 8 « 6. 

. Given the truth values A true, В false, and C true, what is the truth value of each of the following wffs? 
a. AN (BV C) c. (A^B)VC 

b. (A^B)VC d. A V(B' ^C) 

. Given the truth values A false, В true, and C true, what is the truth value of each of the following wffs? 
a. АЭ (ВУС) c. C>(A' AB’) 

b. (4VB)—»C а. A V (B' С) 

. Rewrite each of the following statements in the form “If А, then B.” 

a. Healthy plant growth follows from sufficient water. 

b. Increased availability of information is a necessary condition for further technological advances. 

c. Errors were introduced only if there was a modification of the program. 

d. Fuel savings implies good insulation or storm windows throughout. 


10. 


I. 


Section 1.1 Statements, Symbolic Representation, and Tautologies 17 


. Rewrite each of the following statements in the form “If A, then В.” 


a. Candidate Lu winning the election will be a sufficient condition for property taxes to increase. 
b. The user clicks Pause only if the game level changes. 

c. The components are scarce, therefore the price increases. 

d. Healthy hair is a necessary condition for good shampoo. 


. Common English has many ways to describe logical connectives. Write a wff for each of the following 


expressions. 
a. Either A or B b. Neither А nor B 
. Common English has many ways to describe logical connectives. Write a wff for each of the following 
expressions. 
a. B whenever А c. A indicates B 
b. A is derived from В d. A exactly when B 


. Several forms of negation are given for each of the following statements. Which are correct? 


a. The answer 15 either 2 or 3. 

1. Neither 2 nor 3 is the answer. 

2. The answer is not 2 or not 3. 

3. The answer is not 2 and it is not 3. 

b. Cucumbers are green and seedy. 

1. Cucumbers are not green and not seedy. 

2. Cucumbers are not green or not seedy. 

3. Cucumbers are green and not seedy. 

c. 2 < 7 and 3 is odd. 

1. 2 > 7 and 3 is even. 3. 2 =7 or3 is odd. 

2. 2 > 7 and 3 is even. 4. 2 > T or3 is even. 
Several forms of negation are given for each of the following statements. Which are correct? 
a. The carton is sealed or the milk is sour. 

1. The milk is not sour or the carton is not sealed. 

2. The carton is not sealed and also the milk is not sour. 

3. If the carton 1s not sealed, then the milk will be sour. 
b. Flowers will bloom only if it rains. 

1. The flowers will bloom but it will not rain. 

2. The flowers will not bloom and it will not rain. 

3. The flowers will not bloom or else it will not rain. 

c. If you build it, they will come. 

1. If you build it, then they won't come. 

2. You don't build it, but they do come. 

3. You build it, but they don't come. 

Write the negation of each statement. 
a. If the food is good, then the service is excellent. 
b. Either the food is good or the service is excellent. 


12. 


13. 


14. 


15. 
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c. Either the food is good and the service is excellent, or else the price is high. 
d. Neither the food is good nor the service excellent. 
e. If the price is high, then the food is good and the service is excellent. 
Write the negation of each statement. 
a. The processor is fast but the printer is slow. 
b. The processor is fast or else the printer is slow. 
c. If the processor is fast, then the printer is slow. 
d. Either the processor is fast and the printer is slow, or else the file is damaged. 
e. If the file is not damaged and the processor is fast, then the printer is slow. 
f. The printer is slow only if the file is damaged. 
Using the letters indicated for the component statements, translate the following compound statements 
into symbolic notation. 
a. A: prices go up; B: housing will be plentiful; C: housing will be expensive 
If prices go up, then housing will be plentiful and expensive; but if housing is not expensive, then it will 
still be plentiful. 
b. A: going to bed; B: going swimming; C: changing clothes 
Either going to bed or going swimming is a sufficient condition for changing clothes; however, chang- 
ing clothes does not mean going swimming. 
c. A: it will rain; B: it will snow 
Either it will rain or it will snow but not both. 
d. A: Janet wins; B: Janet loses; C: Janet will be tired 
If Janet wins or if she loses, she will be tired. 
e. A: Janet wins; B: Janet loses; C: Janet will be tired 
Either Janet will win or, if she loses, she will be tired. 
Using the letters indicated for the component statements, translate the following compound statements 
into symbolic notation. 
a. A: the tractor wins; B: the truck wins; C: the race will be exciting. 
Whether the tractor wins or the truck wins, the race will be exciting 
b. A: snow; B: rain; C: yesterday was cloudy 
Yesterday was cloudy but there was neither snow nor rain. 


c. A: Koalas will be saved; B: climate change is addressed; C: rising water levels 
Koalas will be saved only if climate change is addressed; furthermore, failure to address climate change 
will cause rising water levels. 


d. A: the city’s economy will improve; B: a strong school system 
The city’s economy will improve conditional upon a strong school system. 


e. A: the city’s economy will improve; B: a strong school system 
A strong school system is a necessary condition for the city’s economy to improve. 


Let A, B, and C be the following statements: 


A Roses аге red. 
B Violets are blue. 
C Sugar is sweet. 


Translate the following compound statements into symbolic notation. 
a. Roses are red and violets are blue. 
b. Roses are red, and either violets are blue or sugar is sweet. 


— 


| Section 1.1 Statements, Symbolic Representation, and Tautologies 19 


c. Whenever violets are blue, roses are red and sugar is sweet. 

d. Roses are red only if violets aren’t blue or sugar is sour. 

e. Roses are red and, if sugar is sour, then either violets aren’t blue or sugar is sweet. 
. Let A, B, and C, and D be the following statements: 

A The villain is French. 

B The hero is American. 

C The heroine is British. 

D The movie is good. 

Translate the following compound statements into symbolic notation. 

a. The hero is American and the movie is good. 

b. Athough the villain is French, the movie is good. 

c. If the movie is good, then either the hero is American or the heroine is British. 

d. The hero is not American, but the villain is French. 

e. A British heroine is a necessary condition for the movie to be good. 
. Use A, B, and C as defined in Exercise 15 to translate the following statements into English. 


а. BV С' е. (BAC')' >A 

b. В'\/(А— С) f. AV(BAC') 

c. (С^А') В g. (AVB)AC' 

d. CA (4' < B) 

. Use A, В, and C as defined in Exercise 16 to translate the following statements into English. 
a. Вэ А e. A (BV C) 

b. BACAD' f. D'2(AVCy 

c. B— (CV A) g. (C— D) ^(A— В!) 

а. (4V C)—>B' 


. Using letters H, K, A for the component statements, translate the following compound statements into 
symbolic notation. 


a. If the horse is fresh, then the knight will win. 

b. The knight will win only if the horse is fresh and the armor is strong. 

c. A fresh horse is a necessary condition for the knight to win. 

d. The knight will win if and only if the armor is strong. 

e. A sufficient condition for the knight to win is that the armor is strong or the horse is fresh. 


. Using letters A, T, E for the component statements, translate the following compound statements into 
symbolic notation. 


a. If Anita wins the election, then tax rates will be reduced. 

b. Tax rates will be reduced only if Anita wins the election and the economy remains strong. 

c. Tax rates will be reduced if the economy remains strong. 

d. A strong economy will follow from Anita winning the election. 

e. The economy will remain strong if and only if Anita wins the election or tax rates are reduced. 


. Using letters F, В, S for the component statements, translate the following compound statements into 
symbolic notation. 


a. Plentiful fish are a sufficient condition for bears to be happy. 
b. Bears are happy only if there are plentiful fish. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 
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c. Unhappy bears means that the fish are not plentiful and also that there is heavy snow. 
d. Unhappy bears are a necessary condition for heavy snow. 
e. The snow is heavy if and only if the fish are not plentiful. 


Using letters P, C, B, L for the component statements, translate the following compound statements into 
symbolic notation. 


a. If the project is finished soon, then the client will be happy and the bills will be paid. 

b. If the bills are not paid, then the lights will go out. 

c. The project will be finished soon only if the lights do not go out. 

d. If the bills are not paid and the lights go out, then the client will not be happy. 

e. The bills will be paid if and only if the project is finished soon, or else the lights go out. 

f. The bills will be paid if and only if either the project is finished soon or the lights go out. 
Construct truth tables for the following wffs. Note any tautologies or contradictions. 

а. (A> В) < А' VB а. АЛВА 

b. (A^B)VC—A^A^(BV C) е. (A45 B) КАМ C) —5(B V C)J 
c. AA^(A' V B)' 

Construct truth tables for the following wffs. Note any tautologies or contradictions. 

а. А (В — A4) а. [(AVB)AC] SA VC 
bAABSB'VA' е. A’ > (BV C") 

с. (УВ) Л (А Л В)! 

Verify the equivalences in the list on page 9 by constructing truth tables. (We have already verified 
la, 4b, and 5a.) 


Verify by constructing truth tables that the following wffs are tautologies. Note that the tautologies in parts 
b, e, f, and g produce equivalences such as (4")' = A. 


а. AV A' е. (AVB) < А' ^B' (DeMorgan's law) 
b. (4) А f. (AA В) < А' У В (De Morgan’s law) 
c. АЛВ В g АУА < А 

а. АУ АУВ 


Prove the following tautologies by starting with the left side and finding a series of equivalent wffs that 
will convert the left side into the right side. You may use any of the equivalencies in the list on page 9 or 
the equivalencies from Exercise 26. 


а. (A^AB')ACeS(A^C)AB 

b. (AVB)A(AV В') OA 

с. AV(BAA')OAVB 

Prove the following tautologies by starting with the left side and finding a series of equivalent wffs that 


will convert the left side into the right side. You may use any of the equivalencies in the list on page 9 or 
the equivalencies from Exercise 26. 


a. (^B) VB«SA'VB 

b. A^(AA^B') GA^B 

c. (A^B) ^(AVB) eB 

We mentioned that (А ^ B) V C cannot be proved equivalent to A ^ (B V C) using either of the associative 


tautological equivalences, but perhaps it can be proved some other way. Are these two wffs equivalent? 
Prove or disprove. 


30. 


31. 
32. 


33. 
34. 
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a 


37. 


38. 


39. 
40. 


Section 1.1 Statements, Symbolic Representation, and Tautologies 21 


Let P be the wff А — B. Prove or disprove whether P is equivalent to any of the following related wffs. 
a. the converse of P, B— A 

b. the inverse of P, A' > B' 

c. the contrapositive of P, B' > A' 

Write a logical expression for a Web search engine to find sites pertaining to dogs that are not retrievers. 


Write a logical expression for a Web search engine to find sites pertaining to oil paintings by Van Gogh or 
Rembrandt but not Vermeer. 


Write a logical expression for a Web search engine to find sites pertaining to novels or plays about AIDS. 


Write a logical expression for a Web search engine to find sites pertaining to coastal wetlands in Louisiana 
but not in Alabama. 


. Consider the following pseudocode. 


repeat 
i-1 
read a value for x ID 
if (x < 5.0) and (2x < 10.7)) or (V/5x > 5.1) then 
write the value of x 
end if 
increase i by 1 
until i > 5 


The input values for x are 1.0, 5.1, 2.4, 7.2, and 5.3. What are the output values? 


. Suppose that А, B, and C represent conditions that will be true or false when a certain computer program 


is executed. Suppose further that you want the program to carry out a certain task only when A or B is true 
(but not both) and C is false. Using А, В, and C and the connectives AND, OR, and NOT, write a statement 
that will be true only under these conditions. 


Rewrite the following statement form with a simplified conditional expression, where the function odd(n) 
returns true if n is odd. 


if not((Valuel < Value2) or odd(Number)) 

or (not(Valuel < Value2) and odd(Number)) then 
statement 

else 
statement2 

end if 


You want your program to execute statement 1 when А is false, В is false, and C is true, and to execute 
statement 2 otherwise. You wrote 


if not(4 and B) and C then 
statement 1 

else 
statement 2 

end if 


Does this do what you want? 
Verify that A > В is equivalent to A’ V B. 


a. Using Exercise 39 and other equivalences, prove that the negation of A — В is equivalent to 
AA^B' 


b. Write the negation of the statement "If Sam passed his bar exam, then he will get the job." 
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41. Use algorithm ZautologyTest to prove that the following expressions are tautologies. 


а. [В' ^ (4 B)] > А! 
b. (4 > В) ^A] B 
с. (4V В) ^4 B 


42. Use algorithm TautologyTest to prove that the following expressions аге tautologies. 


a. (AA B)AB' >A 
b. (A^ B") > (4 В) 
c. (AAB)' VB'SA'VB' 


43. A memory chip from a digital camera has 2? bistable (ON-OFF) memory elements. What is the total num- 


44. 


45. 


ber of ON-OFF configurations? 

In each case, construct compound wffs P and Q so that the given statement is a tautology. 
а. РЛО 

b. PoP’ 

c. PA (Q— P") 


From the truth table for А V B, the value of A V B is true 1f A 1s true, 1f B 1s true, or 1f both are true. This 
use of the word “or,” where the result is true if both components are true, is called the inclusive or. It is 
the inclusive or that is understood in the sentence, “Ме may have rain or drizzle tomorrow,” which might 
also be expressed as, “We may have rain or drizzle or both tomorrow.” Another use of the word “ог” in 
the English language is the exclusive or, sometimes written XOR, in which the result is false when both 
components are true. The exclusive or is understood in the sentence, “At the intersection, you should turn 
north or south,” (but obviously not both). Exclusive or is symbolized by A @ B. Write the truth table for 
the exclusive or. 


46. Prove that 4 ® В < (A => B)' is tautology. Explain why this makes sense. 


Exercises 47—50 show that defining four basic logical connectives (conjunction, disjunction, implication, and 
negation) is a convenience rather than a necessity because certain pairs of connectives are enough to express 
any wff. Exercises 51—52 show that a single connective, properly defined, is sufficient. 


47. Every compound statement is equivalent to a statement using only the connectives of conjunction and ne- 


48. 


49. 


50. 


5 


Её 


gation. To see this, we need to find equivalent wffs for 4 V В and for A — B that use only ^ and '. These 
new statements can replace, respectively, any occurrences of A V B and A — B. (The connective <> was 
defined in terms of other connectives, so we already know that it can be replaced by a statement using 
these other connectives.) 

a. Show that A V B is equivalent to (4' ^ B")' 

b. Show that A — В is equivalent to (4 ^ В’) 


Show that every compound wff is equivalent to a wff using only the connectives of V and '. (Hint: See 
Exercise 47.) 


Show that every compound wff is equivalent to a wff using only the connectives of — and '. (Hint: See 
Exercise 47.) 


Prove that there are compound statements that are not equivalent to any statement using only the 
connectives — and V. 


. The binary connective | is called the Sheffer stroke, named for the American logic professor Henry Shef- 


fer, who proved in 1913 that this single connective is the only one needed. The truth table for | is given 
here. Sheffer also coined the term “Boolean algebra," the topic of Chapter 8, where we will see that this 
truth table represents the NAND gate. 
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A | B |А|В 

Түт F Show that every compound wff is equivalent to a wff using only the 
Т|Е Т connective |. (Hint: Use Exercise 47 and find equivalent statements 
FIT T for A ^ B and A’ in terms of |.) 

FFIT 


52. The binary connective | is called the Peirce arrow, named for American philosopher Charles Peirce (not 


39. 


54. 


for the antique automobile). The truth table for | is given here. In Chapter 8 we will see that this truth table 
represents the NOR gate 


Show that every compound statement is equivalent to a statement 
using only the connective 4. (Hint: See Exercise 51.) 


m|imiyHi2M» 
түш |т | 4 | Ф 
т 


Propositional wffs and truth tables belong to a system of two-valued logic because everything has опе of 
two values, False or True. Three-valued logic allows a third value of Null or “unknown” (Section 5.3 dis- 
cusses the implications of three-valued logic on databases). The truth tables for this three-valued system 
follow. 


А | В | Алв А | В | АУВ А 1А 
Е: TS т T 
T|F| F T|F| T F|T 
тім N me т N| N 
FÍT] F prr T 

Е | Е Е Е |Е | F 

ЕМ | F Е|М| N 

N/T] N NIT] T 

м | Е | F МЕ М 

ммм ммм 


а. Viewing N as “unknown”, explain why it is reasonable to define TAN = N, FV N = №, апа 
М = М. 

Suppose the statement, “Flight 237 is on time,” is true, the statement, “Runway conditions аге icy,” is 

false, and the truth value of the statement, “Flight 51 is on time,” is unknown. Find the truth values of the 

following statements. 


b. Runway conditions are not icy and flight 51 is on time. 

c. Flight 51 is on time and flight 237 is not. 

d. Flight 51 is not on time or runway conditions are not icy. 

Propositional wffs and truth tables belong to a system of two-valued logic because everything has one of 
two values, F or T, which we can think of as 0 or 1. In fuzzy logic, or many-valued logic, statement letters 
are assigned values in a range between 0 and 1 to reflect some “probability” to which they are false or 


true. A statement letter with a truth value of 0.9 is “mostly true” or “has a high probability of being true” 
while a statement letter with a truth value of 0.05 “has a very high probability of being false.” Fuzzy logic 


33. 


56. 


57. 


58. 


59. 


60. 
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is used to manage decisions in many imprecise situations such as robotics, manufacturing, or instrument 
control. Truth values for compound statements are determined as follows. 


A' has the truth value 1 — A. 
A ^ B has the truth value that is the minimum of the values of A and of B. 
A V B has the truth value that is the maximum of the values of A and В. 


a. Explain why these are reasonable assignments for the truth values of A’, A ^ B, and A V B. 


Suppose the statement, “Flight 237 is on time,” is estimated to have a truth value of 0.84 and the state- 
ment, *Runway conditions are icy," is estimated to have a truth value of 0.12. Find the truth values of the 
following statements. 


b. Runway conditions are not icy. 
c. Runway conditions are icy and flight 237 is on time. 
d. Runway conditions are icy or flight 237 is not on time. 


In a three-valued logic system as described in Exercise 53, how many rows are needed for a truth table 
with п statement letters? 


In 2003, then U.S. Secretary of Defense Donald Rumsfeld won Britain's Plain English Campaign 2003 
Golden Bull Award for this statement: “Reports that say that something hasn't happened are always inter- 
esting to me, because as we know, there are known knowns, there are things we know we know. We also 
know there are known unknowns; that is to say we know there are some things we do not know. But there 
are also unknown unknowns—the ones we don't know we don’t know." 


What possibility did Secretary Rumsfeld omit? 


Four machines, А, B, C, and D, are connected on a computer network. It is feared that a computer virus 
may have infected the network. Your security team makes the following statements: 


a. If D 1s infected, then so is C. 

b. If C is infected, then so is A. 

c. If D is clean, then В is clean but C is infected. 

d. If A is infected, then either В is infected or C is clean. 

Assuming that these statements are all true, what can you conclude? Explain your reasoning. 


The Dillies have five teenaged children, two boys named Ollie and Rollie, and three girls named Mellie, 
Nellie, and Pollie. Each is a different number of years old, from 13 to 17. There are three bedrooms for the 
children in the Dillie house, so two share the yellow room, two share the white room, and one alone has 
the smaller green room. Can you match each one's name and age, and tell who sleeps where? 


a. No one shares a room with a sibling of the opposite sex. 

b. Pollie is exactly one year older than Mellie. 

c. The two teenagers who share the yellow room are two years apart in age. 

d. The two who share the white room are three years apart in age. 

e. Rollie is somewhat older than Ollie but somewhat younger than the sibling who has the green room. ? 
Determine who sleeps in each room and what their ages are. Explain your reasoning. 


An advertisement for a restaurant at an exclusive club in Honolulu says, “Members and nonmembers 
only." Give two possible interpretations of this statement. 


The following newspaper headline was printed during a murder trial: 
“Tam a liar" says murder defendant! 
Can the jury reach any conclusion from this statement? 


Scott Marley, Dell Logic Puzzles, April, 1998 
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In Exercises 61—64, you are traveling in a certain country where every inhabitant is either a truthteller who 
always tells the truth or a liar who always lies.? 


61. 


62. 


63. 


64. 


You meet two of the inhabitants of this country, Percival and Llewellyn. Percival says, “At least one of us 
is a liar.” Is Percival a liar or a truth teller? What about Llewellyn? Explain your answer. 

Traveling on, you meet Merlin and Meredith. Merlin says, “If I am a truth teller, then Meredith is a truth 
teller.” Is Merlin a liar or a truth teller? What about Meredith? Explain your answer. 

Next, you meet Rothwold and Grymlin. Rothwold says, “Either I am a liar or Grymlin is a truth teller.” Is 
Rothwold a liar or a truth teller? What about Grymlin? Explain your answer. 

Finally, you meet Gwendolyn and Merrilaine. Gwendolin says, “I am a liar but Merrilaine is not.” Is 
Gwendolyn a liar or a truth teller? What about Merrilaine? 


SECTION 1.2 


PROPOSITIONAL LOGIC 


The argument of the defense attorney at the beginning of this chapter made a 
number of (supposedly true) statements and then asked the jury to draw a specific 
conclusion based on those statements. In Section 1.1, we used the notation of for- 
mal logic to represent statements in symbolic form as wffs; because statements 
are sometimes called propositions, these wffs are also called propositional wffs. 
Now we want to use tools from formal logic to see how to reach logical conclu- 
sions based on given statements. The formal system that uses propositional wffs 
is called propositional logic, statement logic, or propositional calculus. (The 
word calculus is used here in the more general sense of “calculation” or “reason- 
ing,” not “differentiating” or "integrating.") 


Valid Arguments 


An argument can be represented in symbolic form as 
Р, ЛР, ЛР Л... ЛР, э О 


where Р, Р», ... , P, are the given statements, called the hypotheses, of the argu- 
ment, and О is the conclusion of the argument. As usual, the P’s and the О rep- 
resent wffs, not merely statement letters. When should this be considered a valid 
argument? This question can be stated in several equivalent ways: 


* When can О be logically deduced from P, ... , P? 
e When is О a logical conclusion from P,, ... , P,? 
* When does P}, ..., P, logically imply О? 

* When does О follow logically from P,, ... , P,? 


and so forth. 


?For more puzzles about “knights” and “knaves,” see What Is the Name of This Book? by the logician—and magician—Raymond Smullyan 


(Prentice-Hall, 1978). 
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An informal answer is that О is a logical conclusion from P}, ... , Р, whenever 
the truth of P,, ... , P, implies the truth of О. In other words, when the implication 


Р, AP, ЛР, Л... ЛР, > О 


is true. (Of course, this implication is true if any of the hypotheses is false, but 
in an argument we usually care about what happens when all the hypotheses are 
true.) Furthermore, this implication should be true based on the relationship of the 
conclusion to the hypotheses, not on any incidental knowledge we may happen to 
have about О. 


| EXANPLEÐ | Consider the following argument: 


George Washington was the first president of the United States. Thomas 
Jefferson wrote the Declaration of Independence. Therefore, every day has 
24 hours. 


This argument has the two hypotheses 


1. George Washington was the first president of the United States. 
2. Thomas Jefferson wrote the Declaration of Independence. 


and the conclusion 
Every day has 24 hours. 


Even though each of the individual hypotheses, as well as the conclusion, is a true 
statement, we would not consider this argument valid. The conclusion is merely an 
isolated true fact, not at all related to or “following from” the hypotheses. ө 


A valid argument should therefore be true based entirely on its internal struc- 
ture; it should be “intrinsically true.” Therefore we make the following formal 
definition. 


@ DEFINITION VALID ARGUMENT 
The propositional wff 
Р, ЛР, ЛР Л... ЛР, э О 


is a valid argument when it is a tautology. 


The argument in Example 9 would be symbolized as 
A^B—C 


which is clearly not a tautology. 


Consider the following argument. If George Washington was the first president of 
the United States, then John Adams was the first vice president. George Washing- 
ton was the first president of the United States. Therefore John Adams was the first 
vice president. 


* DEFINITION 
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This argument has the two hypotheses: 


1. If George Washington was the first president of the United States, then 
John Adams was the first vice president. 
2. George Washington was the first president of the United States. 


and the conclusion 
John Adams was the first vice president. 


A symbolic representation of this argument has the form 
(A> B)AA->B 


A truth table or algorithm 7autologyTest establishes that this argument is а tautol- 
ogy. The argument is valid; its form is such that the conclusion follows inevitably 
from the hypotheses. In fact, this form of argument, known by its Latin name of 
modus ponens (“method of assertion"), is one of the rules of reasoning we will use 
to build propositional logic. ө 


To test whether a wff P, A Р, A Р; Л... ЛР, О is a tautology, we could 
build a truth table or use algorithm ZautologyTest. Instead, we will turn to formal 
logic, which uses a system of derivation rules that manipulate wffs in a truth- 
preserving manner. You begin with the hypotheses P}, ... , Р, (assumed true) and 
attempt to apply the manipulation rules in such a way as to end up with the conclu- 
sion Q (which must then also be true because truth is preserved under the rules). 


PROOF SEQUENCE 

A proof sequence is a sequence of wffs in which each wff is either a hypothesis 
or the result of applying one of the formal system's derivation rules to earlier 
wffs in the sequence. 


Using formal logic to prove that О is a valid conclusion from P}, ... , P,, we 
must produce a proof sequence of the form 


Р, (hypothesis) 
Р, (hypothesis) 


P, (hypothesis) 
wff, | (obtained by applying a derivation rule to earlier wffs) 
wff, | (obtained by applying a derivation rule to earlier wffs) 


Q (obtained by applying a derivation rule to earlier wffs) 


The derivation rules for a formal system must be carefully chosen. If they 
are too powerful, then they won't be truth preserving and we'll be able to deduce 
anything at all from a given set of hypotheses. If they are too weak, there will 
be logical conclusions that we won't be able to prove from given hypotheses. We 
want a formal logic system that is correct (only valid arguments should be prov- 
able) and complete (every valid argument should be provable). In addition, the 
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derivation rules should be kept to a minimum in order to make the formal system 
manageable. We would like the system to have the smallest set of rules that still 
allows it to be complete. 


Derivation Rules for Propositional Logic 


The derivation rules for propositional logic fall into two categories, equivalence 
rules and inference rules. Equivalence rules allow individual wffs to be rewrit- 
ten, while inference rules allow new wffs to be derived from previous wffs in the 
proof sequence. 

Equivalence rules state that certain pairs of wffs R and Sare equivalent. 
Remember from Section 1.1 that R <> S means that А < Sis a tautology and 
that S can be substituted for R in any wff with no change to the truth value of that 
wff. Equivalence rules are therefore truth-preserving; a true wff remains true if 
such a substitution is done within it. 

Table 1.11 lists the equivalence rules we will use in our formal system for 
propositional logic. (Additional rules could be formulated based on other tautolo- 
gies, but we are trying to keep our rule set to a minimum.) Each is given a name 
to make it easier to identify its use in a proof sequence. We saw the commutative 
and associative rules, as well as De Morgan’s laws, in Section 1.1. There they were 
given for statement letters only, here they are given for any wffs P, Q, R, but they 
are still tautologies. 


TABLE 1.11 
Equivalence Rules 
Expression Equivalent to Name/Abbreviation for Rule 
PVQ QVP Commutative —comm 
PAQ QAP 
(PVQ)VR PV (QV R) Associative — ass 
(PAQ)AR PA(QAR) 
(P v Q)' P'AQ' De Morgan's Laws—De 
(P ^ Q)' P'VvQ' Morgan 
P>Q P'VQ Implication —imp 
P (P^) Double negation — dn 
PQ (P — О) л (О P) Definition of equivalence — equ 


PRACTICE 9 | Prove the implication rule. 


That is, prove that 
(pO) cxt 0) 


is a tautology. и 


| EXAMPLE 11 | Suppose that one hypothesis of a propositional argument can be symbolized as 


(А'М/ B)V C 
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Then a proof sequence for the argument could begin with the following steps: 
1. (4 МВ') У С hyp (hypothesis) 
2. (A^B) VC 1, De Morgan 
3. (AAB)>C  2,imp 


The justification given for each step is not a required part of the proof sequence, 
but it does confirm that the step is a legitimate one. Step 1 is a hypothesis. Step 2 is 
derived from step | by applying one of De Morgan’s Laws. Step 3 is derived from 
step 2 by using the implication rule that P — О is equivalent to Р’ V О, where P is 
the wff A ^ B, and Q is the wff C. ө 


The equivalence rules allow substitution in either direction. That is, in Ex- 
ample 11 we replaced A’ V B’ with (А Л BY, but in some other proof sequence, 
using the same rule, we might replace (4 ^ В)! with A’ V B". 

Inference rules say that if one or more wffs that match the first part of the 
rule pattern are already part of the proof sequence, we can add to the proof se- 
quence a new wff that matches the last part of the rule pattern. Table 1.12 shows 
the propositional inference rules we will use, again along with their identifying 
names. 


TABLE 1.12 
Inference Rules 
From Can Derive Name/Abbreviation for Rule 
P,P>Q Q Modus ponens—mp 
P5Q,Q' P' Modus tollens — mt 
P,Q PAQ Conjunction—con 
PAQ P,Q Simplification—sim 
Р PVQ Addition—add 


Unlike equivalence rules, inference rules do not work in both directions. We 
cannot “reverse” the addition rule in Table 1.12; from P V О, we cannot infer 
either P or Q. 


Suppose that 4 — (B ^ С) and A are two hypotheses of an argument. A proof se- 
quence for the argument could begin with the following steps: 

1. A—(B^C) hyp 

2. A hyp 

3h. ANIC 1, 2, mp 
The justification at step 3 is that steps 1 and 2 exactly match the pattern required 


for modus ponens, where P is А and О is B ^ C. Modus ponens says that О can be 
derived from P and P — Q. e 


| 30 Formal Logic 


PRACTICE 10 | Give a next step and a justification for a proof sequence that begins 


1. (AAB')>C hyp 
2 Cn hyp н 


The inference rules are also truth-preserving. For example, suppose that Р 
and P — О are both true wffs in a proof sequence. Then О is deducible from these 
two wffs by modus ponens. If P and Р — О are both true, then—by the truth table 
for implication—(Q is also true. 

The derivation rules, like the tautological equivalencies of Section 1.1, repre- 
sent recipes or patterns for transforming wffs. A rule can be applied only when the 
wffs exactly match the pattern. 


REMINDER 


To use a derivation rule, 
wffs must exactly match 
the rule pattern. 


Suppose that (4 > B) V C and A are two hypotheses of an argument. A proof 
sequence for the argument could begin with the following steps: 


1. (A>B)VC_ hyp 

2. A hyp 
Unlike Example 12, however, nothing further can be done. Modus ponens requires 
the presence of wffs matching the pattern P and P — О. In P — Q, the main con- 


nective is an implication. The wff (4 — В) V C has disjunction, not implication, as 
its main connective. Modus ponens does not apply, nor does anything else. e 


Now we are ready to work our way through a complete proof of an argument. 


| EXAMPLE 14 | Using propositional logic, prove that the argument 


ANB CMG BD VC) АЎ => 


is valid. 

We must produce a proof sequence that begins with the hypotheses and ends 
with the conclusion. There are four hypotheses, so this gives us lots of *ammuni- 
tion” to use in the proof. The beginning of the proof is easy enough because it just 
involves listing the hypotheses: 


1. A hyp 
2. ВЭ С һур 
3. (АЛВ)—(ОЛС') hyp 
4. B hyp 


Our final goal is to arrive at D, the conclusion. But without even looking ahead, 
there are a couple of fairly obvious steps we can take that may or may not be helpful. 
S. (С 2, 4, mp 
6. AAB 1, 4, con 
7. DVC' 3,6, mp 


At least at this point we have introduced D, but it’s not by itself. Note that from 
step 5 we have C, which we haven't made use of. If only we had C > D, we'd be 


PRACTICE 11 


Section 1.2 Propositional Logic 31 


home free. Ah, look at the form of step 7; it’s a disjunction, and the implication rule 
says that we can transform a disjunction of a certain form into an implication. The 
disjunction must have a negated wff on the left. We can do that: 


8. СУЮ 7,comm 
9. CD 8,imp 


50 


10. D 5,9, mp e 


As in Example 14, proof sequences involve a certain amount of rewriting just 
because you can and a certain amount of keeping an eye on the desired goal and 
what it would take to get there. Although not as mechanical as constructing a truth 
table, the strict rules of the game nevertheless provide a more or less mechanical 
way to construct the proof sequence. There are only a certain number of legitimate 
things that can be done at any one point in the sequence. If one choice seems to 
lead down a blind alley, go back and make another. Also, there may be more than 
one correct proof sequence; as a relatively trivial instance, steps 6 and 7 could 
have been done before step 5 in Example 14. 

An analogy with programming, if not taken too literally, may be helpful. In 
traditional programming, you have known input, a desired output, and you write 
code to transform the given input into the desired output. You figure out the se- 
quence of statements that will accomplish this transformation, and each program 
statement in that sequence must conform to the exact syntax rules ofthe program- 
ming language you are using, be it C++, Java, Python, or whatever. In proposi- 
tional logic, you have known “input” (the hypotheses), a desired “output” (the 
conclusion), and you write “code statements" (a sequence of wffs) to transform 
the hypotheses into the conclusion. The sequence of code statements, or at least 
their justification, must conform to the exact syntax of the derivation rules for 
propositional logic. 


Using propositional logic, prove the validity of the argument. 


[4 V. 8^) ОС р) 2656 и 


ТАВІЕ 1.13 


Derivation Hints 


1. Modus ponens is probably the most intuitive inference rule. Think often about 
trying to use it. 


2. Wffs of the form (P ^ Q)' or (P V Q)' are seldom helpful in a proof sequence. Try 
using De Morgan's laws to convert them into P' V Q' and P' ^ Q', respectively, 
which breaks out the individual components. 


3. Wffs of the form P V Q are also seldom helpful in a proof sequence because 
they do not imply either P or Q. Try using double negation to convert P V Q to 
(P’)’ V О, and then using implication to convert to Р’ > О. 
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Deduction Method and Other Rules 


Suppose the argument we seek to prove has the form 


REMINDER 


Use the deduction method 
APINN Pa N iea 
when the conclusion of Pi ^P, ^P; ^ P, (R S) 


what you want to prove is 


animplication: where the conclusion is itself an implication. Instead of using P}, ... , P, as the hy- 


potheses and deriving R — S, the deduction method lets us add R as an additional 
hypothesis and then derive S. In other words, we can instead prove 


Р ЛАР, ЛР; Л... ЛР, ЛА $ 
This change is to our advantage because it gives us one more hypothesis, 1.е., 
additional ammunition for the proof, and it simplifies the desired conclusion. 


The deduction method approach agrees with our understanding of implica- 
tion, but Exercise 55 at the end of this section provides a formal justification. 


| EXAMPLE 15 | Use propositional logic to prove 


[A (4 > B)] ^ (4 > B) 


Using the deduction method, we get two hypotheses instead of one, and we want 


to derive В. 
l.A— (AB) hyp 
2. А һур 
3. А—В 1, 2, mp 
4. B 2,3,mp e 


PRACTICE 12 | Use propositional logic to prove 
(A> B) ^(B— C) э (A C) m 


The formal system we have described is correct and complete. Every argu- 
ment we can prove is a tautology (the system is correct), and every implication that 
is a tautology is provable (the system is complete). We can easily argue for correct- 
ness because each of the derivation rules is truth-preserving. Completeness would 
be more difficult to prove, and we will not do so. 

Correctness and completeness say that the set of derivation rules we have 
used is exactly right—not too strong, not too weak. Nonetheless, many formal 
systems for propositional logic use additional truth-preserving inference rules. 
We can prove these additional rules using our original rule set. Once such a 
rule is proved, it can be used as justification in a proof sequence because, if 
required, the single step invoking this rule could be replaced with the proof 
sequence for the rule. Nothing more can be proved by the addition of these 
rules, but the proof sequences might be shorter. (See Exercises 1.2 for a list of 
additional rules.) 
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| EXAMPLE 16 | The rule of hypothetical syllogism (hs) is 


From P > О and О > R, one can derive P > R. 


This rule is making the claim that 
(P>O)A(Q>R) > (Р К) 


is a valid argument. The proof sequence for this argument looks just like that for 
Practice 12. Because it is a legitimate derivation rule, hypothetical syllogism can 
be used to justify a step in a proof sequence. e 


| EXAMPLE 17 | Use propositional logic to prove 


(d'V B)^(B— С) > (A C) 
The following proof sequence will do. 
1. A VB hyp 
2. Вэ С hyp 


3. 45 B 1,imp 
4. А— C 2,3,hs 


Without use of the new rule, we could still have produced a proof sequence by 
essentially proving the new rule as part of this proof: 


1. A VB hyp 

2. B— C hyp 

3. A45 B  l,imp 

4. A hyp 

5). B 3,4, mp 

6, (C 2,5, mp 
Additional rules thus can shorten proof sequences but at the expense of having to 
remember additional rules! ө 

PRACTICE 13 | Prove 
(A>B)A(C'VA)AC>B m 


Verbal Arguments 


An argument in English (an attorney’s trial summary, an advertisement, or a 
political speech) that consists of simple statements can be tested for validity by a 
two-step process: 


1. Symbolize the argument using propositional wffs. 
2. Prove that the argument is valid by constructing a proof sequence for it 
using the derivation rules for propositional logic. 
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The first step, translating the argument from verbal to symbolic form, is often 
the most difficult. Look for keys in the verbal representation of the argument. 
“If... then" and “either... or" indicate implication and disjunction, respectively. A 
period (or sometimes a semicolon) signifies the end of a hypothesis. The separate 
hypotheses are joined by conjunctions. “Therefore” is a big key word; it indicates 
the end of the hypotheses and announces that the conclusion is about to be stated. 


ЭҮ Consider the argument, “If interest rates drop, the housing market will improve. 
Either the federal discount rate will drop or the housing market will not improve. 
Interest rates will drop. Therefore the federal discount rate will drop." Using 


I Interest rates drop. 
H The housing market will improve. 
Е Тһе federal discount rate will drop. 


the argument is 
I> H)^(FVH)^IF 


A proof sequence to establish validity is 


1l.]—H hyp 
2.FVH' hyp 

3, di hyp 

4. H VF 2, сотт 
5. H>F 4,imp 
6. IF 1, 5, 
1 JE 3, 6, mp 


Is the following argument valid? “My client is left-handed, but if the diary is not 
missing, then my client is not left-handed; therefore, the diary is missing.” There 
are only two simple statements involved here, so we symbolize them as follows: 
L My client is left-handed. 
D The diary is missing. 


The argument is then 
LA^(D'—L)—D 
The validity of the argument is established by the following proof sequence. 


IL. JU hyp 

2. D >L’ hyp 

3, UDI Wil? 2, ears 
4. DVL’ 3, dn 

Se JL WD 4, comm 
6. L>D 5, imp 
TED 1, 6, mp 
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The argument says that ifthe hypotheses are true, then the conclusion will be true. 
The validity of the argument is a function only of its logical form and has nothing 
to do with the actual truth of any of its components. We still have no idea about 
whether the diary is really missing. Furthermore, the argument “Skooses are pink, 
but if Gingoos does not like perskees, then skooses are not pink; therefore Gingoos 
does like perskees,” which has the same logical form, is also valid, even though it 
does not make sense. ө 


PRACTICE 14 | Use propositional logic to prove that the following argument is valid. Use statement letters 


S, R, and В. 


If security is a problem, then regulation will be increased. If security is not a problem, then business on the Web 
will grow. Therefore if regulation is not increased, then business on the Web will grow. н 


Formal logic is not necessary to prove the validity of propositional arguments. A 
valid argument is represented by a tautology, and truth tables provide a mechanical 
test for whether a wff is a tautology. So, what was the point of all of this? In the next 
section we will see that propositional wffs are not sufficient to represent everything 
we would like to say, and we will devise new wffs called predicate wffs. There is no 
mechanical test for the predicate wff analogue of tautology, and in the absence of such 
a test, we will have to rely on formal logic to justify arguments. We have developed 
formal logic for propositional arguments as a sort of dry run for the predicate case. 

In addition, the sort of reasoning we have used in propositional logic carries 
over into everyday life. It is the foundation for logical thinking in computer science, 
mathematics, the courtroom, the marketplace, and the laboratory. Although we have 
approached logic as a mechanical system of applying rules, enough practice should 
ingrain this way of thinking so that you no longer need to consult tables of rules, but 
can draw logical conclusions and recognize invalid arguments on your own. 


SECTION 1.2 REVIEW 


TECHNIQUES 


W) Apply derivation rules for propositional logic. 
W Use propositional logic to prove the validity of a 


verbal argument. 


MAIN IDEAS 


* A proof sequence in a formal logic system is a se- 
quence of wffs that are either hypotheses or derived 
from earlier wffs in the sequence by the derivation 
tules of the system. 

* The propositional logic system is complete and 
correct; valid arguments and only valid arguments 
are provable. 


* A valid argument can be represented by a м of 
the form P, ЛР, ЛР, Л... ЛР, > О that is a 


tautology. 


EXERCISES 1.2 


For Exercises 1—4, what inference rule is illustrated by the argument given? 


1. If Martina is the author, then the book is fiction. But the book is nonfiction. Therefore Martina 1s not the 


author. 


2. If the business declares bankruptcy, then all assets must be confiscated. The business declared bankruptcy. 
It follows that all assets must be confiscated. 
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3. The dog has a shiny coat and loves to bark. Consequently, the dog loves to bark. 


4. If Paul is a good swimmer, then he is a good runner. If Paul is a good runner, then he is a good biker. 


Therefore if Paul is a good swimmer, then he is a good biker. 


For Exercises 5—8, decide what conclusion, if any, can be reached from the given hypotheses and justify your 


answer. 


5. If the car was involved in the hit-and-run, then the paint would be chipped. But the paint is not chipped. 


6. Either the weather will turn bad or we will leave on time. If the weather turns bad, then the flight will be 


canceled. 


7. If the bill was sent today, then you will be paid tomorrow. You will be paid tomorrow. 


8. The grass needs mowing and the trees need trimming. If the grass needs mowing, then we need to rake the 


leaves. 


9. Justify each step in the proof sequence of 


A^(B— С)— (В — (АЛ С)) 


1. А 4. С 
2.B>C 5.АЛС 
3. B 


10. Justify each step in the proof sequence of 


B^[B^C)2A']^(B— C) 4! 


1. B 4. C 
2.(B^C)—A' 5. BAC 
3. ВЭС 6. A' 
11. Justify each step in the proof sequence of 
[A— (DVC)^AB'^AC'—A' 
1. A—5 (BV C) 4. В' ЛС 
2. В 5. (ВУС) 
3. C" 6. A’ 


12. Justify each step in the proof sequence of 


A'ABA^A[B2 (А\ С)] > C 


1. A’ 5. (Л) VC 
2. В 6. A4! э C 
3. B— (AVC) 7. C 

4. AV C 


In Exercises 13—24, use propositional logic to prove that the argument is valid. 


13. (4d V B'Y A(B— C) э (4' ^C) 
14. 4 ^(B— А) эВ 

15.(4—5 B) ^[A—5 (B > C)] > (4 C) 
16.[(C—5 D) > C] > [(C D) > DJ] 
17. A ^(AVB)—B 
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18. [4 > (BSC) ^(A V D)^A B—5 (D> С) 
19.(4' -B') AB^(A5 С) э С 


20. (А > B) A [B > (C> DJ A [А > (В > C) > (4 >D) 


21. [4 > (B 5 C)] > [B > (4 > C)] 


22. (4 ^ B) O (A — B 


23.04 5 CYAQC BY AG esa 
24. [A> (BV C] A C' 2 (4 B) 


Use propositional logic to prove the validity of the arguments in Exercises 25—33. These will become additional 


derivation rules for propositional logic, summarized in Table 1.14. 


25.(PV Q) ЛР 5 Q 
26. (P > Q) э (Q' > P) 
27.(Q' > Р') (P0) 
28.P>P/P 


29. P V P — P (Hint: Instead of assuming the hypothesis, begin with a version of Exercise 28; also make use 


of Exercise 27.) 


30. (P A О) > R] > [P > (О  R)] 


31. РЛР' >Q 


32. PA (Q V R) — (P A О) V (PA В) (Hint: First rewrite the conclusion.) 


33.PV(OAR)> (РУО) (РУ К) (Hint: Prove both РУ(ОЛА) 5 (РУО) and РУ (ОЛА) 


(P V R); for each proof, first rewrite the conclusion.) 


TABLE 1.14 

More Inference Rules 
From Can Derive Name/Abbreviation for Rule 
P5Q,QoR |P —>R [Example 16] Hypothetical syllogism—hs 
PVQ,P' Q [Exercise 25] Disjunctive syllogism — ds 
P+Q Q' — P’ [Exercise 26] Contraposition — cont 
Q'—P' P — Q [Exercise 27] Contraposition — cont 
P P ^ P [Exercise 28] Self-reference — self 
PVP P [Exercise 29] Self-reference — self 
(PAQ)>R Р — (О — В) [Exercise 30] Exportation—exp 
PP Q [Exercise 31] Inconsistency — inc 
P ^(QV HR) (P ^ Q) V (P ^ В) [Exercise 32] Distributive — dist 
РУ (QAR) (P V Q) ^ (P V R) [Exercise 33] Distributive — dist 


For Exercises 34—42, use propositional logic to prove the arguments valid; you may use any ofthe rules in Table 


1.14 or any previously proved exercise. 


34. A' > (A 5 В) 


35 
36 
37 
38 
39 
40 
41 
42 
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РОР кара 

.(4' 2 B')^(A5 С) Э (BC) 

.(4' —9B)^(B— C)^(C D) 5 (4' > D) 
.(4VB)^(A—5 С) (Вэ С) э С 
.(Y»&Z)A^AQC 25 Y)^[Y5 (х W)|^(Y5 2) —5(Y №) 
.(A^B)A^A(B—A') э (Сл В”) 

. (4 ABY A(C' ^A) A(CAB'Y A' 

‚(РУ (ОЛ Е)) Л R V 8) Л (5 6 Т)—(Т— Р) 


In Exercises 43—54, write the argument using propositional wffs (use the statement letters shown). Then, using 
propositional logic, including the rules in Table 1.14, prove that the argument is valid. 


43 


44 


45. 


46. 


4T. 


48. 


49. 


50. 


5 


— 


52. 


935 


54. 


29; 


56. 


. If the program is efficient, it executes quickly. Either the program is efficient, or it has a bug. However, the 
program does not execute quickly. Therefore, it has a bug. E, O, B 


. If Jane is more popular, then she will be elected. If Jane is more popular, then Craig will resign. Therefore, 
if Jane is more popular, she will be elected and Craig will resign. J, Е, С 


If chicken is on the menu, then don’t order fish, but you should have either fish or salad. So, if chicken is 
on the menu, have salad. C, F, S 


The crop is good, but there is not enough water. If there is a lot of rain or not a lot of sun, then there is 
enough water. Therefore, the crop is good and there is a lot of sun. C, W, R, S 

If the ad is successful, then the sales volume will go up. Either the ad is successful or the store will close. 
The sales volume will not go up. Therefore, the store will close. А, 5, С 

If DeWayne is not tall then Jayden is not DeWayne's brother. If DeWayne is tall then Trevor is DeWayne's 
brother. Therefore, if Jayden is DeWayne's brother, then Trevor is DeWayne's brother. D, J, T 


Russia was a superior power, and either France was not strong or Napoleon made an error. Napoleon did 
not make an error, but if the army did not fail, then France was strong. Hence, the army failed and Russia 
was a superior power. R, F, N, A 


Itis not the case that if electric rates go up, then usage will go down, nor is it true that either new power plants 
will be built or bills will not be late. Therefore, usage will not go down and bills will be late. R, U, P, B 


. If Jose took the jewelry or Mrs. Krasov lied, then a crime was committed. Mr. Krasov was not in town. If 
a crime was committed, then Mr. Krasov was in town. Therefore, Jose did not take the jewelry. J, L, C, T 


If the birds are flying south and the leaves are turning, then it must be fall. Fall brings cold weather. The 
leaves are turning but the weather is not cold. Therefore, the birds are not flying south. B, L, F, C 


If a Democrat is elected then taxes will go up. Either a Democrat will be elected or the bill will pass. 
Therefore, if taxes do not go up, then the bill will pass. D, T, B 


Either Emily was not home or if Pat did not leave the tomatoes, then Sophie was ill. Also, if Emily was not 
home, then Olivia left the peppers. But it is not true that either Sophie was ill or Olivia left the peppers. 
Therefore, Pat left the tomatoes and Olivia did not leave the peppers. E, P, $, O 


a. Usea truth table to verify that А > (B > C) <> (A^ B) > C is a tautology. 
b. Prove that А > (B — C) = (А ^ B) > C by using a series of equivalences. 


c. Explain how this equivalence justifies the deduction method that says: to prove 
Р, ЛР, Л... A P,—(R- S), deduce S from P,, P5, ... , P,, and А. 


The argument of the defense attorney at the beginning of this chapter was 


If my client is guilty, then the knife was in the drawer. Either the knife was not in the drawer or Jason 
Pritchard saw the knife. If the knife was not there on October 10, it follows that Jason Pritchard 
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didn’t see the knife. Furthermore, if the knife was there on October 10, then the knife was in the 
drawer and also the hammer was in the barn. But we all know that the hammer was not in the barn. 
Therefore, ladies and gentlemen of the jury, my client is innocent. 


Use propositional logic to prove that this is a valid argument. 


SECTION 1.3 | QUANTIFIERS, PREDICATES, AND VALIDITY 


Quantifiers and Predicates 


Propositional wffs have rather limited expressive power. For example, we would 
consider the sentence “For every x, x > 0” to be a true statement about the 
positive integers, yet it cannot be adequately symbolized using only statement 
letters, parentheses, and logical connectives. It contains two new features, a 
quantifier and a predicate. Quantifiers are phrases such as “for every” or “for 
each” or “for some” that tell in some sense how many objects have a certain 
property. The universal quantifier is symbolized by an upside down A, V, and 
is read “for all,” “for every,” “for each,” or “for any.” Thus the example sentence 
can be symbolized by 


(Vxy(x > 0) 


A quantifier and its named variable are always placed in parentheses. The second 
set of parentheses shows that the quantifier acts on the enclosed expression, which 
in this case is “x > 0.” 

The phrase “x > 0” describes a property of the variable x, that of being posi- 
tive. A property is also called a predicate; the notation P(x) is used to represent 
some unspecified predicate or property that x may have. Thus, our original sen- 
tence is an example of the more general form 


(Vx) P(x) 


The truth value of the expression (Vx)(x > 0) depends on the domain of ob- 
jects in which we are "interpreting" this expression, that is, the collection of ob- 
jects from which x may be chosen. This collection of objects is called the domain 
of interpretation. We have already agreed that if the domain of interpretation con- 
sists of the positive integers, the expression has the truth value true because every 
possible value for x has the required property of being greater than zero. If the 
domain of interpretation consists of all the integers, the expression has the truth 
value false, because not every x has the required property. We impose the condi- 
tion that the domain of interpretation contain at least one object so that we are not 
talking about a trivial case. 

An interpretation of the expression (Vx)P(x) would consist of not only the col- 
lection of objects from which x could take its value but also the particular property 
that P(x) represents in this domain. Thus an interpretation for (Vx)P(x) could be 
the following: The domain consists of all the books in your local library, and P(x) 
is the property that x has a red cover. In this interpretation, (Vx)P(x) says that ev- 
ery book in your local library has a red cover. The truth value of this expression, 
in this interpretation, is undoubtedly false. 
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PRACTICE 15 | What is the truth value of the expression (Vx)P(x) in each of the following interpretations? 


a. P(x) is the property that x is yellow, and the domain of interpretation is the collection of 
all daffodils. 

b. P(x) is the property that x is yellow, and the domain of interpretation is the collection of all flowers. 

c. P(x) is the property that x is a plant, and the domain of interpretation is the collection of all 


flowers. 
d. P(x) is the property that x is either positive or negative, and the domain of interpretation consists 
of the integers. о 
The existential quantifier is symbolized by a backward Е, Э, апа is read 
“there exists one,” “for at least one,” or “ог some.” Thus the expression 
(3x)(x > 0) 
is read “there exists an x such that x is greater than zero.” 
утере Again, the truth value of this expression depends on the interpretation. If the 


domain of interpretation contains a positive number, the expression has the value 
all, every, each, any-use Y — true; otherwise, it has the value false. The truth value of (Эх)Р(х), if the domain 
Soma Cne, E leestene= consists of all the books in your local library and Р(х) is the property that x has а 
Dore red cover, is true if there is at least one book in the library with a red cover. 


PRACTICE 16 


a. Construct an interpretation (1.е., give the domain and the meaning of P(x)) in which (Vx)P(x) 
has the value true. 

b. Construct an interpretation in which (Vx)P(x) has the value false. 

c. Can you find one interpretation in which both (Vx)P(x) is true and (3x)P(x) is false? 

d. Can you find one interpretation in which both (Vx)P(x) is false and (3x)P(x) is true? и 


The predicates we have seen so far, involving properties of a single variable, 
are unary predicates. Predicates can be binary, involving properties of two vari- 
ables, ternary, involving properties of three variables, or, more generally, n-ary, 
involving properties of n variables. 


The expression (Vx)(3y)O(x, y) is read “for every x there exists а y such that 
Q(x, y).” Note that there are two quantifiers for the two variables of the binary 
property. In the interpretation where the domain consists ofthe integers and Q(x, y) 
is the property that x « y, this just says that for any integer, there is a larger integer. 
The truth value of the expression is true. In the same interpretation, the expression 
(Ay)(Vx)O(x, y) says that there is a single integer y that is larger than any integer x. 
The truth value here is false. e 


Example 20 illustrates that the order in which the quantifiers appear is 
important. 


© DEFINITION 
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In expressions such as (Vx)P(x) or (3x)P(x), x is a dummy variable; that is, the 
truth values of the expressions remain the same in a given interpretation if they 
are written, say, as (Vy)P(y) or (3z)P(z), respectively. Similarly, the truth value 
of (Vx)(3y)O(x, y) is the same as that of (Vz)(3w)QO(z, w) in any interpretation. 
However, (Vx)(Ax)O(x, x) says something quite different. In the interpretation of 
Example 20, for instance, (Vx)(3x)Q(x, x) says that for every integer x, there is an 
integer x such that x < x. This statement is false, even though (Vx)(3y)Q(x, y) was 
true in this interpretation. We cannot collapse separate variables together into one 
without changing the nature of the expression we obtain. 

Constants are also allowed in expressions. A constant symbol (a, b, c, 0, 1, 2, 
etc.) is interpreted as some specific object in the domain. This specification 15 part 
of the interpretation. For example, the expression (Vx)Q(x, а) is false in the inter- 
pretation where the domain consists of the integers, Q(x, y) is the property x < y, 
and a is assigned the value 7; it is not the case that every integer is less than 7. 

Now we can sum up what is required in an interpretation. 


INTERPRETATION 
An interpretation for an expression involving predicates consists of the following: 


a. A collection of objects, called the domain of the interpretation, which must 
include at least one object 


b. An assignment of a property of the objects in the domain to each predicate in 
the expression 


c. An assignment of a particular object in the domain to each constant symbol in 
the expression 


Expressions can be built by combining predicates with quantifiers, grouping 
symbols (parentheses or brackets), and the logical connectives of Section 1.1. As 
before, an expression must obey rules of syntax to be considered a well-formed 
formula. Well-formed formulas containing predicates and quantifiers are called 
predicate wffs to distinguish them from propositional wffs, which contain only 
statement letters and logical connectives. 

The expression P(x)(Vx) ^)3y is not a well-formed formula. Examples of 
predicate wffs are 


Р(х) V Q(y) (1) 
(Мх)[Р(х) > О(х)] (2) 
(Ух)((Чу)[Р@, y) ^ О(х, y)] > R) (3) 
and 
EVSA) V (VVT) (4) 


“Grouping symbols” such as parentheses and brackets identify the scope of a 
quantifier, the section of the wff to which the quantifier applies. (This is analo- 
gous to the scope of an identifier in a computer program as the section of the 
program in which that identifier has meaning.) There are no quantifiers in wff (1). 
In (2), the scope of the quantifier (Vx) is P(x) > О(х). In (3), the scope of (Ay) is 
P(x, y) ^ Q(x, y), while the scope of (Vx) is the entire expression in parentheses 
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following it. In (4), the scope of (Ax) is S(x) and the scope of (Vy) is T(y); paren- 
theses or brackets can be eliminated when the scope is clear. 

If a variable occurs somewhere in a wff where it is not part of a quantifier and 
is not within the scope of a quantifier involving that variable, it is called a free 
variable. For example, у is a free variable in 


(VOLL, у) > Gy)RG. y)] 


because of the first occurrence of y, which is neither the variable of a quantifier nor 
within the scope of a quantifier using y. A wff with free variables may not have a 
truth value at all in a given interpretation. For example, in the interpretation where 
the domain is all of the integers, the predicate P(x) means “х > 0”, and 5 means (of 
course) the integer 5, the wff 


PO) ^ PS) 


has no truth value because we don't know which element of the domain y refers to. 
Some elements of the domain are positive and others are not. The wff 


P(y) V PS) 


is true in this interpretation even though we don't know what y refers to because 
P(5) is true. In both of these wffs y is a free variable. 


| EXAMPLE 21 | In the wff 
(Vx)(Ay)[S@, y) ^ L(y, a)] 


the scope of (Зу) is all of S(x, y) ^ L(y, a). The scope of (Vx) is Ay)[S(x, y) ^ L(y, a)]. 
Consider the interpretation where the domain consists of all the cities in the United 
States, S(x, y) is the property “x and y are in the same state," L(y, z) is the property 
“y’s name begins with the same letter as z’s name,” and a is assigned the value Albu- 
querque. So the interpretation of the entire wff is that for any city x there is a city y in 
the same state that begins with the letter 4. The wff is true in this interpretation. (At 


least it is true if every state has a city beginning with the letter А.) ө 


PRACTICE 17 | What is the truth value of the wff 


Ax)(A@ ^ VIBE, y) > CV) 


in the interpretation where the domain consists of all integers, A(x) is “x > 0,” B(x, y) is “x > y," and 
C(y) is “у = 0”? Construct another interpretation with the same domain in which the statement has the 
opposite truth value. m 


Translation 


Many English language statements can be expressed as predicate wffs. For ex- 
ample, “Every parrot is ugly,” is really saying, “For any thing, if it is a parrot, 


REMINDER 


Think 


and 


Vo 


EIN 
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then it is ugly." Letting P(x) denote “х is a parrot” and U(x) denote “х is ugly,” the 
statement can be symbolized as 


(Vx)[PQ)) > О0)] 


Other English language variations that take the same symbolic form are, “АП par- 
rots are ugly,” and, “Each parrot is ugly.” Notice that the quantifier is the univer- 
sal quantifier and the logical connective is implication; V and — almost always 
belong together. The wff (Vx)[P(x) ^ U(x)] is an incorrect translation because it 
says that everything in the domain—understood here to be the whole world— 
is an ugly parrot. This says something much stronger than the original English 
statement. 

Similarly, “There is an ugly parrot,” is really saying, “There exists something 
that is both a parrot and ugly.” In symbolic form, 


(x)[P@) ^ U@)] 


Variations are, “Some parrots are ugly,” and, “There are ugly parrots.” Here the 
quantifier is the existential quantifier and the logical connective is conjunction; 3 
and ^ almost always belong together. The wff (3x)[P(x) — U(x)] is an incorrect 
translation. This wff is true as long as there is anything, call it x, in the domain 
(the whole world) that is not a parrot, because then P(x) is false and the implication 
is true. Indeed, this wff is true if there are no parrots in the world at all! 

To translate an English statement into a wff, it may help to first write an in- 
termediate English language statement and then symbolize that statement. We did 
this with the parrot examples. 

The word “only” seems particularly troublesome in translations because its 
placement in a sentence can completely change the meaning. For example, the 
English statements 


1. John loves only Mary. 
2. Only John loves Mary 
3. John only loves Mary. 


say three entirely different things. Using the predicate symbols J(x) for “х is John,” 
M(x) for “x is Mary,” and L(x, y) for “х loves y,” they can be rewritten as 


1. If John loves any thing, then that thing is Mary. 


ог 
1. For any thing, if it is John then, if it loves anything, that thing is Mary. 
(vx)(JG) > (VW, у) > M(y)) 
2. If any thing loves Mary, then that thing is John. 
or 


2. For any thing, if it is Mary then, if anything loves it, that thing is John. 


(Vx)(M(x) > VEO, x) > Jy) 


3. If John does any thing to Mary, then that thing is love. 
or 
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3. For any thing, if it is John then, for any other thing, if that thing is Mary, 
then John loves it. 


(Ух) Лх) > (VMO) > LO, y)) 


In each case, the consequent of the implication is the word following “only” in the 
original English statement. 


| EXAMPLE? | Given the predicate symbols 


D(x) is “x is a dog” 
R(x) is “x is a rabbit” 
C(x, y) is “x chases y" 


Table 1.15 shows examples of an English statement, an intermediate English state- 
ment, and a wff translation. Note that in wff 2, the connective associated with 3 
is /\ and the connective associated with V is —. In wff 3, the first version shows 
two implications associated with the two V quantifiers. The second version is 
equivalent because of the tautology [4 ^ B — C] «€» [A — (В > C)]. This ver- 
sion may appear to violate the rule that universal quantifiers should be used with 
implication, not conjunction, but this tautology provides another way to write two 
implications. The second version also shows more clearly that *dogs," the word 
following “only,” is the conclusion. 


TABLE 1.15 


English Statement | Intermediate Statement Wff 


1. All dogs chase | For any thing, if itis a dog, | (Vx)[D(x)  (Vy)(R(y) > С(х,у))] 

all rabbits. then for any other thing, if 
that thing is a rabbit, then 
the dog chases it. 


2. Some dogs There is some thing that G»ID(x) ^ (Vy)(R(y) > С(х,у))] 

chase all rabbits. | is a dog and, for any other 
thing, if that thing is a rab- 
bit, then the dog chases it. 


3. Only dogs chase | For any thing, if it is a rabbit | (Vy)[R(y) — (Vx)(C(x,y) > D(x))] 
rabbits. then, if anything chases it, 

that thing is a dog. 

For any two things, if one is | (Vy)(Vx)[R(y) ^ C(x,y) > D(x)] 

a rabbit and the other chas- 

es it, then the other is a dog. 


Often more than one wff exists that is a correct representation of an English state- 
ment, as seen with statement (3) in Table 1.15. Also wff (2) is equivalent to 


EVDE) ^ VIRO) V Cœ, у))] 


because of the implication equivalence rule that says (R > C) €» (A' V C), even 
though here R and C are predicates instead of just statement letters. 
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In addition, it is legitimate to “slide” a quantifier over a predicate that does not 
involve the variable of that quantifier. Because D(x) does not involve y, we could 
slide the universal quantifier in wff (2) to the front (but not past the existential 
quantifier), giving the equivalent wff 


GxX Vy)LDG) ^ RO) > CO, у))] (a) 


In wff (a), we still have grouping symbols around A(y) — C(x,y). Without the 
grouping symbols, this wff becomes 


GxX Vy)LDG) ^ RO) > С(х, y)] (b) 


which, according to the order of precedence of connectives, is equivalent to 


EIVI) ^ RCGy)) > CO, y)] 


A quick truth table exercise shows that D Л (А С) is not equivalent to 
(D ^ R) — C, so wff (b) is not equivalent to wff (a) and thus does not represent 
statement (2) in the table. ө 


Translation from an English statement into a predicate wff is a little harder 
than translation into a propositional wff, partly because of the added expressive- 
ness of the verbal form and partly because there can be multiple correct predicate 
wffs. Here is a summary of translation tips: 


* Look for the key words that signify the type of quantifier: 
for all, for every, for any, for each: use a universal quantifier 
for some, there exists: use an existential quantifier. 
* English sometimes uses “understood” universal quantifiers. For example, 
“Dogs chase rabbits,” is understood to mean, “All dogs chase all rabbits.” 
e Ifyou use a universal quantifier, the connective that goes with it is almost 
always implication. 
* If you use an existential quantifier, the connective that goes with it is 
almost always conjunction. 
* Whatever comes after the word “only” is the conclusion of an implication; 
that is, it comes after “then” in an “if—then” statement. 
* You are most apt to arrive at a correct translation if you follow the order 
of the English words. 


| EXAMPLE 23 | Let’s do a couple of examples in great detail. The first is 


All giraffes are tall. 


The property of being a giraffe and the property of being tall are unary predicates. 
We'll use G(x) for “x is a giraffe” and 7(x) for “х is tall”. Following the sentence 
structure, we first see “All,” which tells us that there's a universal quantifier, so the 
wff begins with 


(Vx...) 
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All what? All giraffes, so 
(Vx)(G(x) ... ) 


Because of the universal quantifier, we expect to use the implication connective, 
so now we have 


(Ух)(С(х) >...) 


Thinking of the implication as an “if—then,” we have “if a giraffe, then ... ." Then 
what? Then it's tall. The final wff is 


(Vx)(G(x) > T(x) 
The second example is 
Only giraffes are taller than elephants. 


The property of being a giraffe and the property of being an elephant are unary 
predicates, and we’ll use G(x) and E(x) to represent them. But “taller than” is a 
property that compares two things, so it’s a binary predicate; 7(x, у) will mean “x 
is taller than у”. There are no obvious quantifier key words, so we understand that 
we are talking about all giraffes and all elephants (universal quantifiers). The word 
“giraffes” follows the word “only,” so the property of being a giraffe is going to 
be the conclusion of an implication and the overall form will be “if xxx, then a 
giraffe.” Indeed, if something is taller than an elephant, then it’s a giraffe. Putting 
in the universal quantifiers, “if any thing is taller than any elephant, then that thing 
is a giraffe,” or (even more tortured English), “for any thing, if it is an elephant, 
then for any other thing, if it's taller than the elephant, then it’s a giraffe.” Now we 
can pretty much translate directly into a wff. “For any thing, if it is an elephant, 
then" becomes 


(Vx)E(x) >...) 


and, "for any other thing, if it's taller than the elephant, then," adds a second im- 
plication to the wff: 


(Vx)(E(x) > WT x) > ... )) 


Notice that we introduced y here, a second variable, because we’ ve already given 
x the elephant property. Also, we've written 7( y, x), not T(x, y), because we want 
this new thing to be taller than the elephant, and our definition of the taller predi- 
cate was that the first variable was taller than the second. We are ready for the final 
conclusion—this new thing is a giraffe. 


(Vx)(E(x) > WTO, х) > G(y))) 


As in Table 1.15(3), a tautology allows us to also write this wff as 


(Vx)(Vy)(E(a) A Ty, x) > G(y)) 


“For any two things, if one is an elephant and the other is taller than the elephant, 
then the other thing is a giraffe.” 
With some practice, you won’t have to go quite this slowly! e 
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PRACTICE 18 | Using the predicate symbols S(x) for “х is a student,” Дх) for “х is intelligent," and M(x) 
for “х likes music,” write wffs that express the following statements. (The domain is the 
collection of all people.) 


a. All students are intelligent. 

b. Some intelligent students like music. 

c. Everyone who likes music is a stupid student. 

d. Only intelligent students like music. и 


PRACTICE 19 | Using the predicate symbols F(x) for “x is a fruit," V(x) for “x is a vegetable,” апа S(x,y) for 
“x is sweeter than y,” write wffs that express the following statements. (The domain is the 
whole world.) 


a. Some vegetable is sweeter than all fruits. 

b. Every fruit is sweeter than all vegetables. 

c. Every fruit is sweeter than some vegetable. 

d. Only fruits are sweeter than vegetables. н 


Negating statements with quantifiers, as in negating compound statements, 
requires care. The negation of the statement, “Everything is beautiful,” is, “It is 
false that everything is beautiful,” or, “Something is nonbeautiful.’” Symbolically, 


[(Vx)A(x)]’ is equivalent to (Ax)[A(x)]’ 


Note that, “Everything is nonbeautiful,” or (Vx)[4(x)|', says something stronger 
than the negation of the original statement. 

The negation of, “Something is beautiful,” is, “Nothing is beautiful,” or, 
“Everything fails to be beautiful.” Symbolically, 


[(Чх)А(х)]' is equivalent to (Vx)[A(x)]’ 


In English, the statement, “Everything is not beautiful,’ would often be misinter- 
preted as, “Not everything is beautiful,” or, “There is something nonbeautiful.” 
However, this misinterpretation, symbolized by (Ax)[A(x)]’, is not as strong as the 
negation of the original statement. 


PRACTICE 20 | Which of the following statements expresses the negation of, “Everybody loves somebody 
sometime"? 


a. Everybody hates somebody sometime. 

b. Somebody loves everybody all the time. 

c. Everybody hates everybody all the time. 

d. Somebody hates everybody all the time. н 
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Validity 


The truth value of a propositional wff depends on the truth values assigned to the 
statement letters. The truth value of a predicate wff depends on the interpretation. 
Choosing an interpretation for a predicate wff is thus analogous to choosing truth 
values in a propositional wff. However, there are an infinite number of possible 
interpretations for a predicate wff and only 2” possible rows in the truth table for 
a propositional wff with n statement letters. 

A tautology is a propositional wff that is true for all rows of the truth table. 
The analogue to tautology for predicate wffs is validity—a predicate wff is valid 
if it is true in all possible interpretations. The validity of a wff must be derived 
from the form of the wff itself, since validity is independent of any particular in- 
terpretation; a valid wff is “intrinsically true.” 

An algorithm exists to decide whether a propositional wff is a tautology— 
construct the truth table and examine all possible truth assignments. How can we 
go about deciding validity for predicate wffs? We clearly cannot look at all pos- 
sible interpretations, because there are an infinite number of them. As it turns out, 
no algorithm to decide validity for any wff exists. (This does not mean simply that 
no algorithm has yet been found—it means that it has been proved that there is no 
such algorithm.) We must simply use reasoning to determine whether the form of 
a particular wff makes the wff true in all interpretations. Of course, if we can find 
a single interpretation in which the wff has the truth value false or has no truth 
value at all, then the wff is not valid. 

Table 1.16 compares propositional and predicate wffs. 


TABLE 1.16 
Propositional Wffs Predicate Wffs 
Truth values True or false, depending on | True, false, or perhaps (if the 
truth value assignments to wff has a free variable) neither, 
statement letters depending on interpretation 
“Intrinsic truth” | Tautology —true for all truth Valid wff—true for all 
value assignments interpretations 
Methodology Algorithm (truth table) to No algorithm to determine 
determine whether wff is a whether wff is valid 
tautology 


Now let's try our hand at determining validity for specific wffs. 


| EXAMPLE 24 | a. The wff 


(Vx)P(x) > Gx)PQ) 


is valid. In any interpretation, if every element of the domain has a certain prop- 
erty, then there exists an element of the domain that has that property. (Remember 
that the domain of any interpretation must have at least one object in it.) Therefore, 


PRACTICE 21 


Section 1.3 Quantifiers, Predicates, and Validity 49 


whenever the antecedent is true, so is the consequent, and the implication is there- 
fore true. 


b. The wff 
(Vx)P(x) > P(a) 


is valid because in any interpretation, а is a particular member of the domain and 
therefore has the property that is shared by all members of the domain. 


c. The wff 
VOLPE) ^ Q(x)] < (Ух)Р(х) ^ (Vx) Q(x) 


is valid. If both Р and О are true for all the elements of the domain, then Р is true 
for all elements and Q is true for all elements, and vice versa. 


d. The wff 


Р(х) > [Q(x) > PQ)] 


is valid, even though it contains a free variable. To see this, consider any inter- 
pretation, and let x be any member of the domain. Then x either does or does not 
have property P. If x does not have property P, then P(x) is false; because P(x) 
is the antecedent of the main implication, this implication is true. If x does have 
property P, then P(x) is true; regardless of the truth value of Q(x), the implication 
О (x) — Р(х) is true, and so the main implication is also true. 


e. The wff 
(ax)P(x) > (Ух)Р(х) 


is not valid. For example, in the interpretation where the domain consists of the 
integers and P(x) means that x is even, it is true that there exists an integer that is 
even, but it is false that every integer is even. The antecedent of the implication is 
true and the consequent is false, so the value of the implication is false. ө 


We do not necessarily have to go to a mathematical context to construct an 
interpretation in which a wff is false, but it is frequently easier to do so because 
the relationships among objects are relatively clear. 


Is the wff valid or invalid? Explain. 


(Vx)[PQ) V QG)] > (СУх)Р@) V (VX) OQ) и 
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SECTION 1.3 REVIEW 


TECHNIQUES 


W Determine the truth value of a predicate wff in a 
given interpretation. 
Ù Translate English language statements into predi- 
cate wffs, and vice versa. 
* Recognize a valid wff and explain why it is valid. 
W) Recognize a nonvalid wff and construct an inter- 
pretation in which it is false or has no truth value. 


EXERCISES 1.3 


MAIN IDEAS 


The truth value of predicate wffs depends on the 
interpretation considered. 

Valid predicate wffs are “intrinsically true"—true 
in all interpretations. 


1. What is the truth value of each of the following wffs in the interpretation where the domain consists of the 
integers, O(x) is “x is odd,” L(x) is “x < 10,” and G(x) is “x > 9”? 
a. (Ax)O(x) с. (3x)[L(x) ^ G(x)] 


b. YVL > OQ] d. (Vx)[L(@x) V GG] 


2. What is the truth value of each of the following wffs in the interpretation where the domain consists of the 


integers, A(x) is “x < 5" and B(x) is “x « 7”? 


a. (Ax)A(x) с. (Vx)[A(x) > В(х)] 
b. Gx)[4(x) ^ В(х)] а. (Vx)[B(x) > A(x)] 
3. What is the truth value of each of the following wffs in the interpretation where the domain consists of the 
integers? 
a. (Vx)(y)(x + y = x) e. (Vx)(Vy)(x € y V y € x) 
b. Gy)(Vx)(x + y = x) f. (Vx)x « 0 5 (3y)((y 2» 0 Лх * y = 0)] 
c. (Vx)Gy)(x + y = 0) в. EVENE = y) 
d. (d3y)(Vx)(x + y = 0) h. (Vx)G? > 0) 


4. What is the truth value of each of the following wffs in the interpretation where the domain consists of the 


real numbers? 


a. (Vx) Gy) = у?) с. GxX(Vy)(x = у”) 


b. (Ух)(Уу)(х = у?) d. EVENE = у?) 


5. Give the truth value of each of the following wffs in the interpretation where the domain consists of 
the states of the United States, О(х, y) 1s “х is north of y," P(x) is “х starts with the letter M," and a is 


"Massachusetts." 

a. (Vx)P(x) 

b. (Vx)(Vy)(VZ)[O@, y) ^ Обу, 2) > OG, z)] 
с. (3) Gx)OCy», x) 

d. (Vx)Gy)[PQ) ^ Ох, у)] 

e. (Ay)O(, у) 

f. EDIPE) ^ Оф, a)] 


6. Give the truth value of each of the following wffs in the interpretation where the domain consists of 
people, M(x, y) is “х is the mother of y", F(x) is “x is female", М(х) is “x is male." 


а. (Vx)Gy)(M(y, x) d. GVE) M, y) ^ My) 
b. (ах)(Уу)(М(х, у)) e. EVM, y) > F(y)) 


с. (VAME, у) > M(y)) 
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7. For each wff, find an interpretation in which it is true and one in which it is false. 
a. (Vx)y([4Q) V В(х)] ^ [4Q) ABET) 
b. (Vx)(Vy)L[PGo, у) > PH, x)] 
с. (VIPE) > €y), у)] 
8. For each wff, find an interpretation in which it is true and one in which it is false. 
а. EDME) ^ (Vy) BQ, у)] 
b. [(Vx)A(x) > (Ух)В(х)] > (Ух)[А(х) > B(x)] 
с. Gx)[P(x) V O@)] ^ (Vx)[P(x) > О(х)] 
9. Identify the scope of each of the quantifiers in the following wffs and indicate any free variables. 
a. VDP) > 007] с. (Эх)[(Уу)Р(х, y) ^ Оф, у)] 
b. EVAS) ^ (Vy)BG))] d. Gxyay)[A4Gs y) ^ BO, z) > AG z)] 
10. Explain why each of the following expressions is written incorrectly. 
a. (9)(О(х) ^ Р(х) 
b. (VVOO) Р(у)) 
с. (Vx)(Vy)OQ) > PO) 
11. Which of the following sentences are equivalent to the statement 


АП circles are round. 


a. If it's round, it's a circle. 
b. Roundness is a necessary property of circles. 
c. Something that isn't round can't be a circle. 
d. Some round things are circles. 
12. Which of the following sentences are equivalent to the statement 


Cats are smarter than dogs. 


. Some cats are smarter than some dogs. 

. There is a cat that is smarter than all dogs. 
. All cats are smarter than all dogs. 

. Only cats are smarter than dogs. 


o co c g 


. All cats are smarter than any dog. 

13. Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 
D(x): x is a day 
S(x): x is sunny 
R(x): x is rainy 
M: Monday 
T: Tuesday 
а. АП days are sunny. 

b. Some days are not rainy. 
c. Every day that is sunny is not rainy. 
d. Some days are sunny and rainy. 


14. 


15. 


16. 
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. No day is both sunny and rainy. 
. Itis always a sunny day only if it is a rainy day. 
. No day is sunny. 


Бо rho 


. Monday was sunny; therefore, every day will be sunny. 
i. It rained both Monday and Tuesday. 


j. If some day is rainy, then every day will be sunny. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 
B(x): x is a ball 
R(x): x is round 
S(x): x 1s a soccer ball 
АП balls are round. 

. Not all balls are soccer balls. 

. All soccer balls are round. 

. Some balls are not round. 


a. 
b 
© 
d 
e. Some balls are round but soccer balls are not. 
f. Every round ball is a soccer ball. 
g. Only soccer balls are round balls. 
h. If soccer balls are round, then all balls are round. 
Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 
M(x): x is aman 

W(x): x is a woman 
T(x): x is tall 

. All men are tall. 

. Some women are tall. 

. All men are tall but no woman is tall. 

. Only women are tall 


a 
b 

с 

а 

е. No тап 15 tall. 
f. If every man is tall, then every woman is tall. 

g. Some woman is not tall. 

h. If no man is tall, then some woman 15 not tall. 

Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 

A(x): x is an animal 

B(x): x is a bear 

A(x): x is hungry 

W(x): x is a wolf 

a. Bears are animals. 


b. No wolf is a bear. 


17. 


18. 


19. 


20. 
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. Only bears are hungry. 
. If all wolves are hungry, so are bears. 
. Some animals are hungry bears. 
Bears are hungry but some wolves are not. 


. If wolves and bears are hungry, so are all animals. 


Dos mo а 0 


. Some wolves are hungry but not every animal is hungry. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 


P(x): x is a person 

T(x): x is a time 

F(x, y): x 1s fooled at y 

a. You can fool some of the people all of the time. 
b. You can fool all of the people some of the time. 
c. You can't fool all of the people all of the time. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 


L(x): x is a lion 

R(x): x roars 

P(x): x is a predator 

Z(x): x is a zebra 

E(x, y): x eats y 

a. All lions are predators. 
b. Some lions roar. 

c. Only lions roar. 

d. Some lions eat all zebras. 
e. АП lions eat all zebras. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 


G(x): x is a game 

M(x): x 15 a movie 

F(x, y): x is more fun than y 

a. Any movie is more fun than any game. 

b. No game is more fun than every movie. 
c. Only games are more fun than movies. 

d. АП games are more fun than some movie. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 


C(x): x is a child 
T(x): x is a toy 


2 


— 


22. 


23. 
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V(x): x is a vegetable 

W(x, y): x wants y 

a. Every child wants toys. 

b. Only children want toys. 

c. Some child wants only toys. 
d. No child wants vegetables. 


. Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 


a predicate wff. (The domain is the whole world.) 
J(x): x is a judge 

L(x): x is a lawyer 

W(x): xis a woman 

C(x): x is a chemist 

A(x, у): x admires у 

a. There are some women lawyers who are chemists. 
b. No woman is both a lawyer and a chemist. 

c. Some lawyers admire only judges. 

d. All judges admire only judges. 

e. Only judges admire judges. 

f. All women lawyers admire some judge. 

g. Some women admire no lawyer. 

Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 
C(x): x is a Corvette 

F(x): x is a Ferrari 

P(x): x is a Porsche 

S(x, y): x is slower than y 


a. Nothing is both a Corvette and a Ferrari. 

b. Some Porsches are slower than only Ferraris. 

c. Only Corvettes are slower than Porsches. 

d. All Ferraris are slower than some Corvettes. 

e. Some Porsches are slower than no Corvette. 

f. If there is a Corvette that is slower than a Ferrari, then all Corvettes are slower than all Ferraris. 
Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 

В(х):х is a bee 

F(x): x is a flower 

L(x, y): x loves y 

a. АП bees love all flowers. d. Every bee hates only flowers. 

b. Some bees love all flowers. e. Only bees love flowers. 

c. АП bees love some flowers. f. Every bee loves only flowers. 


24. 


2 


2 


сл 


lon 
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g. No bee loves only flowers. j. Every bee hates some flowers. 
h. Some bees love some flowers. k. Every bee hates all flowers. 
i. Some bees love only flowers. l. No bee hates all flowers. 


Using the predicate symbols shown and appropriate quantifiers, write each English language statement as 
a predicate wff. (The domain is the whole world.) 
S(x): x is a spy novel 
L(x): x is long 
M(x): x is a mystery 
B(x, y): x is better than у 
. All spy novels are long. 
. Not every mystery is a spy novel. 
. Only mysteries are long. 
. Some spy novels are mysteries. 
. Spy novels are better than mysteries. 
Some mysteries are better than all spy novels. 


gg mo бс осо сь 


. Only spy novels are better than mysteries. 


. Give English language translations of the following wffs if 


L(x, у): x loves y 

A(x): x is handsome 

M(x): x is aman 

P(x): x is pretty 

W(x): x is a woman 

J: John 

k: Kathy 

L(x, y): x loves y 

а. HG) ^ Lk, j) 

- (Vx)[M(x) > Н(х)] 

‚ (Vx) W(x) > VILE, y) > MY) ^. Н(у)]) 

‚ EDIM) ^ Hx) ^ L(x, k)] 

‚ EDW) A Р(х) ^ VEK, y) > HG) ^ М(у)]) 
(VIWE) ^ P(x) > LG, x)] 


к» о с о c 


. Give English language translations of the following wffs if 


M(x): x is aman 

W(x): x is a woman 

i: Ivan 

p: Peter 

W(x, y): x works for y 

а. EYW ^ (VM у) > We, YI!) 
b. (Vx)[MQ) > Gy)(M(y) A Wa, y))] 

c. (VOM > (Vy), у) > ЙЎ/(у))] 


27. 


28. 


29, 
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d. (Vx)(Vy (M(x) ^ Wy, x) > Wy) 

e. Wi, p) ^ (Vx)[W(p, х) > (V(x))' ) 

Ё (X) DWG, d > (W))'] 

Three forms of negation are given for each statement. Which is correct? 
a. Some people like mathematics. 

1. Some people dislike mathematics. 

2. Everybody dislikes mathematics. 

3. Everybody likes mathematics. 

b. Everyone loves ice cream. 

1. No one loves ice cream. 

2. Everyone dislikes ice cream. 

3. Someone doesn’t love ice cream. 

c. All people are tall and thin. 

1. Someone is short and fat. 

2. No one is tall and thin. 

3. Someone is short or fat. 

d. Some pictures are old or faded. 

1. Every picture is neither old nor faded. 

2. Some pictures are not old or faded. 

3. All pictures are not old or not faded. 
Three forms of negation are given for each statement. Which is correct? 
a. Nobody is perfect. 

1. Everyone is imperfect. 

2. Everyone is perfect. 

3. Someone is perfect. 

b. All swimmers are tall. 

1. Some swimmer is not tall. 

2. There are no tall swimmers. 

3. Every swimmer is short. 

c. Every planet is cold and lifeless. 

1. No planet is cold and lifeless. 

2. Some planet is not cold and not lifeless. 

3. Some planet is not cold or not lifeless. 
d. No bears are hungry. 

1. Only bears are hungy. 

2. All bears are hungry. 

3. There is a hungry bear. 

Write the negation of each of the following statements. 
a. Some Web sites feature audio. 

b. Every Web site has both audio and video. 

c. Every Web site has either audio or video. 


30. 


3 


— 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


| Section 1.3 Quantifiers, Predicates, and Validity 57 


d. Some Web sites have neither audio nor video. 

e. Every Web site either has text or else has both audio and video. 
Write the negation of each of the following statements. 

a. Only students eat pizza. 

b. Every student eats pizza 

c. Some students eat only pizza. 


. Write the negation of each of the following statements. 


a. Some farmer grows only corn. 

b. All farmers grow corn. 

c. Corn is grown only by farmers. 

Write the negation of each of the following statements 

a. Some child fears all clowns. 

b. Some children fear only clowns. 

c. No clown fears any child. 

Explain why each wff is valid. 

- (Vx)(VY)AQ, y) => (VY)(VX)A(X, y) 

b. EVEA, у) < (Зу)(Ах)Д(х, y) 

c. (ах)(Уу)Р(х, y) > (Vy)Gx)P(x, у) 

d. A(a) > (Әх)А(х) 

e. (Vx)[4(x) > B] > [(Vx)A(x) > (Vx)B(x)] 

Give interpretations to prove that each of the following wffs is not valid: 

а. x)A(x) ^ (ах)В(х) > EVIA) ^ В(х)] 

b. (Vx)Gy)PG, y) > (Әх)СУу)Р(х, у) 

с. (VIPE) > 00)] > [(ах)Р(х) > (Vx)O@)] 

а. YDA > [(Vx)4@)]' 

Decide whether each of the following wffs is valid or invalid. Justify your answer. 

а. (Эх)4(х) e (VVA 

b. (Vx)P(x) V EDLE) > VIPE) V O@)] 

Decide whether each of the following wffs is valid or invalid. Justify your answer. 

a. (Vx)A(x) > ((Әх)[А(х)]')' 

b. (VIPE) > 0609] ^ Gxy[PQ) V Об)] > EVP) ^ gw] 

с. (VIPE) V O(x)] > VIP V Ay)O(y) 

From Example 24c, we know that (Vx)[P(x) ^ Q(x)] > (Vx)P(x) ^ (Vx)O(x) is valid. From Prac- 
tice 21, we know that (Vx)[P(x) V О(х)] > (Vx)P(x) V (Vx)Q(x) is not valid. From Exercise 34a, we 
know that (3x)[P(x) ^ Q(x)] < (3x)P(x) ^ (3x)Q(x) is not valid. Explain why (3x)[P(x) V Q(x)] > 
(3x)P(x) V (dx)Q(x) is valid. 

A predicate wff is in prenex normal form if all the quantifiers appear at the front of the wff. Write each of 
the following expressions as an equivalent wff in prenex normal form. 


a. (Vx)P(x) ^ (Vy)OQ) 
b. (VPE) > MIO) > Wx, у)]) 
c. (Ax)P(x) ^ (Ях)О(х) 


e 
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We can imagine arguments of the form 
Ру ЛАР, АР: л... AP, О 


where Ше wffs are built from predicates and quantifiers as well as logical соппес- 
tives and grouping symbols. For a valid argument, О must follow logically from 
P,, ... , P, based solely on the internal structure of the argument, not on the truth 
or falsity of Q in any particular interpretation. In other words, the wff 


Р, ЛР, ЛР, Л... ^P, О 


must be valid—true in all possible interpretations. No equivalent of the truth table 
exists to easily prove validity, so we turn to a formal logic system called predicate 
logic. We again use a system of derivation rules to build a proof sequence lead- 
ing from the hypotheses to the conclusion. The rules should once more be truth- 
preserving, so that if in some interpretation, all the hypotheses are true, then the 
conclusion will also be true in that interpretation. The system will then be correct 
(only valid arguments will be provable). We also want the system to be complete 
(every valid argument should be provable), yet at the same time the rule set should 
be minimal. 


Derivation Rules for Predicate Logic" 


The equivalence rules and inference rules of propositional logic are still part of 
predicate logic. An argument of the form 


P/A(P>Q)>@Q 
is still valid by modus ponens, even if the wffs involved are predicate wffs. 
(Vx)R(x) ^ [CV xX) R(x) > VIS] — (Vx)S(x) 


A proof sequence is 


1. (Vx)R(x) hyp 
2. (Vx)R(x) > (Vx)S(x) hyp 
3. (Vx)S(x) il; 2, mira ө 


However, there are many arguments with predicate wffs that are not tautolo- 
gies but are still valid because of their structure and the meaning of the univer- 
sal and existential quantifiers (see Example 24). The overall approach to proving 
these arguments is to strip off the quantifiers, manipulate the unquantified wffs, 


4А complete list of derivation rules for propositional and predicate logic is given in Appendix A. 
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and then put the quantifiers back in. The new rules of inference provide mecha- 
nisms to strip off and insert quantifiers. Hence there are four new rules—one each 
to strip off the universal and existential quantifier, respectively, and one each to 
insert the universal and existential quantifier, respectively. The four rules are given 
in Table 1.17; their details will be explained shortly. In Table 1.17, the notation P(x) 
does not imply that P is a unary predicate with x as its only variable; it simply 
means that x is one of the variables in the predicate P. Thus P(x) might actually be 
something like (4y)(Vz)O(, у, 2). 


TABLE 1.17 
Inference Rules 
From Can Derive Name/Abbreviation for Rule Restrictions on Use 
(Vx)P(x) P(t), where t is a variable or | Universal instantiation —ui If tis a variable, it must not fall 
constant symbol within the scope of a quantifier 
for 1. 
(ax)P(x) P(a) where a is a constant Existential instantiation — ei Must be the first rule used that 
symbol not previously used introduces a. 
in proof sequence 
P(x) (vx)P(x) Universal generalization — ug P(x) has not been deduced from 
any hypotheses in which x is a 
free variable nor has P(x) been 
deduced by ei from any wff in 
which x is a free variable. 
P(x) or P(a) Existential generalization — eg To go from P(a) to Gx)P(x), x must 
where a is not appear in Р(а). 
a constant 
symbol 


Now let's examine these rules more closely, particularly the necessity for their 
restrictions. 


Universal Instantiation 

The universal instantiation rule says that from (Vx)P(x) we can derive P(x), 
P( y), P(z), P(a), and so on, thus stripping off a universal quantifier. The justifica- 
tion is that if P is true for every element of the domain, we can name such an ele- 
ment by an arbitrary variable name like x, y, or z, or we can specify a particular 
constant in the domain, and P is still true for all of these things. 


Universal instantiation can be used to prove one of the classical "syllogisms" of 
the Greek philosopher and scientist Aristotle, who lived from 384 to 322 B.C.E. and 
who first developed a system of formal logic. 

The argument has the form, *All humans are mortal. Socrates is human. There- 
fore Socrates is mortal." Using the notation 


Hx) is “х is human.” 
s is a constant symbol (Socrates) 
M(x) is “х is mortal.” 
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the argument is 
(Vx)[H(x) > M(x)] ^ Hs) > M(s) 
and a proof sequence is 


(Vx)(H(x) > M(x)) hyp 


Hs) hyp 

Н(ѕ) > M(s) 1, ui 

М) 2, 3, mp 
In step 3, a constant symbol has been substituted for x throughout the scope of the 
universal quantifier, as allowed by universal instantiation. ө 


Without the restriction on universal instantiation, a hypothesis of the form 
(Vx)(3y)P(x, y) could lead to the wff (Ay)P(), y); here y has been substituted for x 
within the scope of a quantifier on y. This would be invalid. For example, in the 
domain of the integers, if Р(х, y) means “у > x,” then (Vx)(Sy)P(x, y) is true (for 
every integer there is a bigger integer) but (3y)P(y, y) is false (no integer has the 
property that it is bigger than itself). 


PRACTICE 22 | Prove the argument 


VOLPE) > К(х)] ^ [RO] [POY] a 


Existential Instantiation 

The existential instantiation rule allows us to strip off an existential quantifier. 
It says that from (3x)P(x) we can derive Р(а) or P(b) or P(c) provided that these 
are new constant symbols. The justification is that if P is true for some element 
of the domain, we can give that element a specific name, but we cannot assume 
anything else about it. 


EXAMPLE 27 


The following expressions would be legitimate steps in a proof sequence: 


1. (Vx)[P@) > О(х)] hyp 


2. GVP) hyp 
3. P(a) DUCI 
4. P(a) > O(a) 1, ui 
5. Q(a) 3, 4, mp 


In step 3, the specific element with property P was given the name a. In step 4, 
ui was then used to say that an implication that is universally true in the domain 
is certainly true for this a. Steps 3 and 4 cannot be reversed. If ui is first used on 
hypothesis 1 to name a constant a, there is then no reason to assume that this par- 
ticular a is the one that is guaranteed by hypothesis 2 to have property P. e 
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The effect of the restriction on existential instantiation is that you should look 
at all your hypotheses and, if you plan to use ei on any of them, do it first. 


REMINDER 


Use existential instan- 
tiation early in the proof 
sequence. 


Universal Generalization 

Universal generalization allows a universal quantifier to be inserted. This must 
be done pretty carefully, however. If we know that P(x) is true and that the x is 
absolutely arbitrary, i.e., that x could be any element of the domain, then we can 
conclude (Vx)P(x). But if x is supposed to represent some specific element of the 
domain that has property P, then we can’t generalize that every element of the 
domain has property P. 


| EXAMPLE 28 | РЕН Use predicate logic to prove 


VOLPE) > QG)] ^ VPE > (VX) OC) 


Here is a proof sequence. 


1. (УРО) > Q@)] hyp 


2. (Vx)P(x) hyp 

3. P(x) > О(х) 1, ш 

4. Р(х) 2, ш Note that there is по restriction on ш about 
reusing a name. 

5 Q(x) 3, 4, mp 

6. (Vx)O(x) 5, ug 


The use of universal generalization at step 6 is legitimate because x was not a 
free variable in any hypothesis nor was ei used anywhere in the proof. The vari- 
able x in steps 3 and 4 is just an arbitrary name, representative of any element 
in the domain. ө 


There аге two restrictions on universal generalization. Without the first re- 
striction, the sequence 


1. Р(х) hyp 
2. (Vx)P(x) 1, incorrect ug; x was free in the hypothesis. 


would be a proof of the wff P(x) — (Vx)P(x), but this is not a valid wff. Element x 
of the domain may have property P, but that does not mean that every element of 
the domain has property Р. In the hypothesis, x is naming some fixed if unspeci- 
fied element of the domain. For instance, in the interpretation where the domain 
consists of automobiles and P(x) means “х is yellow,” some particular car may be 
yellow but it is certainly not true that all cars are yellow. 

Without the second restriction, the sequence 


1. (Vx)Gy)OGo у) hyp 


2. (Ay)Q@, y) l, ui 
3. Q(x, a) 2,el 
4. (Vx)O(, a) 3, incorrect ug; О(х, a) was deduced by ei from the wff 


in step 2, in which x is free. 
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would be a proof of the wff (Vx)(3y)QO(x, y) > (Vx)Q(x, a). This is also not a valid 
wff. For instance, in the interpretation where the domain consists of the integers 
and O(x, у) means that x + у = 0, then it is the case that for every integer x there 
is an integer y (the negative of x) such that x + y = 0. However, if a is a particular 
fixed element in the domain, then it will not be true that adding that same integer 
a to every x will always produce zero. 


PRACTICE 23 | Prove the argument 


(УР) ^ О()] > VILE) ^ P]. m 


Existential Generalization 

The last rule allows insertion of an existential quantifier. From P(x) or P(a) we can 
derive (Ax)P(x); something has been named as having property P, so we сап say 
that there exists something that has property P. 


Prove the argument (Vx)P(x) > (3x)P(x). 
Here is a proof sequence. 


1. (Vx)P(x) hyp 
2. Р(х) 1, ui 
ЗЕРО) Co: e 


Without the restriction on existential generalization, from P(a, y) one could 
derive (Ay)P(y, y); here the quantified variable y, which replaced the constant sym- 
bol a, already appeared in the wff to which existential generalization was applied. 
But the argument P(a, y) > (Ay)P(y, y) is not valid. In the domain of integers, if 
Р(х, y) means “у > x" and a stands for 0, then if y > 0, this does not mean that 
there is an integer y that is greater than itself. 


More Work with Rules 


As is the case with propositional logic rules, predicate logic rules can be applied 
only when the exact pattern of the rule is matched (and, of course, when no 
restrictions on use of the rule are violated). In particular, notice that the instan- 
tiation rules strip off a quantifier from the front of an entire wff that 1s in the 
scope of that quantifier. Both of the following would be illegal uses of existential 


instantiation: 
І. EVP) V (ах)О(х) hyp 
2. P(a) V Q(a) 1, incorrect ei. The scope of the first existential quan- 


tifier in step 1 does not extend to the whole rest of 
the wff. 


| Section 1.4 Predicate Logic 63 


1. (VEE, y) hyp 


2. (Ух)О(х, a) 1, incorrect ei. The existential quantifier in step 1 is not at 
the front. 


Similarly, the rules to insert a quantifier put the quantifier in the front of a wff that 
is then entirely within its scope. 

The new derivation rules all have restrictions, but in practice, you are most 
likely to violate the ei restriction. Pay special attention to this restriction and be 
sure that when you substitute a constant using ei, it's nota previously used constant. 

Even though we have added only four new derivation rules, the rule set is 
complete and correct. We can prove every valid argument and only valid argu- 
ments using these rules. Application of the rules, as in the case of propositional 
logic, is somewhat mechanical because there are only a limited number of options 
at each step. Again, the general plan of attack is usually as follows: 


* Strip off the quantifiers. 
* Work with the separate wffs. 
* Insert quantifiers as necessary. 


Using predicate logic, prove the argument 


VOLPE) ^ QQ)] > СУх)Р@) ^ (Vx)OQ) 


In Example 24(c) we noted that this wff is valid, so if all valid arguments are prov- 
able, we should be able to find a proof sequence. As usual, the hypothesis gives us 
a starting point. 


1. (Vx)[P@) ^ Об)] hyp 


Stripping off the universal quantifier that appears in step 1 will yield access to 
P(x) ^ Q(x), which can then be separated. The universal quantifier can then be 
inserted separately on each of those two wffs using universal generalization. The 
conclusion (Vx)P(x) ^ (Ух)О(х) will follow. A proof sequence is 


1. (Vx)[P@) ^ О()] hyp 


2. P(x) ^ Q(x) 1, ui 
SRE) 2, sim 
4. Q(x) 2, sim 
5. (Vx)P(x) 3, ug 
6. (Vx)Q(x) 4, ug 


7. (Vx)P(x) ^(Vx)O(x) 5, 6, con 


Neither restriction on universal generalization has been violated because x is not 
free in the hypothesis and existential instantiation has not been used. e 


PRACTICE 24 | Using predicate logic, prove the following argument. (Hint: The deduction 
method still applies.) 


(Vy)[P(x) > Оф, y)] > ГРО) > (Vy) OO у)] и 
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EXAMPLE 31 


As an extension to the deduction method, we can insert a “temporary” 
hypothesis into a proof. If some wff 7 is introduced into a proof sequence as a 
temporary hypothesis, and eventually a wff W is deduced from Т and other hy- 
potheses, then the wff T — W has been deduced from the other hypotheses and 
can be inserted in the proof sequence. 


The argument 


[P(x) > (Vy) OC, y)] > (VVIP) > Об, у)] 


is valid. In the following proof sequence, P(x) is introduced at step 2 as a tempo- 
rary hypothesis, which allows us to deduce Q(x, y) at step 4. The indented steps 
show that these wffs depend on the temporary hypothesis. At step 5, the temporary 
hypothesis is “discharged,” as the dependency of Q(x, y) on the temporary hy- 
pothesis is explicitly acknowledged as an implication. Of course, the entire wff at 
step 5, P(x) > О(х, у), still depends on the hypothesis of step 1. At step 6, neither 
restriction on universal generalization is violated because y is not a free variable 
in step | (the only hypothesis at this point) and existential instantiation is not used 
in the proof. 


1. P(x) > (Уу)О(х,у) hyp 


2, Р(х) temporary hyp 

3. (VVC, у) 1, 2, mp 

4. Q(xy) 3, ui 

5. P(x) > Q(x, y) temp. hyp discharged 


6. VIPE) > О(х, »)) 5, ug 


Notice how the temporary hypothesis gives us enough ammunition to make some- 
thing happen. Without this technique, it would be difficult to know what to do after 
step 1. e 


The technique of introducing a temporary hypothesis is seldom needed. 
Again, think of this as an extension to the deduction method. If the desired con- 
clusion is of the form P — Q, the deduction method says we can assume P as a 
hypothesis and deduce Q as the conclusion. If the desired conclusion is ofthe form 
(Vx)(P(x) > Q(x)) or GVP) > Qx), then the deduction method does not apply, 
but P(x) can be used as a temporary hypothesis. 

Practice 24 and Example 31 show that the wff 


PE) > Об, y)] => Pœ) > (Vy), у)] 


is valid. It says that the universal quantifier can "slide over" subwffs that do not 
contain the quantified variable; in this case, (Vy) is passed over P(x). A similar 
result holds for the existential quantifier. We noted this feature in Example 22, and 
here is the formal justification. This is one reason why there may be two or more 
equivalent ways of expressing English language sentences as predicate wffs, as in 
Exercises 13 through 24 of Section 1.3. 
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PRACTICE 25 | Prove the argument 


(Ух)[(8(х) V Cw) > AQ)] > HBE) > А(х)] и 


In Section 1.3 we observed that, based on our understanding of negation апа 
the meaning of the quantifiers, [(3x)4(x)|' is equivalent to (Vx)[4(x)]. We should 
be able to formally prove that 


[(ах)А(х)]' > (Ух) [40)] 
is a valid wff. 
| ЕХАМРІЕЗ2 | Prove that 
[(Әх)А(х)]' > (Ух)[А(х)]' 


is valid. We must prove the implication in each direction. 
a. [EDAC] > (Ух)[А(х)]' 


The hypothesis alone gives us little to work with, so we introduce a (somewhat 
surprising) temporary hypothesis. A proof sequence is 


1. [EAA] hyp 

2. A(x) temporary hyp 

3.  (3x)A(x) 2.095 

4. A(x)  (Эх)А(х) temporary hyp discharged 
5. [А(х)]' 1, 4, mt 

6. (V YLA] 5, ug 


b. (Vx)[46)]' > [Gx)460]' 


This proof also requires a temporary hypothesis. It is even more surprising than 
case (a) because we assume the exact opposite of the conclusion we are trying to 


reach. 
1. YDM hyp 
2. (dx)A(x) temporary hyp 
3. A(a) DAC 
4. Ma 1, ui 
5. [)40)]7' 3, 4, inc 
6. (3x) A(x) > [(Vx)[A(x)]']' temporary hyp discharged 
7. KDAT 1, dn 
8. [EDA] 6, 7, mt e 


The proofs of Example 32 are rather difficult because they require con- 
siderably more imagination than most and an unexpected use of a temporary 
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hypothesis. As a result, however, we do have the following equivalence, to which 
we've given a name: 


[(Әх)А(х)]' < (Vx)[A@)]’ (Negation—neg) 


This equivalence might be useful in a proof sequence. As an extension of the 
equivalence rules, whenever Р <> О is valid, О can be substituted for P within an 
expression in a proof sequence. 


| EXAMPLE3S | Is the wff 
(Vx)[P(x) V Q(x)] > Gx)PQ) V (Ух)О(х) 


a valid argument? Prove or disprove. 

Let's first consider whether the wff seems valid. If so, we should try to find a 
proof sequence for it; if not, we should try to find an interpretation in which it is 
not true. This wff says that 1f every element of the domain has either property P 
or property Q, then at least one element must have property P or else all elements 
have property Q. This seems very reasonable, so we'll try to find a proof. 

First we'll use an equivalence to rewrite the conclusion in a more useful form. 
Changing the V to an implication will allow use ofthe deduction method. Thus we 
want to prove 


(Vx)[P(x) V W] > EDP] > ()00)] 
A proof sequence is 


1. (Vx)[PG) V Ow] hyp 


2. [EVP] hyp 

3. (Vx)[P(x)]' 2, neg 

4. [P(x)]’ 3, ui 

5. P(x) V О(х) 1, ui 

6. Q(x) 4, 5, ds 

7. (Vx)O(x) 6, ug e 


| EXAMPLEA | Is the wff 
(Ax)P(x) ^ (Ax)Q(x) > (а)[Р(х) ^ Q(x)] 


a valid argument? Prove or disprove. 

If something in a domain has property P and something has property О, that 
does not mean that some one thing has both property P and Q. For example, in the 
domain of integers, if P(x) means “x is even" and Q(x) means “х is odd," then the 
hypotheses are true, but the conclusion is false because there is no single integer 
that is both even and odd. One interpretation in which the wff is false is enough to 
disprove it. 
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It is useful, however, to see where a potential proof sequence goes wrong. We 
begin with the two hypotheses and then remove one of the existential quantifiers. 


1. (9х)Р(х) hyp 


2. (Ax)O(x) hyp 
SE) Il; Gi 


Now here's the problem. The next step would be to remove the existential quan- 
tifier from the wff at step 2, but, according to the rules for ei, we have to name the 
object that has property О by some different name, not a. So we could eventually 
get to a wff in the proof sequence that looks like 


P(a) ^ Q(b) 


but this does us no good. Existential generalization could not be used to replace 
both constant symbols with a single variable. At best, we could arrive at 


Gy)Gx)[PC9 ^ QCy)] 


which is not what we want. e 


PRACTICE 26 | Is ће wff a valid argument? Prove or disprove. 


EVRO) A LEVRE) ^ 569] ]' > EVS’ и 


Verbal Arguments 


To prove the validity of a verbal argument, we proceed much as before. We cast 
the argument in symbolic form and show that the conclusion can be deduced from 
the hypotheses. If the argument involves predicate wffs, then the derivation rules 
of predicate logic are available. 


Show that the following argument is valid: “Every laptop has an internal disk 
drive. Some laptops have a DVD drive. Therefore some laptops have both an in- 
ternal disk drive and a DVD drive.” Using 


L(x) is “x is a laptop.” 
I(x) is “x has an internal disk drive.” 
D(x) is “x has a DVD drive.” 


the argument is 


VLE) > 109] ^ EVLE) ^ D > EDLE) ^ Да) \ DO] 
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Note that if we attempt to symbolize this argument in propositional logic, we get 
A ^ B — C, which is not a valid argument. Propositional logic is simply not ex- 
pressive enough to capture the interrelationships among the parts of this argument 
that serve to make it valid. 

A proof sequence is 


1. (мх) Ш) > 16] hyp 

2. (аху) ^ Род] hyp 

3. L(a) ^ D(a) е! 

4. L(a) > Қа) 1, ui 

5. L(a) 3, sim 
6. Қа) 4, 5, mp 
7. Ма) ^ D(a) ^ Қа) 3, 6, con 
8. L(a) ^ Қа) ^ D(a) 7, comm 


9. (39[L(x) ^ Дх) ^D(x)) 8, eg 


Once again, it is the form of the argument that matters, not the content. ө 


PRACTICE 27 | Show that the following argument is valid: “АП rock music is loud music. Some rock music 
exists, therefore some loud music exists.” Use predicates R(x) and L(x). 


Conclusion 


We've now finished our study of formal logic. What has been accomplished? 
The goal of formal logic, often called symbolic logic, is to make arguments as 
meaningless as possible! The symbolic notation of propositional and predicate 
logic allows us to symbolize arguments. An argument cast in symbolic notation 
removes any possibility that we will be swayed by our opinions or our external 
knowledge about the topic of the argument, and we can concentrate solely on its 
structure to determine its logical validity. Furthermore, the derivation rules allow 
the proof of an argument's validity to be produced by symbol manipulation. A 
proof requires no external knowledge, only a careful adherence to the forms and 
restrictions of the rules. In theory, then, producing a proof sequence should be 
almost mechanical. Again, one objective of practice with this mechanical process 
of applying rules is that it will ultimately transform into a habit of logical thinking 
in everyday life. 

Nonetheless, you may still feel that it is difficult to produce a proof sequence. 
Practice does make the process easier, because after a while, you become familiar 
with the various forms an argument might take and you recognize which rules 
you should try to apply. At any rate, you should at least find it easy at this point to 
check whether a proposed proof sequence 15 logically correct. 

Philosophers through the ages believed logical thinking to be one of the high- 
est achievements of human existence. One additional example, however, will 
show how even the most careful application of logic can be frustrated. 
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The hunting grounds of a medieval king were forbidden to commoners, and any- 
one caught poaching the royal deer was subject to death. The hapless poacher was, 
however, granted a means to choose the manner of death. He (or she) was allowed 
to make a final statement. If the statement were judged to be true, death would be 
by beheading with a sword; if false, death would come by arrow shot from the 
bow of the best royal marksman. One day a particularly clever poacher was ap- 
prehended and allowed the usual final statement. The poacher said, “I will be shot 
to death by an arrow.” 

The king’s court faced a conundrum. If the poacher were shot to death by an 
arrow, then the statement he made would prove to be true, in which case he should 
have been beheaded. But if he were beheaded, then the statement he made would 
be false, in which case he should have been shot by an arrow. Unable to decide the 
manner of death, the court appointed the clever poacher to the post of king’s press 
secretary, where he served happily for many years. 

This sort of paradox—a riddle with no solution—has to be carefully constructed, 
and we will not spend any more time reflecting on the potential shortcomings of 
classical logic systems that it may reveal. ө 


In addition to logical thinking in its pure sense, the notions of formal rules of 
inference have two very direct applications to computer science. An entire system 
of programming, and some programming languages, are based on applying rules 
of inference. We will see such a language in Section 1.5. Similarly, rules of infer- 
ence can be applied to formally prove program correctness, leading to increased 
confidence that code is error-free. We'll look at some of the inference rules for 
program correctness in Section 1.6. 


SECTION 1.4 REVIEW 


TECHNIQUES MAIN IDEA 

QJ Apply derivation rules for predicate logic. * The predicate logic system is correct and com- 

W Use predicate logic to prove the validity of a ver- plete; valid arguments and only valid arguments 
bal argument. are provable. 


EXERCISES 1.4 


For Exercises 1—6, decide what conclusion, if any, can be reached from the given hypotheses and justify your 
answer. 
1. АП flowers are plants. Pansies are flowers. 


. All flowers are plants. Pansies are plants. 
. All flowers are red or purple. Pansies are flowers. Pansies are not purple. 
. Some flowers are purple. АП purple flowers are small. 

Some flowers are red. Some flowers are purple. Pansies are flowers. 


ON tA ALUN 


. Some flowers are pink and have thorns. All thorny flowers smell bad. Every flower that smells bad is a 
weed. 
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м 


. Justify each step in the following proof sequence of 
Gx)[P(x) > O@)] > (VVP > Gx)00)] 
1. EDIPE) > О()] 
2. P(a) > O(a) 
3. (Vx)P(x) 
4. P(a) 
5. Q(a) 
6. (ах)О(х) 


. Justify each step in the following proof sequence of 


EXPE) ^ (Vx)(P(x)  QG)) > AOR) 


оо 


1. (9х)Р(х) 

2. (х)(Р(х) > О(х)) 
3. Р(а) 

4. Р(а) > Oa) 
5. O(a) 

6. (8х)0(х) 

9. Consider the wff 


(Vx)[Gy)PO, y) ^ EVL, V) > C)Gy)PGs y) ^ OG, y)] 
a. Find an interpretation to prove that this wff is not valid. 
b. Find the flaw in the following "proof" of this wff. 
1. (Vx)[(Gy)P@, y) ^ EQ, y)] hyp 
2. (Vx)[P(x, a) ^ Q(x, a)] l,ei 
3. (Vx)Ay)[P@ у) ^ Оф, у)] 2, eg 
10. Consider the wff 


(Vy)(3x)QGs y) > EVVK, y) 


a. Find an interpretation to prove that this wff is not valid. 
b. Find the flaw in the following "proof" of this wff. 


1. (Vy)Gx)OG, у) hyp 
2. (Ax)O(x, y) l, ui 
3. Oa, y) 2, el 
4. (Vy)O(a, у) 3, ug 
5. Ax)\(Vy)O,y) 4, eg 
In Exercises 11—16, prove that each wff is a valid argument. 
11. (Vx)P(x) > (Vx)LP(x) V Q(x)] 
12. (Vx)P(x) ^ (Чх)О(х)  Gx)LP(x) ^ О(х)] 
13. Qx)Gy)PG, y) > Gy)Gx)P(s, y) 
14. (Vx)(Vy)OG. y) > (Уу)бУх)О(х, y) 
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15. (Vx)P(x) ^ (Gx)LP(x)]' > (Ax)O(x) 
16. (Vx)[SQx) > EVP Ey) ^ TONI ^ ECE) ^ SQ)  Gx)GyXC€G) ^ Ty) ^ Рх, y) 
In Exercises 17—30, either prove that the wff is a valid argument or give an interpretation in which it is false. 

17. EDME) ^ B(x)] > (3x)A(x) ^ GVB) 

18. EVR V S(x)] > Gx)R(x) V (Gx)S(x) 

19. (Ax)P(x) ^ Gx)Gy)OG, y) > Gx)Gy)LPG) ^ Оф, y)] 

20. (Vx)[P(x) > О(х)] > VVP)  (Y2)909)] 

21. (Vx)(P(x))' 2 VIP) > QGQ)) 

22. [(Vx)P(x) > VDL] > (Vx)[P@) > О(х)] 

23. (ах)СУу)О(х, y) > VEOK, у) 

24. (Vx)P(x) V (3x)Q(x)  (Vx)LP(x) V Q(x)] 

25. (Vx)[4(x) > B(x)]  [Gx)A(x)  (3x)B(x)] 

26. (Уу)[О(х, у) > P(x)] > EVLE, у) ^ Р(х)] 

27. [Р(х) > @y)OG, y)] > EVP > OG, y)] 

28. (Vx)(P(x) V Q(x)) ^ (Ax) Q(x) > (Ax)P(x) 

29. EDLP) ^ QG)] ^ OIRO) > RO) > EDIPE) ^ RG)] 

30. (Vx) (Vy)[G*Q) ^ 5(х, y) > ООУ] ^ (Әх) В() ^ (Ух)(В(х) > P(x)) ^ VENSE, y) > EVA) 

31. The Greek philosopher Aristotle (384—322 в.с.к.) studied under Plato and tutored Alexander the Great. His 
studies of logic influenced philosophers for hundreds of years. His four "perfect" syllogisms are identified 


by the names given them by medieval scholars. For each, formulate the argument in predicate logic nota- 
tion and then provide a proof. 


a. "Barbara" 
All Mare P 
АП S are M 
Therefore all S are P 
b. “Celarent” 
NoMareP 
АП S are M 
Therefore no S are P 
c. "Darii" 
АП M are P 
Some S are M 
Therefore some S are Р 
d. “Ferio” 
NoMareP 
Some S are M 
Therefore some S are not P 
Using predicate logic, prove that each argument in Exercises 32—42 is valid. Use the predicate symbols 
shown. 


32. Some plants are flowers. All flowers smell sweet. Therefore, some plants smell sweet. P(x), F(x), S(x) 


33. 


34. 


35. 


36. 


37. 


3 


оо 


39. 


40. 


4 


— 


42. 
43. 


44. 
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Every crocodile is bigger than every alligator. Sam is a crocodile. But there is a snake, and Sam isn’t 
bigger than the snake. Therefore, something is not an alligator. C(x), A(x), B(x, y), s, S(x) 


There is an astronomer who is not nearsighted. Everyone who wears glasses is nearsighted. Furthermore, 
everyone either wears glasses or wears contact lenses. Therefore, some astronomer wears contact lenses. 
A(x), N(x), Gx), C(x) 

Every member of the board comes from industry or government. Everyone from government who has a 
law degree is in favor of the motion. John is not from industry, but he does have a law degree. Therefore, 
if John is a member of the board, he 15 in favor of the motion. M(x), I(x), G(x), L(x), F(x), j 

There is some movie star who is richer than everyone. Anyone who is richer than anyone else pays more 
taxes than anyone else does. Therefore, there is a movie star who pays more taxes than anyone. M(x), 
R(x, у), T(x, y) 

Everyone with red hair has freckles. Someone has red hair and big feet. Everybody who doesn’t 


have green eyes doesn’t have big feet. Therefore someone has green eyes and freckles. R(x), F(x), 
B(x), G(x) 


. Cats eat only animals. Something fuzzy exists. Everything that’s fuzzy is a cat. And everything eats some- 


thing. So animals exist. C(x), E(x, у), A(x), F(x) 

Every computer science student works harder than somebody, and everyone who works harder than any 
other person gets less sleep than that person. Maria is a computer science student. Therefore, Maria gets 
less sleep than someone else. C(x), W(x, y), S(x, y), m 


Every ambassador speaks only to diplomats, and some ambassador speaks to someone. Therefore, there is 
a diplomat. A(x), S(x, y), D(x) 


. Some elephants are afraid of all mice. Some mice are small. Therefore there is an elephant that is afraid of 


something small. E(x), M(x), A(x, y), S(x) 
Every farmer owns a cow. No dentist owns a cow. Therefore no dentist is a farmer. F(x), C(x), O(x, y), D(x) 
Prove that 


(VAW > GxIMG)] 


is valid. (Hint: Instead of a proof sequence, use Example 32 and substitute equivalent expressions.) 


The equivalence of Exercise 43 says that if it is false that every element of the domain has property А, 
then some element of the domain fails to have property А, and vice versa. The element that fails to have 
property А is called a counterexample to the assertion that every element has property А. Thus a counter- 
example to the assertion 


(Vx)(x is odd) 


in the domain of integers is the number 10, an even integer. (Of course, there are lots of other counterex- 
amples to this assertion.) Find counterexamples in the domain of integers to the following assertions. (An 
integer x > 1 is prime if the only factors of x are 1 and x.) 


a. (Vx)(x is negative) 

b. (Vx)(x is the sum of even integers) 
c. (Vx)(x is prime — x is odd) 

d. (Vx)(x prime  (—1)' = —1) 

e. (Vx)(x prime — 2* — 1 is prime) 
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SECTION 1.5 | LOGIC PROGRAMMING 


The programming languages with which you are probably familiar, such as C++ 
or Java, are known as procedural languages. Much of the content of a program 
written in a procedural language consists of instructions to carry out the algo- 
rithm the programmer believes will solve the problem at hand. The program- 
mer, therefore, is telling the computer how to solve the problem in a step-by-step 
fashion. 

Some programming languages, rather than being procedural, are declarative 
languages or descriptive languages. A declarative language is based on predi- 
cate logic; such a language comes equipped with its own rules of inference. A 
program written in a declarative language consists only of statements—actually 
predicate wffs—that are declared as hypotheses. Execution of a declarative pro- 
gram allows the user to pose queries, asking for information about possible 
conclusions that can be derived from the hypotheses. After obtaining the user’s 
query, the language turns on its “inference engine” and applies its rules of in- 
ference to the hypotheses to see which conclusions fit the user’s query. The 
program, remember, contains only the hypotheses, not any explicit instructions 
as to what steps to perform in what order. The inference engine of the language 
acts behind the scenes, so to speak, to construct a proof sequence. It is the me- 
chanical nature of applying inference rules that makes this “automated theorem 
proving” possible. 


Prolog 


The programming language Prolog, which stands for PROgramming in LOGic, 
is a declarative programming language. The set of declarations that constitutes 
a Prolog program is also known as a Prolog database. Items in a Prolog data- 
base take on one of two forms, known in Prolog as facts and rules. (Prolog rules, 
however, are just another kind of fact and should not be confused with a rule of 
inference.) 

Prolog facts allow predicates to be defined by stating which items in some 
domain of interpretation satisfy the predicates. As an example, suppose we wish to 
create a Prolog program that describes food chains in a given ecological region. We 
might begin with a binary predicate eat. We then describe the predicate by giving 
the pairs of elements in the domain that make eat true. Thus we might have the facts 


eat(bear, fish) 
eat(bear, fox) 
eat(deer, grass) 


in our database. (The exact details of Prolog statements vary from one Prolog 
implementation to another, so in this section we are only giving the spirit of the 
language by using a Prolog-like pseudocode.) Here “bear,” “fish,” “fox,” "deer," 
and "grass" are constants because they represent specific elements in the domain. 
Because the domain itself is never specified except by describing predicates, at 
this point we may take the domain to consist of “bear,” “fish,” “fox,” “deer,” and 
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“grass.” It is up to the user to maintain a consistent understanding and use of the 
predicates in a Prolog program. Thus 


eat(bear, fish) 


can be used either to represent the fact that bears eat fish or the fact that fish eat 
bears! 

We impose the convention that eat(X, Y) means “X eats Y" We could add 
descriptions of two unary predicates, animal and plant, to the database by adding 
the facts 


animal(bear) 
animal(fish) 
animal(fox) 
animal(deer) 
plant(grass) 


Armed with this Prolog program (database), we can pose some simple queries. 


| EXAMPLE 97 | The query 


?animal(bear) 


merely asks if the fact animal(bear) is in the database. Because this fact is in the 
database, Prolog would respond to the query by answering yes. (This is a one-step 
proof sequence—no rules of inference are required). Further dialogue with Prolog 
could include 


?eat(deer, grass) 

yes 

?eat(bear, rabbit) 

no ө 


Queries may include variables, as shown in the next example. 


| EXAMPLE 38 | КЕШ The query 


?eat(bear, X) 
produces 


fish 
fox 


as a response. Prolog has answered the query by searching the database for all 
facts that match the pattern eat(bear, X), where X is a variable. The answer “fish” 
is given first because the rules are searched in order from top to bottom. ө 
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Queries may contain the logical connectives and, or, and not. 


PRACTICE 28 | Given the database 


eat(bear, fish) 
eat(bear, fox) 
eat(deer, grass) 
animal(bear) 
animal(fish) 
animal(fox) 
animal(deer) 
plant(grass) 


what will be Prolog's response to the query 


?eat(X, Y) and plant(Y) " 


The second type of item in a Prolog database is a Prolog rule. A rule is a de- 
scription of a predicate by means of an implication (the implication arrow in the rule 
goes from right to left). For example, we might use a rule to define a predicate of prey: 


prey(X) <= eat(Y, X) and animal(X) 


This statement says that X 15 a prey if it is an animal that is eaten. If we add this 
rule to our database, then in response to the query 


?prey(X) 
we would get 


fish 
fox 


Horn Clauses and Resolution 


How do Prolog facts and rules relate to more formal predicate logic? We can de- 
scribe the facts in our database as wffs: 


E(b, fi) 
E(b, fo) 
E(d, g) 
A(b) 

A( fi) 
A( fo) 
A(d) 
P(g) 


and the rule by the wff 


Е(у, x) ^ A(x)  Pr(x) 


REMINDER 


Prolog’s resolution rule 
looks for a term and its 
negation to infer one Horn 
clause from two. 
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Universal quantifiers are not explicitly part of the rule as it appears in a Prolog 
program, but Prolog treats the rule as being universally quantified 


(Vy) Vx)LEGy, x) ^ A(x) 2 Pr(x)] 


and repeatedly uses universal instantiation to strip off the universal quantifiers 
and allow the variables to assume in turn each value of the domain. 

Both facts and rules are examples of Horn clauses. A Horn clause is a wff 
composed of predicates or the negations of predicates (with either variables or 
constants as arguments) joined by disjunctions, where at most one predicate is 
unnegated. Thus the fact 


E(d, g) 


is an example of a Horn clause because it consists of a single unnegated predicate. 
The wff 


[E(y, х)] V AOI V Ри) 


is an example of a Horn clause because it consists of three predicates joined by 
disjunction where only Pr(x) is unnegated. By De Morgan's law, it is equivalent to 


[E(y, x) ^ АО) V Рх) 
which in turn is equivalent to 
E(y, x) ^ A(x) > Pr(x) 
and therefore represents the rule in our Prolog program. 

The single rule of inference used by Prolog is called resolution. Two Horn 
clauses in a Prolog database are resolved into a single new Horn clause if one 
contains an unnegated predicate that matches a negated predicate in the other 
clause. The new clause eliminates the matching term and is then available to use 


in answering the query. For example, 


A(a) 
[4(a)]' V B(b) 


resolves to B(b). This says that from 
A(a), [4(a)]' V В(Ь) 
which is equivalent to 
Аа), A(a) > B(b) 
Prolog infers 


B(b) 
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which is just an application of modus ponens. Therefore Prolog’s rule of inference 
includes modus ponens as a special case. 

In applying the resolution rule, variables are considered to “match” any 
constant symbol. (This is the repeated application of universal instantiation.) In 
any resulting new clause, the variables are replaced with their associated con- 
stants in a consistent manner. Thus in response to the query ?prey(X), Prolog 
searches the database for a rule with the desired predicate Pr(x) as the conse- 
quent. It finds 


[E(y, x)]' V AOT V Pr(x) 
It then proceeds through the database looking for other clauses that can be re- 


solved with this clause. The first such clause is the fact E(b, fi). These two clauses 
resolve into 


[AC A’ V PrCfi) 


(Note that the constant fi has replaced x everywhere.) Using this new clause, it 
can be resolved with the fact A( fi) to conclude Pr( fi). Having reached all conclu- 
sions possible from resolution with the fact E(b, fi), Prolog backtracks to search 
for another clause to resolve with the rule clause; this time around it would find 


E(b, fo). 
As a more complex example of resolution, suppose we add the rule 
hunted(X) <= prey(X) 
to the database. This rule in symbolic form is 
[Pr(x)] > H(x) 
or, as a Horn clause, 
[Pr(x)]' V H(x) 
It resolves with the rule defining prey 
[E(y, х)]' V AOT V Pr(x) 
to give the new rule 
[E(y, х) V AOT V HQ) 
The query 
?hunted(X) 
will use this new rule to conclude 


fish 
fox 
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| EXAMPLE 39 | Suppose that a Prolog database contains the following entries: 


eat(bear, fish) 
eat(fish, littlefish) 
eat(littlefish, algae) 
eat(raccoon, fish) 
eat(bear, raccoon) 
eat(bear, fox) 
eat(fox, rabbit) 
eat(rabbit, grass) 
eat(bear, deer) 
eat(deer, grass) 
eat(wildcat, deer) 


animal(bear) 
animal(fish) 
animal(littlefish) 
animal(raccoon) 
animal(fox) 
animal(rabbit) 
animal(deer) 
animal(wildcat) 


plant(grass) 
plant(algae) 
prey(X) <= eat(Y, X) and animal(X) 


Then the following dialog with Prolog could take place: 


?animal(rabbit) 
yes 


?eat(wildcat, grass) 
no 


?eat(X, fish) 
bear 
raccoon 


?eat(X, Y) and plant(Y) 
littlefish algae 


rabbit grass 
deer grass 
?prey(X) 

fish 

littlefish 


fish 
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raccoon 
fox 
rabbit 
deer 
deer 


Note that fish is listed twice as satisfying the last query because fish are eaten by 
bear (fact 1) and by raccoon (fact 3). Similarly, deer are eaten by both bear and 
wildcat. e 


PRACTICE 29 


a. Formulate a Prolog rule that defines the predicate predator. 
b. Adding this rule to the database of Example 39, what would be the response to the query 


?predator(X ) ш 


Recursion 


Prolog rules are implications. Their antecedents (remember that these will appear 
on the right side of the rules) may depend on facts, as in 


prey(X) <= eat(Y, X) and animal(X) 
or on other rules, as in 
hunted(X) <= prey(X) 
The antecedent of a rule may also depend on that rule itself, in which case the rule 
is defined in terms of itself. A definition in which the item being defined is itself 
part of the definition is called a recursive definition. 

Asan example, suppose we wish to use the ecology database of Example 39 to 
study food chains. We can then define a binary relation infoodchain(X, Y), mean- 
ing “У is in X’s food chain.” This, in turn, means one of two things: 

1. X eats Y directly 
ог 

2. X eats something that eats something that eats something ... that eats Y. 
Case 2 can be rewritten as follows: 


2'. X eats Z and Y is in Z's food chain. 


Case 1 is simple to test from our existing facts, but without (2^), infoodchain 
means nothing different from eat. On the other hand, (2’) without (1) sends us 
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down an infinite path of something eating something eating something and so on, 
with nothing telling us when to stop. Recursive definitions always need a stopping 
point that consists of specific information. 

The Prolog rule for infoodchain incorporates (1) and (2’): 


infoodchain(X, Y) <= eat(X, Y) 
infoodchain(X, Y) <= eat(X, Z) and infoodchain(Z, Y) 
It is a recursive rule because it defines the predicate infoodchain in terms of 
infoodchain. 
A recursive rule is necessary when the predicate being described is passed on 
from one object to the next. The predicate infoodchain has this property: 


infoodchain(X, Y) ^ infoodchain(Y, Z) — infoodchain(X, Z) 


| EXAMPLE 40 | 7 After the infoodchain rule is added to the database of Example 39, the following 


query is made: 
?infoodchain(bear, Y) 


The response follows (numbers are added for reference purposes): 


1. fish 7. fish 

2. raccoon 8. littlefish 
3. fox 9. algae 
4. deer 10. rabbit 
5. littlefish 11. grass 
6. algae 12. grass 


Prolog applies the simple case of 
infoodchain(bear, Y) <= eat(bear, Y) 


first, obtaining answers 1 through 4 directly from the facts eat(bear, fish), eat(bear, 
raccoon), and so on. Moving to the recursive case, 


infoodchain(bear, Y) <= eat(bear, Z) and infoodchain(Z, Y) 


a match of eat(bear, Z) occurs with Z equal to “fish.” Prolog then looks for all 
solutions to the relation infoodchain(fish, Y). Using first the simple case of infood- 
chain, a match occurs with the fact eat(fish, littlefish). This results in response 5, 
littlefish. There are no other facts of the form eat(fish, Y), so the next thing to try 
is the recursive case of infoodchain(fish, Y): 


infoodchain(fish, Y) <= eat(fish, Z) and infoodchain(Z, Y) 


A match of eat(fish, Z) occurs with Z equal to “littlefish.” Prolog then looks for 
all solutions to the relation infoodchain(littlefish, Y). Using the simple case of 
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infoodchain, a match occurs with the fact eat(littlefish, algae). This results in re- 
sponse 6, algae. There are no other facts of the form eat(littlefish, Y), so the next 
thing to try is the recursive case of infoodchain(littlefish, Y): 


infoodchain(littlefish, Y) <= eat(littlefish, Z) and infoodchain(Z, Y) 


A match of eat(littlefish, Z) occurs with Z equal to “algae.” Prolog then looks for 
all solutions to the relation infoodchain(algae, Y). A search of the entire database 
reveals no facts of the form eat(algae, Y) (or eat(algae, Z)), so neither the simple 
case nor the recursive case of infoodchain(algae, Y) can be pursued further. 


bear fish 
bear raccoon 
bear fox 
bear deer 
bear fish 
fish littlefish 
Figure 1.2 littlefish algae 


Figure 1.2 shows the situation at this point. Prolog has reached a dead-end with 
infoodchain(algae, Y) and will backtrack up the path. Because there are no other 
facts of the form eat(littlefish, Z), the search for solutions to infoodchain(littlefish, 
Y) terminates. Then, because there are no other facts of the form eat(fish, Z), the 
search for solutions to infoodchain(fish, Y) terminates. Backing up still further, 
there is another match of eat(bear, Z) with Z equal to “raccoon” that will generate 
another search path. ө 


In Example 40, once Prolog began to investigate infoodchain(fish, Y), all que- 
ry answers that could be obtained from exploring this path (responses 5 and 6) 
were generated before other answers (responses 7—12). Exploring as far as possible 
down a given path and then backtracking up that path before exploring other paths 
is called a depth-first search strategy. 


PRACTICE 30 | Trace the execution ofthe Prolog program of Example 40 and explain why responses 7-12 


occur. 


Expert Systems 


Many interesting applications programs have been developed, in Prolog and sim- 
ilar logic programming languages, that gather a database of facts and rules about 
some domain and then use the database to draw conclusions. Such programs are 
known as expert systems, knowledge-based systems, or rule-based systems. 
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The database in an expert system attempts to capture the knowledge (“elicit 
the expertise”) of a human expert in a particular field, including both the facts 
known to the expert and the expert’s reasoning path in reaching conclusions 
from those facts. The completed expert system not only simulates the human 
expert’s actions but can be questioned to reveal why it made certain choices and 
not others. 

Expert systems have been built that simulate a medical specialist’s diagnosis 
from a patient’s symptoms, a factory manager’s decisions regarding valve control 
in a chemical plant based on sensor readings, the decisions of a fashion buyer for a 
retail store based on market research, the choices made by a consultant specifying 
a computer system configuration based on customer needs, and many more. The 
challenging part of building an expert system lies in extracting all pertinent facts 
and rules from the human expert. 


SECTION 1.5 REVIEW 


TECHNIQUES MAIN IDEA 

D Formulate Prolog-like facts and rules. * А declarative language incorporates predicate wffs 

QJ Formulate Prolog-like queries. and rules of inference to draw conclusions from 

* Determine the answer(s) to a query using a Prolog hypotheses. The elements of such a language are 
database. based on predicate logic rather than instructions 


that carry out an algorithm. 


EXERCISES 1.5 


Exercises 1—8 refer to the database of Example 39; find the results of the query in each case. 


© 00-10 tn BW мю н 


к= 
© 


12. 


. ?animal(wildcat) 

. ?plant(raccoon) 

. 2eat(bear, littlefish) 

. 2eat(fox, rabbit) 

. ?eat(raccoon, X) 

. Deat(X, grass) 

. 2eat(bear, X) and eat(X, rabbit) 

. ?prey(X) and not eat(fox, X) 

. Formulate a Prolog rule that defines “herbivore” to add to the database of Example 39. 

. If the rule of Exercise 9 is included in the database of Example 39, what is the response to the query 


?herbivore(X) 


. After infoodchain 1s added to the database of Example 39, add the facts eat(wolf, fox) and eat(wolf, deer). 


What is the result of the query 
?eat(wolf, X) and not eat(X, grass) 
After the modifications in Exercise 11, what 1s the result of the query 


?infoodchain(wolf, X) 


13. 


14. 


15. 


16. 


17. 


18. 
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A Prolog database contains the following, where boss(X, Y) means “Х is Y's boss" and supervisor(X, Y) 
means “Х is Y's supervisor”: 


boss(mike, joan) 

boss(judith, mike) 

boss(anita, judith) 

boss( judith, kim) 

boss(kim, enrique) 

boss(anita, sam) 

boss(enrique, jefferson) 

boss(mike, hamal) 

supervisor(X, Y) <= boss(X, Y) 

supervisor(X, Y) <= boss(X, Z) and supervisor (Z, Y) 

Find the results of the following queries: 

a. ?boss(X, sam) 

b. ?boss(judith, X) 

c. ?supervisor(anita, X) 

Using the Prolog database from Exercise 13, what are the results of the following queries? 
a. ?boss(hamal, X) 

b. ?supervisor(X, kim) 

Suppose a Prolog database exists that gives information about authors and the books they have written. 
Books are classified as fiction, biography, or reference. 

a. Write a query to ask whether Mark Twain wrote Hound of the Baskervilles. 
b. Write a query to find all books written by William Faulkner. 

c. Formulate a rule to define nonfiction authors. 

d. Write a query to find all nonfiction authors. 


Suppose a Prolog database exists that gives information about states and capital cities. Some cities are big, 
others small. Some states are eastern, others are western. 


a. Write a query to find all the small capital cities. 

b. Write a query to find all the states with small capital cities. 

c. Write a query to find all the eastern states with big capital cities. 

d. Formulate a rule to define cosmopolitan cities as big capitals of western states. 
e. Write a query to find all the cosmopolitan cities. 


Suppose a Prolog database exists that gives information about a family. Predicates of male, female, and 
parentof are included. 


a. Formulate a rule to define fatherof. 
b. Formulate a rule to define daughterof. 
c. Formulate a recursive rule to define ancestorof. 


Suppose a Prolog database exists that gives information about the parts in an automobile engine. Predicates 
of big, small, and partof are included. 


a. Write a query to find all small items that are part of other items. 
b. Write a query to find all big items that have small subitems. 


c. Formulate a recursive rule to define componentof. 
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19. Suppose a Prolog database exists that gives information about the ingredients in the menu items of a res- 
taurant. Predicates of dry, liquid, perishable, and ingredientof are included. 


a. Write a query to find all the dry ingredients of other ingredients. 
b. Write a query to find all perishable ingredients that contain liquid subingredients. 
c. Formulate a recursive rule to define foundin. 


20. Suppose a Prolog database exists that gives information about flights for AA (Always Airborne) airline. 
Predicates of city and flight are included. Here flight(X, Y) means that AA has a direct (nonstop) flight 
from city X to city Y. 


a. Write a query to find all cities you can get to on a direct flight from Indianapolis. 
b. Write a query to find all cities that have direct flights to San Francisco. 


c. Formulate a recursive rule to define route where route(X, Y) means that you can get from city X to city 
Y using AA but it might not be a direct flight. 


Exercises 21—22 refer to a “Toy Prolog interpreter” that can be found online at http://www.csse.monash.edu. 
au/~lloyd/tildeLogic/Prolog.toy. The syntax used in this section matches that of this online version except that 
in the online version each statement and query must end with a period. Here is a shortened version of Example 
39 as entered into the code window, followed by the response to the query: 


eat (bear, fish). 

eat (fish, littlefish). 
eat(littlefish, algae). 
eat (raccoon, fish). 

eat (bear, raccoon). 

eat (bear, fox). 

animal (bear). 

(fish). 
(littlefish). 
animal (raccoon). 

animal (fox). 

prey(X) <= eat(Y, X) and animal (X). 
?ргеу (X). 


animal 
animal 


--- running --- 


prey(fish) yes 

prey (littlefish) yes 
prey(fish) yes 

prey (raccoon) yes 
prey (fox) yes 


In addition, you can look at and run the online sample program to be sure you understand the syntax rules. 
21. Using the online Toy Prolog program, enter the Prolog database of Exercise 13. Run the queries from 
Exercises 13 and 14 and compare the results with your previous answers. 


22. Using the online Toy Prolog program, create a database for Exercise 20. Run the queries from Exercise 
20. Also run a query using the route predicate. 


SECTION 1.6 | PROOF OF CORRECTNESS 


As our society becomes ever more dependent on computers, it is more and 
more important that the programs computers run are reliable and error-free. 
Program verification attempts to ensure that a computer program is correct. 
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“Correctness” has a narrower definition here than in everyday usage. A pro- 
gram is correct if it behaves in accordance with its specifications. However, this 
does not necessarily mean that the program solves the problem that it was in- 
tended to solve; the program’s specifications may be at odds with or not address 
all aspects of a client’s requirements. Program validation, which we won’t 
discuss further, attempts to ensure that the program indeed meets the client’s 
original requirements. In a large program development project “program V & 
V” or “software quality assurance” is considered so important that a group of 
people separate from the programmers is often designated to carry out the as- 
sociated tasks. 

Program verification may be approached both through program testing and 
through proof of correctness. Program testing seeks to show that particular 
input values produce acceptable output values. Program testing is a major part 
of any software development effort, but it is well-known folklore that “testing 
can prove the presence of errors but never their absence.” If a test run under a 
certain set of conditions with a certain set of input data reveals a “bug” in the 
code, then the bug can be corrected. But except for rather simple programs, 
multiple tests that reveal no bugs do not guarantee that the code is bug-free, 
that there is not some error lurking in the code waiting to strike under the right 
circumstances. 

As a complement to testing, computer scientists have developed a more math- 
ematical approach to "prove" that a program is correct. Proof of correctness uses 
the techniques of a formal logic system to prove that if the input variables satisfy 
certain specified predicates or properties, the output variables produced by ex- 
ecuting the program satisfy other specified properties. 

To distinguish between proof of correctness and program testing, consider 
a program to compute the length c of the hypotenuse of a right triangle, given 
positive values a and b for the lengths of the legs. Proving the program correct 
would establish that whenever a and b satisfy the predicates a > 0 and b > 0, 
then after the program is executed, the predicate а? + b? = c? is satisfied. Test- 
ing such a program would require taking various specific values for a and b, 
computing the resulting c, and checking that a? + Р? equals c? in each case. 
However, only representative values for a and b can be tested, not all possible 
values. 

Again, testing and proof of correctness are complementary aspects of pro- 
gram verification. All programs undergo program testing; they may or may not 
undergo proof of correctness as well. Proof of correctness is labor-intensive, hence 
expensive; it generally is applied only to small and critical sections of code rather 
than to the entire program. 


Assertions 


Describing proof of correctness more formally, let us denote by X an arbitrary 
collection of input values to some program or program segment P. The actions 
of P transform X into a corresponding group of output values Y; the notation 
Y = Р(Х) suggests that the Y values depend on the X values through the actions 
of program P. 

A predicate Q(X) describes conditions that the input values are supposed to 
satisfy. For example, if a program is supposed to find the square root of a posi- 
tive number, then X consists of one input value, x, and Q(x) might be “x > 0.” 
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A predicate R describes conditions that the output values are supposed to satisfy. 
These conditions will often involve the input values as well, so R has the form 
R(X, Y) or R[X, P(X)]. In our square root case, if y is the single output value, then 
y is supposed to be the square root of x, so R(x, y) would Бе “y? = x.” Program P 
Is correct if the implication 


(VX)(Q(X) > RIX, PQX]) (1) 


is valid. In other words, whenever О is true about the input values, R should be 
true about the input and output values. For the square root case, (1) is 


(Ух)(х > 0 — PHP = x) 


The implication (1) is standard predicate wff notation, but the traditional program 
correctness notation for (1) is 


ОРАК) (2) 


{O}P{R} is called a Hoare triple, named for the British computer scientist 
Anthony Hoare. Condition Q is called the precondition for program P, and condi- 
tion R is the postcondition. In the Hoare notation, the universal quantifier does 
not explicitly appear; it is understood. 

Rather than simply having an initial predicate and a final predicate, a program 
or program segment is broken down into individual statements s;, with predicates 
inserted between statements as well as at the beginning and end. These predicates 
are also called assertions because they assert what is supposed to be true about 
the program variables at that point in the program. Thus we have 


where О, R,, Ro, ... , Р, = R are assertions. The intermediate assertions are often 
obtained by working backward from the output assertion R. 
P is provably correct if each of the following implications holds: 


{Q3 Ку} 
{Ris {R2} 
{Ro} so {Rs} 


{Ry Sp-1 {R} 
A proof of correctness for P consists of producing this sequence of valid 


implications, that is, producing a proof sequence of predicate wffs. Some new 
rules of inference can be used, based on the nature of the program statement s;. 
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Assignment Rule 


Suppose that statement s; is an assignment statement of the form x = e, that is, the 
variable x takes on the value of e, where e is some expression. The Hoare triple to 
prove correctness of this one statement has the form 

{Ri} x = e {Rint 


For this triple to be valid, the assertions R, and А; ‚у must be related in a particular way. 


| EXAMPLE% | Consider the following assignment statement together with the given precondition 


and postcondition: 


(e | > OF 
ape | 
{x > 0} 


For every x, if x — 1 > 0 before the statement is executed (note that this says that 
x > 1), then after the value of x is reduced by 1, it will be the case that x > 0. 
Therefore, 


oec Il 29 (0 sp = 36 — JL fe > O} 


is valid. ө 


In Example 41, we just reasoned our way through the validity of the wffrepresented 
by the Hoare triple. The point of predicate logic is to allow us to determine validity 
in a more mechanical fashion by the application of rules of inference. (After all, 
we don’t want to just “reason our way through” the entire program to convince 
ourselves of its correctness; the programmer already did that when the program 
was written!) 

The appropriate rule of inference for assignment statements is the assignment 
rule, given in Table 1.18. It says that if the precondition and postcondition are 
appropriately related, the Hoare triple can be inserted at any time in a proof sequence 
without having to be inferred from something earlier in the proof sequence. This 
makes the Hoare triple for an assignment statement akin to a hypothesis in our 
previous proofs. And what is the relationship? In the postcondition, locate all 
instances of the variable to which an assignment is being made in the assignment 
statement right above the postcondition. For each of those instances, substitute the 
expression being assigned. The result will be the precondition. 


TABLE 1.18 


From Can Derive Name of Rule Restrictions on Use 


{R}s AR} assignment 1. s; has the form x = e. 
2. FR; is Hj, with e substituted 
everywhere for x. 
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| EXAMPLE 42 | For the case of Example 41, 


be — JL 29 Of 
х=х= 1 
{x > 0} 
the triple 
[зе — Il SO} xe = ae — Il 5:29 09 
is valid by the assignment rule. The postcondition is 
a5 = (U 
Substituting x — 1 for x throughout the postcondition results in 
= >й e xol 


which is the precondition. Here we didn't have to think at all; we just checked that 
the assignment rule had been followed. e 


Certainly Example 41 seems easier than Example 42, and for such a trivial 
case you may be tempted to skip use of the assignment inference rule and just 
talk your way through the code. Resist this temptation. For one thing, real-world 
usage isn't this trivial. But more to the point, just as in our previous formal logic 
systems, you want to rely on the rules of inference instead of on some possibly 
flawed thought process. 


PRACTICE 31 | According to the assignment rule, what should be the precondition in the following 


program segment? 
{precondition} 
ED 
{x= у} п 


Because the assignment rule tells us what a precondition should look like 
based on what a postcondition looks like, a proof of correctness often begins with 
the final desired postcondition and works its way back up through what the earlier 
assertions should look like according to the assignment rule. Once it has been 
determined what the topmost assertion must be, a check is done to see that this 
assertion is really true. 


| EXAMPLE 43 | Verify the correctness of the following program segment to exchange the values of 


x and y: 


REMINDER 


To use the assignment 
rule, work from the bottom 
to the top. 


temp — x 
х=) 
у = temp 
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At the beginning of this program segment, x and y have certain values. Thus 
we may express the actual precondition as x = a and y = Б. The desired postcondi- 
tion is then x = b and y = a. Using the assignment rule, we can work backward 
from the postcondition to find the earlier assertions (read the following from the 
bottom to the top). 


{y = b,x =a} 
temp = x 

{у = b, temp = a} 
х= у 

{x = b, temp = a} 
y = temp 

{x = b, y = a} 


The first assertion agrees with the precondition; the assignment rule, applied 
repeatedly, assures us that the program segment is correct. ө 


PRACTICE 32 | Verify the correctness of the following program segment with the precondition and post- 
condition shown: 


Sometimes the necessary precondition is trivially true, as shown in the next 
example. 


ЭҮ Verify the correctness of the following program segment to compute y = x — 4. 


у= 
Dd 


Here the desired postcondition is y — x — 4. Using the assignment rule to work 
backward from the postcondition, we get (again, read bottom to top) 


{х-4=х—4}| 
ipo 

fy-4=x-4} 
D 

DEOS 


The precondition is always true; therefore, by the assignment rule, each successive 
assertion, including the postcondition, is true. e 
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Conditional Rule 


A conditional statement is a program statement of the form 


if condition B then 


When this statement is executed, a condition B that is either true or false is 
evaluated. If В 15 true, program segment P, is executed, but if B is false, program 
segment P» is executed. 

A conditional rule of inference, shown in Table 1.19, determines when a 
Hoare triple 


1Qisith) 


can be inserted in a proof sequence if s; is a conditional statement. The Hoare 
triple is inferred from two other Hoare triples. One of these says that if О is true 
and В is true and program segment P, is executed, then R holds; the other says that 
if О is true and В is false and program segment P; is executed, then R holds. This 
simply says that each branch of the conditional statement must be proved correct. 


TABLE 1.19 
From Can Derive | Name of Rule Restrictions on Use 
{Q A B} P, {А}, | {Q}s{R } conditional s; has the form 
(Q ^ B') P; {В} if condition B then 
Р, 
else 
P, 
end if 


| EXAMPLE 45 | Verify the correctness of the following program segment with the precondition and 


postcondition shown. 


w= э} 
if n >= 10 then 
y= 100 
else 
у=п+1 
end if 
iy = Gy 


Here the precondition is n = 5, and the condition B to be evaluated is n >= 10. In 
order to apply the conditional rule, we must first prove that 


{0 A BS Р, {R} 
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or 
ín = 5andn = 10! y = 100 {у = 6} 


holds. Remember that this stands for an implication, which will be true because its 
antecedent, n = 5 and n = 10, is false. We must also show that 


{Q ^ B’} Р, {R} 
or 
{п = 5andn < 10} y=n+1 {у = 6} 


holds. Working back from the postcondition, using the assignment rule, we get 


{n+ 1 = боги = 5} 
у= п+1 
Que 


Thus 
{п=5}у=п+1{у=6б} 
is true by the assignment rule and therefore 
{п = 5 апал < 10} у= п + 1 {y=6} 


is also true because the condition и < 10 adds nothing new to the assertion. The 
conditional rule allows us to conclude that the program segment is correct. e 


PRACTICE 33 | Verify the correctness of the following program segment with the precondition and 
postcondition shown. 


ip 4} 
if x < 5 then 
y=x-1 
else 
у=7 
end if 
15:593 = 


IW Est) Verify the correctness of the following program segment to compute тах(х, y), the 
maximum of two distinct values x and y. 


{x т у} 
if x >= y then 
max — x 
else 
max — y 


end if 
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The desired postcondition reflects the definition of the maximum, (x > y and 
max = x) or (x € y and max = y). The two implications to prove are 


{x Æ y and x = у} max = x {(х > y and max = x) or (x € y and max = y)} 
and 
{x Æ y and x € y} max = y {(х > y and max = x) or (x € y and max = y)} 


Using the assignment rule on the first case (substituting x for max in the postcondi- 
tion) would give the precondition 


(х>2 уЛх = х) У (х <уЛх = у) 
Since the second disjunct is always false, this is equivalent to 
(x >yAx =x) 
which in turn is equivalent to 
52 25g or xAyandx=y 


The second implication is proved similarly. ө 


In Chapter 2, we will see how to verify correctness for a loop statement, 
where a section of code can be repeated many times. 

As we have seen, proof of correctness involves a lot of detailed work. It is a 
difficult tool to apply to large programs that already exist. It is generally easier 
to prove correctness while the program is being developed. Indeed, the list of 
assertions from beginning to end specifies the intended behavior of the program 
and can be used early in its design. In addition, the assertions serve as valuable 
documentation after the program is complete. 
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TECHNIQUES MAIN IDEA 


QJ Verify the correctness of a program segment that ° A formal system of rules of inference can be used 
includes assignment statements. to prove the correctness of program segments. 

(00 Verify the correctness of a program segment that 
includes conditional statements. 


EXERCISES 1.6 


In the following exercises, * denotes multiplication. 


1. According to the assignment rule, what is the precondition in the following program segment? 
{precondition} 
х=х+1 
{х=у—1} 


— 
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. According to the assignment rule, what is the precondition in the following program segment? 
{precondition} 

x=2*x 
iy 
. According to the assignment rule, what is the precondition in the following program segment? 
{precondition} 

х=3*х—1 
х=2*у—1) 
. According to the assignment rule, what is the precondition їп the following program segment? 
{precondition} 

у= 3х +7 
{у= х +1} 
. Verify the correctness of ће following program segment with the precondition and postcondition shown. 
{x = 1} 

y=x+3 

y=2*y 
ty = 8} 
. Verify the correctness of the following program segment with the precondition and postcondition shown. 
{x > 0} 

у=х+?2 

2=у + 1 
{z > 3} 
. Verify the correctness of the following program segment with the precondition and postcondition shown. 
{x = 0) 

z=2*x+1 

y=z-1 
ty = 0} 
. Verify the correctness of the following program segment with the precondition and postcondition shown. 
{x < 8} 

z=x-1 

у=2- 5 
ty <2} 


. Verify the correctness of the following program segment to compute y = x(x — 1). 


у=х- 1 
у=х*у 
. Verify the correctness of the following program segment to compute у = 2x + 1. 
у=х 
yH pry 
у=у+1 
. Verify the correctness of the following program segment with the precondition апа postcondition shown. 
{у = 0} 
if y < 5 then 
у=у+1 
else 
у=з 
end if 
iy = 1} 
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12. Verify the correctness of the following program segment with the precondition and postcondition shown. 
{x = 7} 
if x <= 0 then 
vax 
else 
у=2*х 
end if 
=} 
13. Verify the correctness of the following program segment with the precondition and postcondition shown. 
{x # 0} 
if x > 0 then 
у=2*х 
else 
y=(-2)*x 
end if 
ty > 0} 
14. Verify the correctness of the following program segment to compute min(x, у), the minimum of two dis- 
tinct values x and y. 


{x # у} 
if x <= y then 
min — x 
else 
min — y 
end if 


15. Verify the correctness of the following program segment to compute |x|, the absolute value of x, for a 
nonzero number x. 


{x # 0} 
ifx >= 0 then 
abs = x 
else 
abs = —x 
end if 
16. Verify the correctness of the following program segment with the assertions shown. 
{2 = 3} 
x=z+1 
y=x+2 
{у = 6} 
if y > 0 then 
2=у +1 
else 
z=2*y 
end if 


{z= 7} 
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CHAPTER 1 REVIEW 


TERMINOLOGY 


algorithm (p. 12) 

antecedent (p. 3) 

assertion (p. 86) 

assignment rule (p. 87) 

binary connective (p. 3) 

binary predicate (p. 40) 

complete formal logic system 
(p. 27) 

conclusion (p. 25) 

conditional rule of inference 
(p. 90) 

conditional statement (p. 90) 

conjunct (p. 2) 

conjunction (p. 2) 

consequent (p. 3) 

contradiction (p. 8) 

correct formal logic system 
(p. 27) 

correct program (p. 85) 

De Morgan's laws (p. 10) 

declarative language (p. 73) 

depth-first search (p. 81) 

derivation rule (p. 27) 

descriptive language (p. 73) 

disjunct (p. 3) 

disjunction (p. 3) 

domain (p. 41) 

dual of an equivalence (p. 9) 


SELF TEST 


equivalence (p. 3) 
equivalence rule (p. 28) 
equivalent wffs (p. 8) 
existential generalization (p. 62) 
existential instantiation (p. 60) 
existential quantifier (p. 40) 
expert system (p. 81) 

free variable (p. 42) 

Hoare triple (p. 86) 

Horn clause (p. 76) 
hypothesis (p. 25) 

implication (p. 3) 

inference rule (p. 29) 
interpretation (p. 41) 
knowledge-based system (p. 81) 
logical connective (p. 2) 

main connective (p. 6) 

n-ary predicate (p. 40) 
negation (p. 4) 

postcondition (p. 86) 
precondition (p. 86) 

predicate (p. 39) 

predicate logic (p. 58) 
predicate wff (p. 41) 
procedural language (p. 73) 
program testing (p. 85) 
program validation (p. 85) 
program verification (p. 84) 


Prolog database (p. 73) 
Prolog fact (p. 73) 

Prolog rule (p. 75) 

proof of correctness (p. 85) 
proof sequence (p. 27) 
proposition (p. 2) 
propositional calculus (p. 25) 
propositional logic (p. 25) 
propositional wff (p. 25) 
pseudocode (p. 12) 

recursive definition (p. 79) 
resolution (p. 76) 

rule-based system (p. 81) 
scope (p. 41) 

statement (p. 2) 

statement letter (p. 2) 
statement logic (p. 25) 
tautology (p. 8) 

ternary predicate (p. 40) 
unary connective (p. 3) 
unary predicate (p. 40) 
universal generalization (p. 61) 
universal instantiation (p. 59) 
universal quantifier (p. 39) 
valid argument (p. 26, 58) 
valid predicate wff (p. 48) 
well-formed formula (wff) (p. 6) 


Answer the following true-false questions without looking back in the chapter. 


Section 1.1 


1. A contradiction is any propositional wff that is not 


a tautology. 


Section 1.2 


1. An equivalence rule allows one wff to be substi- 


tuted for another in a proof sequence. 


2. 


3. 


The disjunction of any propositional wff with a tau- 
tology has the truth value true. 

Algorithm Z7autologyTest determines whether any 
propositional wff is a tautology. 


. Equivalent propositional wffs have the same truth 


values for every truth value assignment to the 
components. 


. One of De Morgan’s laws states that the negation of 


a disjunction is the disjunction of the negations (of 
the disjuncts). 


2. 


If a propositional wff can be derived using modus 
ponens, then its negation can be derived using mo- 
dus tollens. 


. Propositional logic is complete because every tau- 


tology is provable. 


. A valid argument is one in which the conclusion is 


always true. 


. The deduction method applies when the conclusion 


is an implication. 
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Section 1.3 


1. 


A predicate wff that begins with a universal 
quantifier is universally true, that is, true in all in- 
terpretations. 


. In the predicate wff (Vx)P(x, y), y is a free variable. 
. Ап existential quantifier is usually found with the 


conjunction connective. 


. The domain of an interpretation consists of the 


values for which the predicate wff defined on that 
interpretation is true. 


. A valid predicate wff has no interpretation in which 


it is false. 


Section 1.4 


1. 


‚ Р(х) ^(dx)O(xX) сап Бе 


The inference rules of predicate logic allow ех- 
istential and universal quantifiers to be added or 
removed during a proof sequence. 


. Existential instantiation should be used only after 


universal instantiation. 
deduced from (Vx) 
[P(x) ^ Gy)O(y)] using universal instantiation. 


. Every provable wff of propositional logic is also 


provable in predicate logic. 


. A predicate wff that is not valid cannot be proved 


using predicate logic. 


ON THE COMPUTER 


For Exercises 1—5, write a computer program that pro- 
duces the desired output from the given input. 


1. 


Input: Truth values for two statement letters A and B 
Output: Corresponding truth values (appropriately 
labeled, of course) for 


AAB,AVB,A>B,AB,A' 


. Input: Truth values for two statement letters A and B 


Output: Corresponding truth values for the wffs 


A — B' and B' ^ [A V (А ^ B)] 


. Input: Truth values for three statement letters А, В, 


and C 
Output: Corresponding truth values for the wffs 


AV (BAC')>B' and AV C' o (AV C) 


. Input: Truth values for three statement letters 


A, B, and C, and a representation of a simple 


Section 1.5 


1. 
2. 


3. 


4. 


5. 


A Prolog rule describes a predicate. 

Horn clauses are wffs consisting of single negated 
predicates. 

Modus ponens is a special case of Prolog resolu- 
tion. 

A Prolog recursive rule is a rule of inference that is 
used more than once. 

A Prolog inference engine applies its rule of infer- 
ence without guidance from either the programmer 
or the user. 


Section 1.6 


1. 


2. 


3. 


4. 


Un 


A provably correct program always gives the right 
answers to a given problem. 

If an assertion after an assignment statement 15 
y > 4, then the precondition must be y > 4. 

Proof of correctness involves careful development 
of test data sets. 

Using the conditional rule of inference in proof 
of correctness involves proving that two different 
Hoare triples are valid. 


. The assertions used in proof of correctness can also 


be used as a program design aid before the program 
is written, and as program documentation. 


propositional wff. Special symbols can be used for 
the logical connectives, and postfix notation can be 
used; for example, 

ABAACVfor(A^B)VC 
ог 


A' B/\forA'AB 


Output: Corresponding truth value of the wff 


. Input: Representation of a simple propositional wff 


as in the previous exercise 

Output: Decision on whether the wff is a tautology 
Using the online Toy Prolog program that can be 
found at  http://www.csse.monash.edu.au/~lloyd/ 
tildeLogic/Prolog.toy, enter the Prolog database of 
Example 39 and perform the queries there. Note 
that each database entry requires a period. Also add 
the recursive rule for infoodchain and perform the 


query 


?infoodchain(bear, Y) 


Proofs, Induction, and 
Number Theory 


Chapter 


| CHAPTER OBJECTIVES |— _@ 


Question: 


After studying this chapter, you will be able to: 


e Attack the proofs of conjectures using the techniques of direct proof, proof by 
contraposition, and proof by contradiction. 

* Recognize when a proof by induction is appropriate and carry out such a proof 
using either the first or second principle of induction. 

* Mathematically prove the correctness of programs that use loop statements. 

* Test whether a given positive integer is prime; if not, find its prime factorization. 

* Work with number theoretic ideas of prime factorization, greatest common 
divisor, and the Euler phi function. 


The nonprofit organization at which you volunteer has received donations of 792 bars 
of soap and 400 bottles of shampoo. You want to create packages to distribute to 
homeless shelters such that each package contains the same number of shampoo 
bottles and each package contains the same number of bars of soap. 


How many packages can you create? 


This problem is solvable by trial and error, but it's much easier to use an ancient 
algorithm that is discussed in this chapter. 

First, however, we consider how to prove "real-world" arguments as opposed 
to the formal arguments of Chapter 1. It is helpful to have an arsenal of techniques 
for attacking a proof. Direct proof, proof by contraposition, and proof by 
contradiction are examined in Section 2.1. Many of the proofs given in this section 
are simple “number theory” results, that is, results about integers, such as “The 
product of two even integers is even." 

Section 2.2 concentrates on mathematical induction, a proof technique with 
particularly wide application in computer science. In Section 2.3 we see how, 
using induction, proof of correctness can be extended to cover looping statements. 
Finally, Section 2.4 explores some further number theory results, particularly 
concerning prime numbers. 
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SECTION 2.1 | PROOF TECHNIQUES 


Theorems and Informal Proofs 


The formal arguments of Chapter 1 have the form P — О, where P and О may 
represent compound statements. The point there was to prove that an argument 
is valid—true in all interpretations by nature of its internal form or structure, 
not because of its content or the meaning of its component parts. However, we 
often want to prove arguments that are not universally true but just true within 
some context. Meaning becomes important because we are discussing a particular 
subject—graph algorithms, or Boolean algebra, or compiler theory, or whatever— 
and we want to prove that if P is true in this context, then so is Q. If we can do 
this, then P — Q becomes a theorem about that subject. To prove a theorem about 
subject XXX, we can introduce facts about XXX into the proof; these facts act 
like additional hypotheses. Note that as we add more hypotheses, the universe of 
discourse shrinks; we are no longer considering universally valid arguments, only 
arguments that are true within the context in which the hypotheses hold.! 

It may not be easy to recognize which subject-specific facts will be helpful or 
to arrange a sequence of steps that will logically lead from P to О. Unfortunately, 
there is no formula for constructing proofs and no practical general algorithm or 
computer program for proving theorems. Experience is helpful, not only because 
you get better with practice, but also because a proof that works for one theorem 
can sometimes be modified to work for a new but similar theorem. 

Theorems are often stated and proved in a somewhat less formal way than 
the propositional and predicate arguments of Chapter 1. For example, a theorem 
may express the fact that every object in the domain of interpretation (the subject 
matter under discussion) having property P also has property Q. The formal 
statement of the theorem would be (Vx)[P(x) > О(х)]. But the theorem would be 
informally stated as P(x) — Q(x). If we can prove P(x) — Q(x) where x is treated 
as an arbitrary element of the domain, universal generalization would then give 
(Vx)[P@) > О()]. 

As another example, we may know that all objects in the domain have some 
property; that is, something of the form (Vx)P(x) can be considered as a subject- 
specific fact. An informal proof might proceed by saying, “Let x be any element 
of the domain. Then x has property Р” (Formally, we are making use of universal 
instantiation to get P(x) from (Vx)P(x).) 

Similarly, proofs are usually not written a step at a time with formal justifica- 
tions for each step. Instead, the important steps and their rationale are outlined 
in narrative form. Such a narrative, however, can be translated into a formal 
proof if required. In fact, the value of a formal proof is that it serves as a sort of 
insurance—if a narrative proof cannot be translated into a formal proof, it should 
be viewed with great suspicion. 


‘In the world of “pure predicate logic,” which is a correct and complete formal system, every true (valid) 
argument is provable. But in these more restrictive contexts, not everything that is “true” is necessarily 
provable, no matter how clever we are in adding additional hypotheses or “axioms.” In other words, these 
systems may not be complete. At the age of 25, the German logician Kurt Gódel proved in 1931 that, using 
reasonable hypotheses, even elementary arithmetic is an incomplete system. This result shocked the math- 
ematical community of the time, which had been depending on axiomatic systems since the days of Euclid. 


REMINDER 


One counterexample is 
enough to disprove a 
conjecture. 
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To Prove or Not to Prove 


A textbook will often say, “Prove the following theorem,” and the reader will 
know that the theorem is true; furthermore, it is probably stated in its most pol- 
ished form. But suppose you are doing research in some subject. You observe a 
number of cases in which whenever P is true, Q is also true. On the basis of these 
experiences, you may formulate a conjecture: P — О. The more cases you find 
where Q follows from P, the more confident you are in your conjecture. This pro- 
cess illustrates inductive reasoning, drawing a conclusion based on experience. 

No matter how reasonable the conjecture sounds, however, you will not be 
satisfied until you have applied deductive reasoning to it as well. In this pro- 
cess, you try to verify the truth or falsity of your conjecture. You produce a proof 
of P О (thus making it a theorem), or else you find a counterexample that 
disproves the conjecture, a case in which P is true but Q is false. (We were using 
deductive reasoning in predicate logic when we either proved that a wff was valid 
or found an interpretation in which the wff was false.) 

If you are simply presented with a conjecture, it may be difficult to decide 
which of the two approaches you should try—to prove the conjecture or to dis- 
prove it! A single counterexample to a conjecture is sufficient to disprove it. Of 
course, merely hunting for a counterexample and being unsuccessful does not 
constitute a proof that the conjecture is true. 


Fora positive integer n, n factorial is defined as n(n — 1)(n — 2): : : 1, andis denoted 


by n!. Prove or disprove the conjecture, “For every positive integer n, n! = n^." 


Let's begin by testing some cases: 


n | nt | n | т<п? 
1 1 yes 
2 4 yes 
3 9 yes 


So far, this conjecture seems to be looking good. But for the next case, 


n! | п? | ni<n? 


al || © IG no 


we have found a counterexample. The fact that the conjecture is true for n = 1, 2, 
and 3 does nothing to prove the conjecture, but the single case и = 4 is enough to 
disprove it. ө 


PRACTICE 1 | Provide counterexamples to the following conjectures. 


a. All animals living in the ocean are fish. 
b. Every integer less than 10 is bigger than 5. и 
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If a counterexample is not forthcoming, perhaps the conjecture is true and we 
should try to prove it. What techniques can we use to try to do this? For the rest of 
this section, we'll examine various methods of attacking a proof. 


Exhaustive Proof 


While “disproof by counterexample” always works, “proof by example” seldom 
does. The one exception to this situation occurs when the conjecture is an asser- 
tion about a finite collection. In this case, the conjecture can be proved true by 
showing that it is true for each member of the collection. Proof by exhaustion 
means that all possible cases have been exhausted, although it often means that 
the person doing the proof is exhausted as well! 


Prove the conjecture, “If an integer between | and 20 is divisible by 6, then it is 
also divisible by 3.” (“Divisible by 6,” means, “evenly divisible by 6,” that is, the 
number is an integral multiple of 6.) 

Because there is only a finite number of cases, the conjecture can be proved 
by simply showing it to be true for all the integers between | and 20. Table 2.1 is 


the proof. 
TABLE 2.1 
Number Divisible by 6 Divisible by 3 
1 no 
2 no 
3 no 
4 no 
© по 
6 yes:6=1x6 yes:6 2-2 x 3 
[ no 
8 no 
9 no 
10 no 
11 no 
12 yes: 122 2X 6 yes: 12 - 4x 8 
13 no 
14 no 
i no 
16 no 
17 no 
18 yes: 182 3X 6 yes: 182 6 X 8 
19 no 
20 no e 
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Prove the conjecture, “It is not possible to trace all the lines in Figure 2.1 without 
lifting your pencil and without retracing any lines.” 

There is only a finite number of different ways to trace 
the lines in the figure. By careful bookkeeping, each of the 
possibilities can be attempted, and each will fail. In Chap- 
ter 7, we will learn a much less tedious way to solve this 

Figure 2.1 problem than proof by exhaustion. ө 


PRACTICE 2 


a. Prove the conjecture “For any positive integer less than or equal to 5, the square of the integer 
is less than or equal to the sum of 10 plus 5 times the integer.” 
b. Disprove the conjecture “For any positive integer, the square of the integer is less than or equal 
to the sum of 10 plus 5 times the integer.” m 


Direct Proof 


In general (where exhaustive proof won't work), how can you prove that P > О 
is true? The obvious approach is the direct proof—assume the hypothesis P and 
deduce the conclusion О. A formal proof would require a proof sequence leading 
from P to Q. 

Example 4 shows a formal proof that if two numbers are even (that's the 
hypothesis P), then their product is even (that's the conclusion О). Recall that an 
even number is a number that is an integral multiple of 2, for example, 18 is even 
because 18 — 2(9). An odd number is 1 more than an integral multiple of 2, for 
example, 19 = 2(9) + 1. 


| EXAMPLE 4 | Consider the conjecture 


x is an even integer ^ у is an even integer — the product xy is an even integer 


A complete formal proof sequence might look like the following: 


l. x is an even integer ^ y is an even integer hyp 
2. (Vx)[x is even integer > number fact (definition 
(3K)(k an integer ^ x = 20] of even integer) 
3. xis even integer — (ЧЁ)(К an integer ^ x = 2k) 2,ui 
4. у is even integer — (3A)(k an integer ^ y = 2K) 22, it 
5. x iS an even integer 1, sim 
6. (Ak)(K is an integer ^ x = 2k) 3, 5, mp 
7. m is an integer ^ x = 2m 6, ei 
8. y is an even integer 1, sim 
9. (Ak)(k an integer ^ у = 2k) 4, 8, mp 
10. n is an integer and y = 2n 9, ei 
lll, == 2m 7, sim 


12. y = 2п 10, sim 
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13. xy = (2т)(2п) 11, 12, substitution of equals 
14. xy = 2(2mn) 13, multiplication fact 
15. m is an integer 7, sim 
16. nis an integer 10, sim 
17. 2mn is an integer 15, 16, number fact 
18. xy = 2(2mn) ^ 2mn is an integer 14, 17, con 
19. (Ak)(k an integer ^ ху = 2k) 18, eg 
20. (Vx)((3À)(k an integer ^ x = 2k) > number fact (definition 
x is even integer) of even integer) 
21. €k)(k an integer ^ xy = 2k) > 20, ui 


xy is even integer 
22. xy is an even integer 19, 21, mp 


It is understood that x and y are arbitrary, but this could be stated by expressing the 
conjecture as 


(Vx)(Vy)(x is an even integer ^ y is an even integer > 
the product xy is an even integer) 


Universal generalization can be applied twice to the result we already have in order 
to put the universal quantifiers on the front. ө 


We'll never again do a proof like the one in Example 4, and you won't have 
to either! A much more informal proof would be perfectly acceptable in most 
circumstances. 


Following is an informal direct proof that the product of two even integers is even. 

Let x — 2m and y — 2n, where m and n are integers. Then xy — (2m)(2n) — 
2(2mn), where 2mn is an integer. Thus xy has the form 2k, where / is an integer, and 
xy is therefore even. 

Notice that we set x = 2m for some integer m (the definition of an even 
number), but we set y — 2n. In the formal proof of Example 4, the restriction 
on the use of ei required that we use a different multiple of 2 for y than we used 
for x. Informally, if we were also to set y = 2m, we would be saying that x and 
y are the same integers, which is a very special case. ө 


The proof in Example 5 does not explicitly state the hypothesis (that x and 
y are even), and it makes implicit use of the definition of an even integer. Even 
in informal proofs, however, it is important to identify the hypothesis and the 
conclusion, not just what they are in words but what they really mean, by apply- 
ing appropriate definitions. If we do not clearly understand what we have (the 
hypothesis) or what we want (the conclusion), we cannot hope to build a bridge 
from one to the other. That’s why it is important to know definitions. 
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PRACTICE 3 | Give a direct proof (informal) of the theorem “If an integer is divisible by 6, then twice 
that integer is divisible by 4.” Є 


Contraposition 


If you have tried diligently but failed to produce a direct proof of your conjecture 
P — О, and you still feel that the conjecture is true, you might try some variants 
on the direct proof technique. If you can prove the theorem Q' — P', you can 
conclude P > О by making use of the tautology (Q' > Р') > (P > О). O' > P' 
is the contrapositive of P — О, and the technique of proving P > О by doing a 
direct proof of Q' — P' is called proof by contraposition. (The contraposition 
rule of inference in propositional logic (Table 1.14) says that P — Q can be derived 
from Q' > P") 


Prove that if the square of an integer is odd, then the integer must be odd. 
The conjecture is п? odd — n odd. We do a proof by contraposition, and prove 
n even — п? even. Let n be even. Then и? = n(n) is even by Example 5. ө 


Prove that ifn + 1 separate passwords are issued to л students, then some student 
gets > 2 passwords. 

Here the conclusion О has the form (3x)A(x), so Q' is [Gx)R(x)]', which is 
equivalent to (Vx)[A(x)]'. The contrapositive О’ — P’ is, “If every student gets 
< 2 passwords, then it is false that n + 1 passwords were issued." Suppose every 
student has < 2 passwords; then every one of the n students has at most 1 pass- 
word. The total number of passwords issued is at most л, not n + 1, so it is false 
that п + 1 passwords were issued. ө 


Example 7 is an illustration of the pigeonhole principle, which we will see in 
Chapter 4. 


PRACTICE 4 | Write the contrapositive of each statement in Practice 5 of Chapter 1. 


Practice 7 of Chapter 1 showed that the wffs 4 — B and B — A are not 
equivalent. B — A is the converse of А — В. If an implication is true, its converse 
may be true or false. Therefore, you cannot prove P — Q by looking at Q — P. 


ЭТТ The implication “If a > 5, then a > 2" is true, but its converse, “If a > 2, then 
a> 5," 1s false. e 


PRACTICE 5 | Write the converse of each statement in Practice 5 of Chapter 1. 
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REMINDER 


“If and only if” requires 
two proofs, one in each 
direction. 


Theorems are often stated in the form “P if and only if О, meaning P if Q 
and P only if О, or О > P and P > О. To prove such a theorem, you must prove 
both an implication and its converse. Again, the truth of one does not imply the 
truth of the other. 


Prove that the product xy is odd if and only if both x and y are odd integers. 

We first prove that if x and y are odd, so is xy. A direct proof will work. Suppose 
that both x and y are odd. Then x = 2л + 1 and y = 2m + 1, where m and n are inte- 
gers. Then ху = (2n + 1)(2m + 1) = 4ит + 2m + 2n + 1 = 2(2nm + m + n) +1. 
This has the form 2k + 1, where k is an integer, so xy is odd. 

Next we prove that if xy is odd, both x and y must be odd, or 


xy odd — x odd and y odd 


A direct proof would begin with the hypothesis that xy is odd, which leaves us little 
more to say. А proof by contraposition works well because we'll get more useful 
information as hypotheses. So we will prove 


(x odd and y odd)’ — (xy odd)’ 
By De Morgan's law (A ^ BY = А” V B', we see that this can be written as 
x even or y even xy even (1) 


The hypothesis “x even or y even” breaks down into three cases. We consider each 
case in turn. 


1. x even, y odd: Here x = 2m, у = 2n + 1, and then xy = (2т)(2п + 1) = 
2(2mn + m), which is even. 

2. x odd, y even: This works just like case 1. 

3. x even, y even: Then xy is even by Example 5. 


This completes the proof of (1) and thus of the theorem. e. 


The second part of the proof of Example 9 uses proof by cases, a form of 
exhaustive proof. It involves identifying all the possible cases consistent with the 
given information and then proving each case separately. 


Contradiction 


In addition to direct proof and proof by contraposition, you might use the tech- 
nique of proof by contradiction. (Proof by contradiction is sometimes called 
indirect proof, but this term more properly means any argument that is not a direct 
proof) As we did in Chapter 1, we will let 0 stand for any contradiction, that is, 
any wff whose truth value is always false. (А ^ A’ would be such a wff.) Once 


REMINDER 


To prove that something 
is not true, try proof by 
contradiction. 
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more, suppose you are trying to prove P — О. By constructing a truth table, we 
see that 


(PA Q'>0)> (P О) 


is a tautology, so to prove the theorem P — О, it is sufficient to prove P ^ Q' > 0. 
Therefore, in a proof by contradiction you assume that both the hypothesis and 
the negation of the conclusion are true and then try to deduce some contradiction 
from these assumptions. 


Let's use proof by contradiction on the statement, “If a number added to itself 
gives itself, then the number is 0." Let x represent any number. The hypothesis 
is x + x = x and the conclusion is x = 0. To do a proof by contradiction, assume 
x + x = x and x >= 0. Then 2x = x and x ¥ 0. Because x >= 0, we can divide 
both sides ofthe equation 2x — x by x and arrive at the contradiction 2 — 1. Hence, 
(x + х= x) (х = 0). ө 


Example 10 notwithstanding, a proof by contradiction most immediately 
comes to mind when you want to prove that something is not true. It's hard to 
prove that something is not true; it’s much easier to assume it is true and obtain 
a contradiction. 


EXAMPLE 11 


A well-known proof by contradiction shows that V2 is not a rational number. 
Recall that a rational number is one that can be written in the form p/q where p 
and q are integers, q ~ 0, and p and q have no common factors (other than + 1). 
Let us assume that V2 is rational. Then V2 = p/q, and 2 = p’/q’, or 2q° = p’. 
Then 2 divides p?, so—because 2 is itself indivisible—2 must divide p. This means 
that 2 is a factor of p; hence 4 is a factor of p? , and the equation 24” = p? can be 
written as 2q? = 4x, or q? = 2x. We see from this equation that 2 divides q^ and 
hence 2 divides q. At this point, 2 is a factor of q and a factor of p, which con- 
tradicts the statement that p and q have no common factors. Therefore V2 is not 


rational. e 


The proof of Example 11 involves more than just algebraic manipulations. It 
is often necessary to use lots of words in a proof. 


PRACTICE 6 | Prove by contradiction that the product of odd integers is not even. (We did a direct 


proof of an equivalent statement in Example 9.) 


Proof by contradiction can be a valuable technique, but it is easy to think we 
have done a proof by contradiction when we really haven't. For example, suppose 
we assume P ^ Q' and are able to deduce О without using the assumption Q’. Then 
we assert О Л Q' as a contradiction. What really happened here is a direct proof 
of P — Q, and the proof should be rewritten in this form. Thus in Example 10, 
we could assume x + x = x and x = 0, as before. Then we could argue that from 
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x +x =x we get 2x = x and, after subtracting x from both sides, x = 0. We then 
have x = 0 and x ¥ 0, a contradiction. However, in this argument we never made 
use of the assumption x # 0; we actually proved directly that x + x = x implies 
x = 0. 

Another misleading claim of proof by contradiction occurs when we assume 
P ^ Q' and are able to deduce P' without using the assumption P. Then we assert 
P ^ P' asa contradiction. What really happened here is a direct proof of О’ > P’, 
and we have constructed a proof by contraposition, not a proof by contradiction. 
In both this case and the previous one, it is not that the proofs are wrong, just that 
they are not proofs by contradiction. 

Table 2.2 summarizes useful proof techniques we have discussed so far. 


TABLE 2.2 
Proof Technique Approach to Prove P —^ Q | Remarks 
Exhaustive proof Demonstrate P — Q for all | May be used only to prove 
possible cases. a finite number of cases. 
Direct proof Assume P, deduce Q. The standard approach — 
usually the thing to try. 
Proof by contraposition | Assume Q', deduce P'. Use this if Q' as a hypoth- 
esis seems to give more 
ammunition than P would. 
Proof by contradiction | Assume P ^ Q', deducea | Use this when О says 
contradiction. something is not true. 


Serendipity 


Serendipity means a fortuitous happening, or good luck. While this isn't really 
a general proof technique, some of the most interesting proofs come from clever 
Observations that we can admire, even if we would never have thought of them 
ourselves. We'll look at two such proofs, just for fun. 


EXAMPLE 12 


A tennis tournament has 342 players. A single match involves 2 players. The win- 
ner of a match plays the winner of a match in the next round, while losers are 
eliminated from the tournament. The 2 players who have won all previous rounds 
play in the final game, and the winner wins the tournament. Prove that the total 
number of matches to be played is 341. 

The hard way to prove this result is to compute 342/2 — 171 to get the num- 
ber of matches in the first round, resulting in 171 winners to go on to the second 
round. For the second round, 171/2 — 85 plus 1 left over; there are 85 matches 
and 85 winners, plus the 1 left over, to go on to the third round. The third round 
has 86/2 — 43 matches, and so forth. The total number of matches is the sum of 
71 se QS ae 46) dp ооо, 

The clever observation is to note that each match results in exactly 1 loser, so 
there must be the same number of matches as losers in the tournament. Because 
there is only 1 winner, there are 341 losers and therefore 341 matches. e 
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A standard 64-square checkerboard is arranged in 8 rows of 8 squares each. Adja- 
cent squares are alternating colors of red and black. A set of 32 1 X 2 tiles, each 
covering 2 squares, will cover the board completely (4 tiles per row, 8 rows). Prove 
that if the squares at diagonally opposite corners of the checkerboard are removed, 
the remaining board cannot be covered with 31 tiles. 

The hard way to prove this result is to try all possibilities with 31 tiles and see 
that they all fail. The clever observation is to note that opposing corners are the 
same color, so the checkerboard with the corners removed has two less squares of 
one color than of the other. Each tile covers one square of each color, so any set 
of tiles must cover an equal number of squares of each color and cannot cover the 
board with the corners removed. ө 


Common Definitions 


Many of the examples in this section and many of the exercises that follow involve 
elementary number theory, that is, results about integers. It’s useful to work in 
number theory when first starting to construct proofs because many properties of 
integers, such as what it means to be an even number, are already familiar. The 
following definitions may be helpful in working some of these exercises. 


* A perfect square is an integer n such that n = К? for some integer k. 


* A prime number is an integer n > 1 such that n is not divisible by any 
integers other than 1 and n. 


* Acomposite number л is a nonprime integer; that is, п = ab where a and 
b are integers with 1 <a < n and 1 < b <и. 


* For two numbers x and y, x € y means y — x > 0. 


* For two integers n and m, n divides m, n |m, means that m is divisible by 
n—that 15, m = k(n) for some integer К. 


* The absolute value of a number x, |х|, is x if x = 0 and is —x if x < 0. 


SECTION 2.1 REVIEW 


TECHNIQUES is used either to refute a conjecture by finding a 
counterexample or to prove a conjecture. 

* [n proving a conjecture about some subject, facts 
about that subject can be used. 

* Under the right circumstances, proof by contra- 
position or contradiction may work better than a 
direct proof. 


* Look for a counterexample. 

(0) Construct direct proofs, proofs by contraposition, 
and proofs by contradiction. 

MAIN IDEAS 


* [nductive reasoning is used to formulate a con- 
jecture based on experience. Deductive reasoning 


EXERCISES 2.1 


1. Write the contrapositive of each statement in Exercise 5 of Section 1.1. 
2. Write the converse of each statement in Exercise 5 of Section 1.1. 
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. Provide counterexamples to the following statements. 


a. Every geometric figure with four right angles is a square. 
b. If a real number is not positive, then it must be negative. 
c. All people with red hair have green eyes or are tall. 

d. АП people with red hair have green eyes and are tall. 


. Provide counterexamples to the following statements. 


a. If a and b are integers where a | b and b |a, then а = b. 
b. If i? > 0 then n > 0. 
c. If n is an even number, then n? + 1 is prime. 


d. If n is a positive integer, then л? > n!. 


. Provide a counterexample to the following statement: The number n is an odd integer if and only if 


Зп + 5 is an even integer. 


. Provide a counterexample to the following statement: The number 7 is an even integer if and only if 


3n +2 is an even integer. 


. а. Find two even integers whose sum is not a multiple of 4. 


b. What is wrong with the following “proof” that the sum of two even numbers is a multiple of 4? 


Let x and y be even numbers. Then x = 2m and y = 2m, where m is an integer, sox + y = 2m + 2m = 4m, 
which is an integral multiple of 4. 


. &. Find an example of an odd number x and an even number y such that x — y — 7. 


b. What is wrong with the following “proof” that an odd number minus an even number is always 1? 
Let x be odd and y be even. Then x = 2m + 1, y = 2m, where m is an integer, and x — y = 2m + 1 — 2m = 1. 


For Exercises 9—46, prove the given statement. 


9. 


10. 
П. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 
21. 
22, 
23. 
24. 
25. 
26. 


If n = 25, 100, or 169, then л is a perfect square and is a sum of two perfect squares. 
If n is an even integer, 4 <n = 12, then n is a sum of two prime numbers. 

For any positive integer л less than or equal to 3, n! < 2”. 

For2znzx4,m = 2". 

The sum of two even integers is even (do a direct proof). 

The sum of two even integers is even (do a proof by contradiction). 

The sum of two odd integers is even. 

The sum of an even integer and an odd integer is odd. 

An odd integer minus an even integer is odd. 

If n is an even integer, then n* — 1 is odd. 

The product of any two consecutive integers is even. 

The sum of an integer and its square is even. 

The square of an even number is divisible by 4. 

For every integer n, the number 3(n* + 2n + 3) — 2п” is a perfect square. 

If a number x is positive, so is x + 1 (do a proof by contraposition). 

If n is an odd integer, then it is the difference of two perfect squares. 

The number л is an odd integer if and only if 3n + 5 = 6k + 8 for some integer k. 
The number и is an even integer if and only if 3n + 2 = 6k + 2 for some integer k. 


27. 
28. 
29. 
30. 
31. 
32. 
33. 
34. 
35. 
36. 
37. 
38. 
39. 
40. 


4 


um 


42. 


43. 


44 
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For x and y positive numbers, x < y if and only if x? < y’. 

If x) + 2x — 3 = 0, then x 42. 

If n is an even prime number, then n = 2. 

The sum of three consecutive integers 15 divisible by 3. 

If two integers are each divisible by some integer п, then their sum is divisible by и. 

If the product of two integers is not divisible by an integer и, then neither integer is divisible by л. 
If n, m, and p are integers and n | m and m |p, then n |p. 

If n, m, p, and q are integers and n |p and m |q, then nm | pq. 

The square of an odd integer equals 8k + 1 for some integer К. 

The sum of the squares of two odd integers cannot be a perfect square. (Hint: Use Exercise 35.) 
The product of the squares of two integers is a perfect square. 

The difference of two consecutive cubes is odd. 

For апу two numbers x and у, |x + y| = |x| + | y. 

For any two numbers x and y, |xy| = |x|] y|. 


. The value A is the average of the n numbers x}, x», ..., Xp. Prove that at least one of ху, x», ..., x, is greater 


than or equal to A. 


Suppose you were to use the steps of Example 11 to attempt to prove that \/4 is not a rational number. At 
what point would the proof not be valid? 


Prove that V3 is not a rational number. 


. Prove that V5 is not a rational number. 


45. 
46. 


Prove that 2 is not a rational number. 


Prove that log, 5 is not a rational number (log, 5 = x means 2* = 5). 


For Exercises 47—72, prove or disprove the given statement. 


4T. 
48. 
49. 
50. 
51; 
92; 
53. 
54. 
59; 
56. 
27. 
58. 
59. 
60. 


6 


— 


0 is an even number. 

91 is a composite number. 

297 is a composite number. 

83 is a composite number. 

The difference between two odd integers is odd. 

The difference between two even integers is even. 

The product of any three consecutive integers is even. 

The sum of any three consecutive integers is even. 

The sum of an integer and its cube is even. 

The number л is an even integer if and only if n° + 13 is odd. 

The product of an integer and its square is even. 

Any positive integer can be written as the sum of the squares of two integers. 
The sum of the square of an odd integer and the square of an even integer is odd. 
If n is a positive integer that is a perfect square, then и + 2 is not a perfect square. 


ae 1 
. For a positive integer n, n + m > 2. 
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62. If n, m, and p are integers and n | mp, then n | m or n | p. 
63. For every prime number л, п + 4 is prime. 

64. For every positive integer n, n^ + n + 3 is not prime. 
65. For п a positive integer, n > 2, п? — 1 is not prime. 
66. For every positive integer л, 2" + 1 is prime. 

67. For every positive integer n, n^ + n + 1 is prime. 

68. For n an even integer, n > 2, 2" — 1 is not prime. 

69. The sum of two rational numbers 15 rational. 

70. The product of two rational numbers is rational. 

7 
72. The sum of a rational number and an irrational number is irrational. 


— 


. The product of two irrational numbers is irrational. 


For Exercises 73—75, use the following facts from geometry and the accompanying figure. 


* The interior angles of a triangle sum to 180°. 

* Vertical angles (opposite angles formed when two lines intersect) are the same size. 
* Astraight angle is 180°. 

* Aright angle is 90°. 


73. Prove that the measure of angle 5 plus the measure of angle 3 is 90°. 
74. Prove that the measure of angle 4 is the sum of the measures of angles 1 and 2. 
75. If angle 1 and angle 5 are the same size, then angle 2 is a right angle. 


76. Prove that the sum of the integers from 1 through 100 is 5050. (Hint: Instead of actually adding all the 
numbers, try to make the same clever observation that the German mathematician Karl Friederick Gauss 
[1777—1855] made as a schoolchild: Group the numbers into pairs, using 1 and 100, 2 and 99, etc.) 


SECTION 2.2 | INDUCTION 


First Principle of Induction 


There is one final proof technique especially useful in computer science. To 
illustrate how the technique works, imagine that you are climbing an infinitely 
high ladder. How do you know whether you will be able to reach an arbitrarily 
high rung? Suppose we make the following two assertions about your climbing 
abilities: 


1. You can reach the first rung. 
2. Once you get to a rung, you can always climb to the next one up. (Notice 
that this assertion 1s an implication.) 
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If both statement 1 and the implication of statement 2 are true, then by 
statement 1 you can get to the first rung and therefore by statement 2 you can 
get to the second; by statement 2 again, you can get to the third rung; by state- 
ment 2 again you can get to the fourth; and so on. You can climb as high as you 
wish. Both assertions here are necessary. If only statement 1 is true, you have 
no guarantee of getting beyond the first rung, and if only statement 2 is true, 
you may never be able to get started. Let’s assume that the rungs of the ladder 
are numbered by positive integers—1, 2, 3, and so on. 

Now think of a specific property a number might have. Instead of “reach- 
ing an arbitrarily high rung,” we can talk about an arbitrary, positive integer 
having that property. We use the shorthand notation P(n) to mean that the posi- 
tive integer has the property P. How can we use the ladder-climbing tech- 
nique to prove that for all positive integers n, we have P(n)? The two assertions 
we need to prove are 


1. Pd) (1 has property P.) 
2. For any positive integer k, P(k) > P(k + 1). (If any number has 


property P, so does the 
next number.) 


If we can prove both assertions 1 and 2, then P(n) holds for any positive inte- 
ger n, just as you could climb to an arbitrary rung on the ladder. 

The foundation for arguments of this type is the first principle of mathemati- 
cal induction. 


€ PRINCIPLE FIRST PRINCIPLE OF MATHEMATICAL INDUCTION 
1. P(1) is true 


2. (УЮ[РОЮ) true > P(k + 1) true] | — P(n) true for all positive integers n 


The first principle of mathematical induction is an implication. The con- 
clusion is a statement of the form, “Р(и) is true for all positive integers л.” 
Therefore, whenever we want to prove that something is true for every positive 
integer n, it is a good bet that mathematical induction is an appropriate proof 
technique to use. 

To know that the conclusion of this implication is true, we show that the two 
hypotheses, statements | and 2, are true. To prove statement 1, we need only show 
that property P holds for the number 1, usually a trivial task. Statement 2 is also 
an implication that must hold for all k. To prove this implication, we assume for 
an arbitrary positive integer k that P(x) is true and show, based on this assump- 
tion, that P(A + 1) is true. Therefore P(k) — P(k + 1) and, using universal gener- 
alization, (VA)[P(k) > P(k + 1)]. You should convince yourself that assuming that 
property P holds for the number / is not the same as assuming what we ultimately 
want to prove (a frequent source of confusion when one first encounters proofs of 
this kind). It is merely the way to proceed with a direct proof that the implication 
P(k) > P(k + 1) is true. 

In doing a proof by induction, establishing the truth of statement 1, P(1), is 
called the basis, or basis step, for the inductive proof. Establishing the truth of 
P(k) > P(k + 1) is called the inductive step. When we assume P(k) to be true 


REMINDER 


To prove something true 
for all n > some value, 
think induction. 


Figure 2.2 
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so as to prove the inductive step, P(A) is called the inductive assumption, or 
inductive hypothesis. 

All the proof methods we have talked about in this chapter are techniques 
for deductive reasoning—ways to prove a conjecture that perhaps was for- 
mulated by inductive reasoning. Mathematical induction is also a deductive 
technique, not a method for inductive reasoning (don’t get confused by the ter- 
minology here). For the other proof techniques, we can begin with a hypothesis 
and string facts together until we more or less stumble on a conclusion. In fact, 
even if our conjecture is slightly incorrect, we might see what the correct con- 
clusion is in the course of doing the proof. In mathematical induction, however, 
we must know right at the outset the exact form of the property P(n) that we 
are trying to establish. Mathematical induction, therefore, is not an exploratory 
proof technique—it can only confirm a correct conjecture. 


Proofs by Mathematical Induction 


Suppose that the ancestral progenitor Smith married and had two children. Let's 
call these two children generation 1. Now suppose each of those two children had 
two children; then in generation 2, there were four offspring. This trend contin- 
ued from generation unto generation. The Smith family tree therefore looks like 
Figure 2.2. (This figure looks exactly like Figure 1.1b, where we looked at the 
possible T—F values for n statement letters.) 


Generation Offspring 
1 222! 
2 422* 
3 $223 


It appears that generation n contains 2" offspring. More formally, if we let P(n) 
denote the number of offspring at generation n, then we guess that 


P(n) = 2" 


We can use induction to prove that our guess for Р(и) is correct. 
The basis step is to establish P(1), which is the equation 


P(1)-2' 22 


This is true because we are told that Smith had two children. We now assume that 
our guess is correct for an arbitrary generation k, k = 1; that is, we assume 


P(k) = 2* 
and try to show that 


P(k + 1) = 2+! 
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In this family, each offspring has two children; thus the number of offspring at 
generation k + 1 will be twice the number at generation k, or P(k + 1) = 2P(K). 
By the inductive assumption, P(k) = 2*, so 


P(e + 1) = 2P(k) = 2(25 = 2"! 


and so indeed 
P(k + 1) = 24+! 


This completes our proof. Now that we have set our mind at ease about the 
Smith clan, we can apply the inductive proof technique to less obvious problems. 


| EXAMPLE14 | Prove that the equation 


jaa dS ap eta On = WS (1) 


is true for any positive integer n. Here the property P(n) is equation (1). (Notice 
that P(n) is a property of n, or—in language from Chapter 1—a unary predicate. It 
is a statement about л, expressed here as an equation. Thus it is incorrect to write 
something like P(n) = 1+3 +5 +: + (2n — 1)) 

The left side of this equation is the sum of all the odd integers from 1 to 
2n — 1. The right side is a formula for the value of this sum. Although we can 
verify the truth of this equation for any particular value of n by substituting that 
value for л, we cannot substitute all possible positive integer values. Thus a proof 
by exhaustion does not work. A proof by mathematical induction is appropriate. 

The basis step is to establish P(1), which is equation (1) when л has the value 
1. When we substitute 1 for n in the left side of equation (1), we get the sum of all 
the odd integers starting at 1 and ending at 2(1) — 1 — 1. The sum of all the odd 
numbers from 1 to 1 equals 1. When we substitute 1 for n in the formula on the 
right side of this equation, we get (1). Therefore 


A I= 


This is certainly true. For the inductive hypothesis, we assume Р(Ё) for an arbitrary 
positive integer k, which is equation (1) when n has the value k, or 


Р(Ю: 14+34+54+-+-4+Qk-1 =F (2) 
(Note that P(x) is not the equation (2k — 1) = £^, which is true only for k = 1.) 
Using the inductive hypothesis, we want to show P(k + 1), which is equation (1) 
when л has the value k + 1, or 


Р(Е +1): 1434+54+---+ [24+ 1) – 11 241) (3) 


(The question mark over the equals sign is to remind us that this is the fact we want 
to prove, as opposed to something we already know.) 
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The key to an inductive proof is to find a way to relate what we want to show— 
P(k + 1), equation (3)—to what we have assumed—P(K), equation (2). The left 
side of P(k + 1) can be rewritten to show the next-to-last term: 


Dream Iss pet 


This expression contains the left side of equation (2) as a subexpression. Because 
we have assumed Р(/) to be true, we can substitute the right side of equation (2) 
for this subexpression. Thus, 


14+345+---4+[2(k+ 1) – 1] 
=14+34+5+4+---+(2k- 1) + [2(k - 1) – 1] 
=? + [2(k + 1) – 1] 
= 02 + [26+ 2 – 1] 
= К + 2К + 1 
= (k + 1y 
Therefore, 


ПОРА = 1] = (з. 1 


which verifies P(A + 1) and proves that equation (1) is true for any positive 
integer л. ө 


Table 2.3 summarizes the three steps necessary for a proof using the first 
principle of induction. 


TABLE 2.3 

To Prove by First Principle of Induction 
Step 1 Prove base case. 

Step 2 Assume P(k). 

Step 3 Prove Р(к + 1). 


Any “summation” induction problem works exactly the same way. Write the summa- 
tion including the next-to-last term, and you will find the left side of the P(k) equation 
and can use the inductive hypothesis. After that, it’s just a question of algebra. 


| EXAMPLE 15 | Prove that 


ПЕ te i | 


for any n = 1. 


Again, induction is appropriate. P(1) is the equation 


[CE UNIES Or і 
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REMINDER which is true. We take P(x) 

To prove 

P(k) > Р(К + 1), you have 14+24+2?4+---4+2% = 201 – | 
to discover the Р(К) case 

within the P(k + 1) case. as the inductive hypothesis and try to establish P(k + 1): 


[р tt OP Ji pay cL ул нЕ 


Rewriting the sum on the left side of P(k + 1) reveals how the inductive as- 
sumption can be used: 
de 2 d D d ooo 5 
= rea a 
oe Eo (from the inductive hypothesis P(K)) 


= ле | (adding like terms) 
= okt i+ = 1 
Therefore, 
ecu E eee 
which verifies P(k + 1) and completes the proof. ө 


The following summation formula is the most important one because it ос- 
curs so often in the analysis of algorithms. If you don’t remember anything else, 
you should remember this formula for the sum of the first п positive integers. 


PRACTICE 7 | Prove that for any positive integer n, 


n(n+1) 


1+24+3+---+n= 
3 n 2 d 


Not all proofs by induction involve formulas with sums. Other algebraic iden- 
tities about the positive integers, as well as nonalgebraic assertions like the num- 
ber of offspring in generation n of the Smith family, can be proved by induction. 


ZOVE Prove that for any positive integer n, 2” > n. 


Р(1) is the assertion 2! > 1, which is surely true. Now we assume P(k), 2% > k, 
and try to conclude P(k + 1), 2^*! > k + 1. Now, where is P(k) hidden in here? 
Aha—we can write the left side of P(k + 1), 2**!, as 2k - 2, and there's the left side 
of P(k). Using the inductive hypothesis 2‘ > k and multiplying both sides of this 
inequality by 2, we get 2^- 2 > k- 2. We complete the argument 


tl = 24.2 > 6:2 = k + k > К + 1 (because k= 1) 
Or 


2ERL OS Kod] e 
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For the first step of the induction process, it may be appropriate to begin at 0 
or at 2 or 3 instead of at 1. The same principle applies, no matter where you first 
hop on the ladder. 


ЭТЕТ Prove that n? > Зи for n = 4. 


Here we should use induction and begin with a basis step of P(4). (Testing 
values of n — 1, 2, and 3 shows that the inequality does not hold for these values.) 
Р(4) is the inequality 4° > 3(4), or 16 > 12, which is true. The inductive hypoth- 
esis is that /2 > 3k and that k = 4, and we want to show that (k + 1)? > 3(k + 1). 


(A+ 1P =k +2k+1 
2» Be tb Dk sb 1] (by the inductive hypothesis) 
=>3k+8+4+1 (because k = 4) 
=3k+9 
>J pI (because 9 > 3) 
= (510) 


In this proof we used the fact that 3k + 9 > 3k + 3. Of course, 3k + 9 is 
greater than lots of things, but 3k + 3 is what gives us what we want. In an induc- 
tion proof, because we know exactly what we want as the result, we can let that 
guide us as we manipulate algebraic expressions. ө 


PRACTICE 8 | Prove that for all n > 1, 


2"+1 < 3" 


ЭТТЭ Prove that for any positive integer n, the number 2?" – 1 is divisible by 3. 

The basis step is to show P(1), that 22) — 1 = 4 — 1 = 3 is divisible by 3. 
Clearly this is true. 

We assume that 27^ — 1 is divisible by 3, which means that 27*— 1 = 3m for 
some integer т, ог 2™% = 3m + 1 (this little rewriting trick is the key to these 
“divisibility” problems). We want to show that 22** — 1 is divisible by 3. 


220+ 1) E e 22k*2 Ex 


а! 

= 2(3m + 1) – 1 (by the inductive hypothesis) 
=12m+4-1 

= 12m + 3 

= 3(4m + 1) where 4m + 1 is an integer 


Thus 2267) — 1 is divisible by 3. ` 


Misleading claims of proof by induction are also possible. When we prove 
the truth of P(A + 1) without relying on the truth of P(k), we have done a di- 
rect proof of P(k + 1) where k + 1 is arbitrary. The proof is not invalid, but it 
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should be rewritten to show that it is a direct proof of P(n) for any n, not a proof 
by induction. 

An inductive proof may be called for when its application is not as obvious as 
in the above examples. The problem statement may not directly say “prove some- 
thing about nonnegative integers." Instead, there is some quantity in the statement 
to be proved that can take on arbitrary nonnegative integer values. 


A programming language might be designed with the following convention re- 
garding multiplication: A single factor requires no parentheses, but the product “а 
times b” must be written as (a)b. So the product 


eren» els eem 
could be written in this language as 


((((((a)b)c)d Je) f£ )g 


or, for example, 


((a)b)(c)d Xe) f )g 


depending on the order in which the products are formed. The result is the same 
in either case. 

We want to show that any product of factors can be written with an even number 
of parentheses. The proof is by induction on the number of factors (this is where the 
nonnegative integer comes in—it represents the number of factors in any product of 
factors). For a single factor, there are 0 parentheses, an even number. Assume that 
for any product of k factors there is an even number of parentheses. Now consider 
a product P of k + 1 factors. P can be thought of as r times s where r has k factors 
and s is a single factor. By the inductive hypothesis, r has an even number of pa- 
rentheses. Then we write r times s as (r)s. This adds 2 more parentheses to the even 
number of parentheses in r, giving P an even number of parentheses. e 


Here's an important observation about the proof in Example 19: There are no 
algebraic expressions! The entire proof is a verbal argument. For some proofs, we 
can't rely on just the crutch of nonverbal mathematical manipulations; we have to 
use words. 


A "tiling" problem gives a nice illustration of induction in a geometric setting. 
An angle iron is an L-shaped piece that can cover 3 squares on a checkerboard 
(Figure 2.3a). The problem is to show that for any positive integer n, a 2" X 2" 
checkerboard with one square removed can be tiled—completely covered—by 
angle irons. 

The base case isn = 1, which givesa2 X 2 checkerboard. Figure 2.3b shows the 
solution to this case if the upper right corner is removed. Removing any of the other 
three corners works the same way. Assume that any 2* X 2‘ checkerboard with one 
square removed can be tiled using angle irons. Now consider a checkerboard with 


€ PRINCIPLE 
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dimensions 2**! x 2^*!, We need to show that it can be tiled when one square is re- 
moved. To relate the k + 1 case to the inductive hypothesis, divide the 2%"! x 2**! 
checkerboard into four quarters. Each quarter will be а 2* x 2* checkerboard, and 
one will have a missing square (Figure 2.3с). By the inductive hypothesis, this 
checkerboard can be tiled. Remove a corner from each of the other three checker- 
boards, as in Figure 2.3d. By the inductive hypothesis, the three boards with the holes 
removed can be tiled, and one angle iron can tile the three holes. Hence the original 
2**! x 2**! board with its one hole can be tiled. 


(a) (b) (c) (d) 
Figure 2.3 ө 


Second Principle of Induction 


In addition to the first principle of induction, which we have been using, 


1. P(1) is true ИП 
2. (УЮ[Р(®) true > P(k + 1) true] — P(n) true for all positive integers n 


there is a second principle of induction. 


SECOND PRINCIPLE OF MATHEMATICAL INDUCTION 
1’. P(1) is true 


2'. (Vk)[P(r) true for all r, — P(n) true for all positive integers n 
1 Sr S k— Р(Ё + 1) true] 


These two induction principles differ in statements 2 and 2’. In statement 2, 
we must be able to prove for an arbitrary positive integer k that P(k + 1) is true 
based only on the assumption that P(A) is true. In statement 2’, we can assume 
that P(r) is true for all integers r between 1 and an arbitrary positive integer k in 
order to prove that P(k + 1) is true. This seems to give us a great deal more “am- 
munition,” so we might sometimes be able to prove the implication in 2’ when we 
cannot prove the implication in 2. 

What allows us to deduce (Vn)P(n) in either case? We will see that the two in- 
duction principles themselves, that is, the two methods of proof, are equivalent. In 
other words, if we accept the first principle of induction as valid, then the second 
principle of induction is valid, and conversely. In order to prove the equivalence 
of the two induction principles, we'll introduce another principle, which seems so 
obvious as to be unarguable. 
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Ф PRINCIPLE PRINCIPLE OF WELL-ORDERING 
Every collection of positive integers that contains any members at all has a 
smallest member. 


We will see that the following implications are true: 


second principle of induction — first principle of induction 
first principle of induction — well-ordering 
well-ordering — second principle of induction 


As a consequence, all three principles are equivalent, and accepting any one of 
them as true means accepting the other two as well. 

To prove that the second principle of induction implies the first principle of 
induction, suppose we accept the second principle as valid reasoning. We then 
want to show that the first principle is valid; that is, that we can conclude P(r) for 
all n from statements 1 and 2. If statement | is true, so is statement 1’. If statement 
2 is true, then so is statement 2’, because we can say that we concluded P(k + 1) 
from P(r) for all r between 1 and k, even though we used only the single condi- 
tion P(k). (More precisely, statement 2’ requires that we prove P(l) ^ P(2) Л · · · 
A P(k) > P(k + 1), but P(I) ^ PQ) ^: + ^ P(A) > P(k), and from statement 2, 
P(k) > P(k + 1), so РПО) A PQ) Л. ^ P(k)  P(k + 1)) By the second prin- 
ciple of induction, we conclude P(n) for all n. The proofs that the first principle 
of induction implies well-ordering and that well-ordering implies the second 
principle of induction are left as exercises in Section 4.1. 

To distinguish between a proof by the first principle of induction and a proof 
by the second principle of induction, let's look at a rather picturesque example that 
can be proved both ways. 


| EXAMPLE 21 | Prove that a straight fence with n fence posts has n — 1 sections for any n = 1 (as 


in Figure 2.4a). 


Fence with 4 fenceposts, 3 sections Fence with 1 fencepost, 0 sections 


(a) (b) 


Fence with last post and Fence with 1 section removed 


last section removed 
(c) (d) 


Figure 2.4 
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Let P(n) be the statement that a fence with л fence posts has n — 1 sections, and 
prove P(n) true for all n = 1. 

We'll start with the first principle of induction. For the basis step, P(1) says that 
a fence with only 1 fence post has 0 sections, which is clearly true (Figure 2.4b). 
Assume that P(A) is true: 


a fence with К fence posts has k — 1 sections 
and try to prove P(k + 1): 
(?) a fence with k + 1 fence posts has k sections 


Given a fence with k + 1 fence posts, how can we relate that to a fence with 
k fence posts so that we can make use of the inductive hypothesis? We can chop 
off the last post and the last section (Figure 2.4c). The remaining fence has k fence 
posts and, by the inductive hypothesis, k — 1 sections. Therefore the original fence 
had К sections. 

Now we'll prove the same result using the second principle of induction. The 
basis step is the same as before. For the inductive hypothesis, we assume 


for all r, 1 = r = k, a fence with r fence posts has r — 1 sections 
and try to prove P(k + 1): 
(2) a fence with k + 1 fence posts has k sections 
For a fence with k + 1 fence posts, split the fence into two parts by removing 
one section (Figure 2.4d). The 2 parts of the fence have r, and r, fence posts, where 


lsr,5hk,1 = Sk, andr, +r,=k + 1. By the inductive hypothesis, the 2 
parts have, respectively, ғ; — 1 and r, — 1 sections, so the original fence has 


(п — 1) + (5 = 1) + 1 sections 
(The extra 1 is for the one that we removed.) Simple arithmetic then yields 
ry +r, =~ 1 = (Е 1) — 1 = Ksections 


This proves that a fence with k + 1 fence posts has К sections, which verifies 
Р(к + 1) and completes the proof using the second principle of induction. e 


Example 21 allowed for either form of inductive proof because we could 
either reduce the fence at one end or split it at an arbitrary point. The problem of 
Example 19 is similar. 


We again want to show that any product of factors can be written in this program- 
ming language with an even number of parentheses, this time using the second prin- 
ciple of induction. The base case is the same as in Example 19: A single factor has 
0 parentheses, an even number. Assume that any product of r factors, 1 = r = k, 
can be written with an even number of parentheses. Then consider a product 
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P with k + 1 factors. P can be written as (S)T, a product of two factors S and T, 
where S has r, factors and T has r, factors. Then 1 = r; = kand 1 S r, S k, with 
T, t ry =k + 1. By the inductive hypothesis, S and T each have an even number 
of parentheses, and therefore so does (S)T — P. ө 


Most problems do not work equally well with either form of induction; the 
fence post and the programming language problem were somewhat artificial. 
Generally, the second principle of induction is called for when the problem 
“splits” most naturally in the middle instead of growing from the end. 


Prove that for every integer n = 2, n is a prime number or a product of prime numbers. 

We will postpone the decision of whether to use the first or the second prin- 
ciple of induction; the basis step is the same in each case and need not start with 
1. Obviously here we should start with 2. P(2) is the statement that 2 is a prime 
number or a product of primes. Because 2 is a prime number, P(2) is true. Jumping 
ahead, for either principle we will be considering the number k + 1. If k + 1 is 
prime, we are done. If k + 1 is not prime, then it is a composite number and can be 
written as k + 1 = ab. Here k + 1 has been split into two factors. Maybe neither 
of these factors has the value К, so an assumption only about P(x) isn't enough. 
Hence, we'll use the second principle of induction. 

So let's start again. We assume that for all r, 2 =r = k, P(r) is true—r is 
prime or the product of primes. Now consider the number Ё + 1. If k + 1 is 
prime, we are done. If k + 1 is not prime, then it is a composite number and 
can be written as k + 1 = ab, where 1 <a < k - 1and 1 < b< k +1. (This 
is a nontrivial factorization, so neither factor can be 1 or k + 1.) Therefore 
2 = а= капі2 = b = k. The inductive hypothesis applies to both a and b, so a 
and b are either prime or the product of primes. Thus, k + 1 = ab is the product 
of prime numbers. This verifies P(A + 1) and completes the proof by the second 
principle of induction. e 


The proof in Example 23 is an existence proof rather than a constructive 
proof. Knowing that every nonprime number has a factorization as a product 
of primes does not make it easy to find such a factorization. (We will see in 
Section 2.4 that there is, except for the order of the factors, only one such 
factorization.) Some encryption systems for passing information in a secure 
fashion on the Web depend on the difficulty of factoring large numbers into 
their prime factors (see the discussion on public-key encryption in Section 5.6). 


| EXAMPLE 24 | Prove that any amount of postage greater than or equal to 8 cents can be built using 


only 3-cent and 5-cent stamps. 

Here we let P(n) be the statement that only 3-cent and 5-cent stamps are need- 
ed to build n cents worth of postage, and prove that P(n) is true for all n = 8. The 
basis step 15 to establish P(8), which is done by the equation 


8=3 +5 
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For reasons that will be clear momentarily, we'll also establish two additional 
cases, P(9) and P(10), by the equations 


=з зе} 
10=5+5 


Now we assume that P(r) is true, that is, r can be written as a sum of 3s and 5s, for 
any r 8 =r = k, and consider P(k + 1). We may assume that k + 1 is at least 11, 
because we have already proved P(r) true for = 8, 9, and 10. Ifk + 1 = 11, then 
(k+ 1) -3 =k-2 = 8. Thus k 2 is a legitimate r value, and by the inductive 
hypothesis, P(k — 2) is true. Therefore k — 2 can be written as a sum of 3s and 5s, 
and adding an additional 3 gives us Ё + 1 as a sum of 3s and 5s. This verifies that 
P(k + 1) is true and completes the proof. e 


PRACTICE 9 


a. Why are the additional cases P(9) and P(10) proved separately in Example 24? 
b. Why can't the first principle of induction be used in the proof of Example 24? и 


As а general гше, the first principle of mathematical induction applies 


REMINDER when information about “one position back” is enough, that is, when the truth 
Use the second principle of P(k) is enough to prove the truth of P(k + 1). The second principle applies 
of induction when the when information about “опе position back" isn't good enough; that is, you 
К + 1 case depends on can't prove that P(k + 1) is true just because you know P(A) is true, but you can 


results farther back thank. — prove P(k + 1) true if you know that P(r) is true for one or more values of r that 
are “farther back" than k. 


SECTION 2.2 REVIEW 


TECHNIQUES * Aninductive proof need not begin with 1. 

* [nduction can be used to prove statements about 
quantities whose values are arbitrary nonnegative 
integers. 

* The first and second principles of induction each 
prove the same conclusion, but one approach may 

* Mathematical induction is a technique to prove be easier to use than the other in a given situation. 
properties of positive integers. 


QJ Use the first principle of induction in proofs. 
W Use the second principle of induction in proofs. 


MAIN IDEAS 


EXERCISES 2.2 


1. For all positive integers, let P(n) be the equation 


a. Write the equation for the base case P(1) and verify that it is true. 
b. Write the inductive hypothesis P(K). 

c. Write the equation for P(k + 1). 

d. Prove that P(k + 1) is true. 
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2. For all positive integers, let P(n) be the equation 


24+4+6+4+-::-+2n=n(n + 1) 


a. Write the equation for the base case P(1) and verify that it is true. 
b. Write the inductive hypothesis P(K). 
c. Write the equation for P(A + 1). 
d. Prove that P(k + 1) is true. 
In Exercises 3—26, use mathematical induction to prove that the statements are true for every positive integer n. 


[Hint: In the algebra part of the proof, if the final expression you want has factors and you can pull those factors 
out early, do that instead of multiplying everything out and getting some humongous expression. | 


3. ab Fh +--+ (4n — 3) = an- 1) 
414346 n(n + 1) _ n(n + l)(n + 2) 
: ; ; 
5. 4 - 10+ 16 t (би — 2) = nn +1) 
6. 54- 10 4 I5 g= ee) 
2 
7 12 22 „n п + DOn + 1) 
6 
8. Do pa tat 1) 
4 
оаа а 8 фр Жекеше ы 
| 3 
10. 14 + 23 Lol n(n + 1)(2n + 1)3n? + Зи — 1) 
n = 
30 
+ n + 
Te + Dub aesuse dures cde es 7) 
peg 
Dae pete = fora # 0,a ~ 1 
e NP l 0n 
12 2:3 3:4 п(п +1) п+1 
ic : E um | nmn 
O3 3:5 5:7 (Оп —1)0л+1) 2л+1 
5 5 —]y*! +1 
15.12 — 22 + 32 – 42 +---+( pue = £ ) wee ) 
16.2 t 6 t 18 | жк cam ba ar | 
(x а Дааа (ол)? _ 2n(n + 1)(2n + 1) 
| 3 
18.1:21+ 2:22 + 3:2 +... + п: 2" = (п – 1)2"*! + 2 
+ 1)(и + 2 
19.152 2«3 4-354 dg aor 4 pat Yn + 2) 


3 
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20.1:2:3 7 2:3:4 +: + п(п  1)(nd jy QT eS) 


4 
21 A M RENE" 1 || n 
1-4 47 7:10. | Qn-2)8n-1) 3n41 
22.1:1! c 2-21! t 3-3! c 9 non! = (n + 1)! — 1 where n! is the product of the positive integers 
from | to n. 
4"+1 — | 
23.14+44+ 4 b ee B amo 
3 
xy! 
24.1 d» Pa Uc Xt р where x is any integer > 1 
= 
3n — 1 
25:1 ded d 9 4 NE pe vp qun j= eD 
—(x-2 
26.1 + [x-2 — (x - D] + [x-3 — x D] +--+ [xn (х jan ы )] 
where x is any integer = 1 


27. A geometric progression (geometric sequence) is a sequence of terms where there is an initial term a and 
each succeeding term is obtained by multiplying the previous term by a common ratio r. Prove the formula 
for the sum of the first n (n = 1) terms of a geometric sequence where r 7 1: 
a — ar" 
acard ar fester = — — 
LF 
28. An arithmetic progression (arithmetic sequence) 1s a sequence of terms where there is an initial term a and 
each succeeding term is obtained by adding a common difference d to the previous term. Prove the formula 
for the sum of the first n (n > 1) terms of an arithmetic sequence: 


a- (a- 2 + (a 2d) з= [а + (n Dd] = 52а + (n — 1)а] 

29. Using Exercises 27 and 28, find an expression for the value of the following sums. 

d 2298 Она анана 57 

b. 4-7 c 4- T + 4- P o 4-72 

c. 1+7+13+---+ 49 

d.12+17+22+27+-::-+92 
30. Prove that 

1 — 2"! 
(-2* + C2 + C26 € (2) = —, 


for every positive odd integer и. 
31. Prove that n? > п + 1 forn = 2. 
32. Prove that n? = 2n + 3 forn = 3. 
33. Prove that n? > 5n + 10 forn > 6. 
34. Prove that 2” > п? forn = 5. 


In Exercises 35—40, n! is the product of the positive integers from 1 to n. 


35. Prove that n! > n? for n = 4. 


36. 
37. 
38. 
39. 
40. 
. Prove that (1 + x)” > 1 + x” forn > 1,х > 0. 


4 


— 


42. 
43. 


44. 


45. 


46. 


For Exercises 47—58, prove that the statements are true for every positive integer. 


47. 
48. 
49. 
50. 
5L. 
52. 
93: 
54. 
59; 
56. 
57. 
58. 
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Prove that n! > n> forn = 6. 
Prove that n! > 2” for n = 4. 
Prove that n! > 3” for п = 7. 
Prove that n! = 2"! forn = 1. 
Prove that n! < n” for n = 2. 


a п+1 а п 
Prove that (2) < Ө forn = landO<a<b. 


Prove that] +2+-::+n<rforn> l. 


Prove that 1 Ag ae E fornz2 
a. Try to use induction to prove that 
14 T: + | <2 f > ] 
T 2 T 4 T 2 orn = 


What goes wrong? 
b. Prove that 


1 1 1 1 

1+ | Ed =2 forn = 1 
2 4 2^ 2" 

thus showing that 
] d : TUN <2 f > 1 
1 2 1 4 1 T э" orn = 
Prove that 

1 1 1 п 

14 | bree >14 for n= 1 
2 3 2! 2 


(Note that the denominators increase by 1, not by powers of 2.) 


2?" — ] is divisible by 7. 

32" + 7 is divisible by 8. 

7" — 2" is divisible by 5. 

13" — 6" is divisible by 7. 

2" + (—1)""! is divisible by 3. 
25?1*1 + 5"*? is divisible by 27. 
3*1*? 4. 5?^*! is divisible by 14. 

7?" + 16n — 1 is divisible by 64. 
10" + 3- 4"*? + 5 is divisible by 9. 
n^ — nis divisible by 3. 

n> + 2n is divisible by 3. 

x" — ] is divisible by x — 1 forx A 1. 


39. 


60. 


6 


— 


63. 


64. 


65. 
66. 


67. 


68. 


69. 


70. 
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Prove DeMoivre’s Theorem: 


(cos Ө + isin 0)" = cos n0 + i sin n0 
for all n = 1. Hint: Recall the addition formulas from trigonometry: 
cos(a + B) = cosa cos В — sina sin B 

sin(a + B) = sin a cos B + cos а sin B 


Prove that 
sin? п 


sin 0 + sin 30 + ··· + sinn — 1)0 = — 
sin 0 


for all п > 1 and all 0 for which sin 0 ¥ 0. 


. Use induction to prove that the product of any three consecutive positive integers is divisible by 3. 
62. 


Suppose that exponentiation is defined by the equation 

xiex = xit! 
for any j = 1. Use induction to prove that x" x” = x"*" forn = 1, т = 1. 
(Hint: Do induction on m for a fixed, arbitrary value of n.) 


According to Example 20, it is possible to use angle irons to tile a4 X 4 checkerboard with the upper right 
corner removed.Sketch such a tiling. 


Example 20 does not cover the case of checkerboards that are not sized by powers of 2. Determine whether 
it is possible to tile a 3 X 3 checkerboard. 


Prove that it is possible to use angle irons to Ше a 5 X 5 checkerboard with the upper left corner removed. 


Find a configuration for a 5 X 5 checkerboard with one square removed that is not possible to tile; explain 
why this is not possible. 


Consider n infinitely long straight lines, none of which are parallel and no three of which have a common 
point of intersection. Show that for п = 1, the lines divide the plane into (n? + n + 2)/2 separate regions. 


A string of 0s and 15 is to be processed and converted to an even-parity string by adding a parity bit to the 
end of the string.(For an explanation of the use of parity bits, see Example 30 in Chapter 9.) The parity 
bit is initially 0. When a 0 character is processed, the parity bit remains unchanged. When a 1 character 
is processed, the parity bit is switched from 0 to 1 or from 1 to 0. Prove that the number of 15 in the final 
string, that is, including the parity bit, is always even. (Hint: Consider various cases.) 


What is wrong with the following “proof” by mathematical induction? We will prove that for any positive 
integer n, n is equal to 1 more than n. Assume that P(A) is true. 


к= К+ 1 
Adding 1 to both sides of this equation, we get 
k+1=k+2 
Thus, 
P(k + 1) is true 
What is wrong with the following “proof” by mathematical induction? 


We will prove that all computers are built by the same manufacturer. In particular, we will prove 
that in any collection of п computers where л is a positive integer, all the computers are built by the 
same manufacturer. We first prove P(1), a trivial process, because in any collection consisting of 


71. 


42; 


73. 


74. 


75. 


76. 
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one computer, there is only one manufacturer. Now we assume Р(/); that is, in any collection of k 
computers, all the computers were built by the same manufacturer. To prove P(k + 1), we consider 
any collection of A + 1 computers. Pull one of these Ё + 1 computers (call it HAL) out of the col- 
lection. By our assumption, the remaining k computers all have the same manufacturer. Let HAL 
change places with one of these k computers. In the new group of k computers, all have the same 
manufacturer. Thus, HAL’s manufacturer is the same one that produced all the other computers, and 
all k + 1 computers have the same manufacturer. 


An obscure tribe has only three words in its language, moon, noon, and soon. New words are composed by 
juxtaposing these words in any order, as in soonnoonmoonnoon. Any such juxtaposition is a legal word. 


a. Use the first principle of induction (on the number of subwords in the word) to prove that any word in 
this language has an even number of o's. 


b. Use the second principle of induction (on the number of subwords in the word) to prove that any word 
in this language has an even number of o's. 


A simple closed polygon consists of n points in the plane joined in pairs by п line segments; each point is 
the endpoint of exactly 2 line segments. Following are two examples. 


a. Use the first principle of induction to prove that the sum of the interior angles of an n-sided simple 
closed polygon is (n — 2)180° for all n = 3. 


b. Use the second principle of induction to prove that the sum of the interior angles of an n-sided simple 
closed polygon is (n — 2)180? for all n z 3. 


The Computer Science club is sponsoring a jigsaw puzzle contest. Jigsaw puzzles are assembled by fitting 
2 pieces together to form a small block, adding a single piece to a block to form a bigger block, or fitting 
2 blocks together. Each of these moves is considered a step in the solution. Use the second principle of 
induction to prove that the number of steps required to assemble an n-piece Jigsaw puzzle is n — 1. 


OurWay Pizza makes only two kinds of pizza, pepperoni and vegetarian. Any pizza of either kind comes 
with an even number of breadsticks (not necessarily the same even number for both kinds). Any order of 2 
or more pizzas must include at least 1 of each kind. When the delivery driver goes to deliver an order, he 
or she puts the completed order together by combining 2 suborders—picking up all the pepperoni pizzas 
from 1 window and all the vegetarian pizzas from another window. Prove that for a delivery of л pizzas, 
n = 1, there are an even number of breadsticks included. 

Consider propositional wffs that contain only the connectives ^, V, and — (no negation) and where wffs 
must be parenthesized when joined by a logical connective. Count each statement letter, connective, or 
parenthesis as one symbol. For example, ((4) ^ (B)) V ((C) ^ (D)) is such a wff, with 19 symbols. Prove 
that any such wff has an odd number of symbols. 

In any group of k people, k = 1, each person is to shake hands with every other person. Find a formula for 
the number of handshakes, and prove the formula using induction. 
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77. Prove that any amount of postage greater than or equal to 2 cents can be built using only 2-cent and 3-cent 
stamps. 


78. Prove that any amount of postage greater than or equal to 12 cents can be built using only 4-cent and 
5-cent stamps. 

79. Prove that any amount of postage greater than or equal to 14 cents can be built using only 3-cent and 
8-cent stamps. 

80. Prove that any amount of postage greater than or equal to 42 cents can be built using only 4-cent and 
15-cent stamps. 


8 


— 


. Prove that any amount of postage greater than or equal to 64 cents can be built using only 5-cent and 
17-cent stamps. 


82. Your bank ATM delivers cash using only $20 and $50 bills. Prove that you can collect, in addition to $20, 
any multiple of $10 that is $40 or greater. 


Exercises 83—84 require familiarity with ideas from calculus. Exercises 1—26 give exact formulas for the sum 


of terms in a sequence that сап be expressed аѕ У f(m). Sometimes it is difficult to find an exact expression 
т=1 

for this summation, but if the value of f(m) increases monotonically, integration can be used to find upper and 

lower bounds on the value of the summation. Specifically, 


n+1 


| rover m ds | лое 


т= 1 
1 


Using the following figure, we can see (on the left) that | f(x)dx underestimates the value of the summation 
п+1 0 


while (on the right) | f(x)dx overestimates it. 


1 


E fo) Те ШЕ? 


ntl 


83. Show that |> ах= > ms | 2x dx (see Exercise 2). 
0 


т= 1 


1 
п+1 


84. Show that |а = Ут = | хах (see Exercise 7). 
0 


т=1 
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SECTION 2.3 | MORE ON PROOF OF CORRECTNESS 


In Section 1.6, we explained the use of a formal logic system to prove mathemati- 
cally the correctness of a program. Assertions or predicates involving the program 
variables are inserted at the beginning, at the end, and at intermediate points between 
the program statements. Then proving the correctness of any particular program 
statement 5; involves proving that the implication represented by the Hoare triple 


{Q} s; iR} (1) 


is true.Here Q and R are assertions known, respectively, as the precondition and 
postcondition for the statement. The program is provably correct if all such impli- 
cations for the statements in the program are true. 

In Chapter 1, we discussed rules of inference that give conditions under 
which implication (1) is true when s; is an assignment statement and when s; is a 
conditional statement. Now we will use a rule of inference that gives conditions 
under which implication (1) is true when s; is a loop statement. We have deferred 
consideration of loop statements until now because mathematical induction is 
used in applying this rule of inference. 


Loop Rule 


Suppose that s; is a loop statement in the form 


while condition В do 
P 
end while 


where B is a condition that is either true or false and P is a program segment. 
When this statement is executed, condition В is evaluated. If B is true, program 
segment P is executed and then В is evaluated again. If B is still true, program 
segment P is executed again, then В is evaluated again, and so forth. If condition 
B ever evaluates to false, the loop terminates. 

The form of implication (1) that can be used when s; is a loop statement 
imposes (like the assignment rule did) a relationship between the precondition and 
the postcondition. The precondition Q holds before the loop is entered; strangely 
enough, one requirement is that О must continue to hold after the loop terminates 
(which means that we should look for a О that we want to be true when the loop 
terminates). In addition, B'—the condition for loop termination—must be true 
then as well. Thus (1) will have the form 


{Q} s: {0 A B'} (2) 


Consider the following pseudocode function, which is supposed to return the value 
x * у for nonnegative integers x and у. 
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Product (nonnegative integer x; nonnegative integer y) 
Local variables: 
integers i, 7 
й = 
у =й 
while 7 # x do 
[к 
i-ictl 
end while 
//j now has the value x ж y 
return j 
end function Product 


This function contains a loop; the condition В for continued loop execution is 
і # x. The condition B’ for loop termination is therefore і = x. When the loop 
terminates, it is claimed in the comment that j has the value x є y. Thus, on loop 
termination, we want 


QAB'—QA^(i = х) 

апа we also want 

ys 
То have both 

Q^A(i = x)andj = х*у 

О must be the assertion 

Ј=іжу 
(Notice that О is a predicate, that is, it states a relationship between variables in 
the program. It is never part of an equation such as О = j.) To match the form of 
(2), the assertion j = i « y would have to be true before the loop statement. This is 
indeed the case because right before the loop statement, 7 = j = 0. 

It would seem that for this example we have a candidate assertion О for impli- 

cation (2), but we do not yet have the rule of inference that allows us to say when 


(2) is a true implication. (Remember that we discovered our О by “wishful think- 
ing" about the correct operation of the function code.) e 


Assertion О must be true before the loop is entered. If implication (2) is to 
hold, Q must remain true after the loop terminates. Because it may not be known 
exactly when the loop will terminate, О must remain true after each iteration 
through the loop, which will include the final iteration. О represents a predicate, 
or relation, among the values of the program variables. If this relation holds among 
the values of the program variables before a loop iteration executes and holds 
among the values after the iteration executes, then the re/ation among these vari- 
ables is unaffected by the action of the loop iteration, even though the values 
themselves may be changed. Such a relation is called a loop invariant. 
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The loop rule of inference allows the truth of (2) to be inferred from an im- 
plication stating that О is a loop invariant. Again, for О to be a loop invariant it 
must be the case that if Q is true and condition B is true, so that another loop itera- 
tion is executed, then О remains true after that iteration, which can be expressed 
by the Hoare triple {О ^ B} P {О}. The rule is formally stated in Table 2.4. 


TABLE 2.4 
From Can Derive Name of Rule Restrictions on Use 
{Q A B} P {Q} {Q} s; {Q ^ B’} | loop s; has the form 
while condition B do 
P 
end while 


To use this rule of inference, we must find a useful loop invariant Q—one that 
asserts what we want and expect to have happen—and then prove the implication 


{О ^ B} P {0} 


Here is where induction comes into play. We denote by Q(n) the statement that 
a proposed loop invariant О is true after п iterations of the loop. Because we do 
not necessarily know how many iterations the loop may execute (that is, how long 
condition В remains true), we want to show that O(n) is true for all n = 0. (The 
value of n — 0 corresponds to the assertion upon entering the loop, after zero loop 
iterations.) 


| EXAMPLE 26 | Consider again the pseudocode function of Example 25. In that example, we 


guessed that Q is the relation 
ji y) 


To use the loop rule of inference, we must prove that О is a loop invariant. 
The quantities x and y remain unchanged throughout the function, but values 
of i and j change within the loop. We let 7, and j, denote the values of i and j, 
respectively, after n iterations of the loop. Then Q(n) is the statement /„ = i, * y. 
We prove by induction that O(7) holds for all n = 0. Q(0) is the statement 


Jo = іржу 


which, as we noted in Example 25, is true, because after zero iterations of the loop, 
when we first get to the loop statement, both 7 and / have been assigned the value 
0. (Formally, the assignment rule could be used to prove that these conditions on і 
and j hold at this point.) 


Assume Q(k): Jj, ржу 
Show Ok + 1): jq = lay 
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Between the time j and i have the values j; and i; and the time they have the values 
Jii and i444, one iteration of the loop takes place. In that iteration, j is changed by 
adding y to the previous value, and i is changed by adding 1. Thus, 


п = fe or Y (3) 
loc (4) 


Then 
Jei =Ј ty (у (3)) 
ї,* y + у (by the inductive hypothesis) 
= (i + Dy 
ia *y (Бу (4)) 
We have proved that Q is a loop invariant. 


The loop rule of inference allows us to infer that after the loop statement is 
exited, the condition О ^ B' holds, which in this case becomes 


j=ixyNi=x 
Therefore at this point the statement 
а, 


is true, which is exactly what ће function is intended to compute. 


Example 26 illustrates that loop invariants say something stronger about the 
program than we actually want to show; what we want to show is the special case 
of the loop invariant on termination of the loop. Finding the appropriate loop in- 
variant requires working backward from the desired conclusion, as in Example 25. 

We did not, in fact, prove that the loop in this example actually does ter- 
minate. What we proved was partial correctness—the program produces the 
correct answer, given that execution does terminate. Because x is a nonnegative 
integer and i is an integer that starts at 0 and is then incremented by 1 at each pass 
through the loop, we know that eventually 7 = x will become true. 


PRACTICE 10 | Show that the following function returns the value x + y for nonnegative integers x and y 
by proving the loop invariant Q: 7 = x + i and evaluating O when the loop terminates. 


Sum (nonnegative integer x; nonnegative integer y) 


Local variables: 
integers i, 7 
i=0 
j= 
while i + у do 
Pore 
і=і+ 1 
end while 


// j now has the value x + y 


return j 


end function Sum 
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The two functions of Example 25 and Practice 10 are somewhat unrealistic; 
after all, if we wanted to compute x є y or x + y, we could no doubt do it with a 
single program statement. However, the same techniques apply to more meaning- 
ful computations, such as the Euclidean algorithm. 


Euclidean Algorithm 


The Euclidean algorithm was described by the Greek mathematician Euclid over 
2300 years ago, although it may have been known even earlier. At any rate, it is 
one of the oldest known algorithms.This algorithm finds the greatest common 
divisor of two positive integers a and b with a > b. The greatest common divisor 
of a and b, denoted by gcd(a, b), is the largest integer n such that n | a and n | b. For 
example, gcd(12, 18) is 6 and gcd(420, 66) = 6. 

First, let's dispense with two trivial cases of the gcd(a, b) that do not require 
the Euclidean algorithm. 


i. gcd(a, a) = a. Clearly a |a and no larger integer divides a. 
ii. gcd(a, 0) = a. Again, a | a and no larger integer divides a, but also a | 0 
because 0 is a multiple of a: 0 — O(a) 


The Euclidean algorithm works by a succession of divisions. To find gcd(a, b), 
assuming that a > b, you first divide a by b, getting a quotient and a remainder. 
More formally, at this point a = др + rı, where 0 = rı < b. Next you divide the 
divisor, b, by the remainder, г, getting b = qarı + r2, where 0 S r, < rı. Again di- 
vide the divisor, г, by the remainder, r2, getting rı = 937: + r3, where 0 = r4 < rz. 
Clearly, there is a looping process going on, with the remainders getting succes- 
sively smaller. The process terminates when the remainder is 0, at which point the 
greatest common divisor is the last divisor used. 


| EXAMPLE 27 | To find gcd(420, 66) the following divisions are performed: 
6 2 1 3 
66)420 24)66 18)24 6)18 


396 48 18 18 
24 18 6 0 
The answer is 6, the divisor used when the remainder became 0. e 


A pseudocode version of the algorithm follows, given in the form of a func- 
tion to return gcd(a, b). 


ALGORITHM EUCLIDEAN ALGORITHM 


GCD (positive integer a; positive integer 5) 
Па > b 
Local variables: 
integers i, j 
= @ 
Ji — ® 
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while j = 0 do 
compute; = qj +7r,0Sr<j 
i-j 
=i 

end while 

// i now has the value gcd(a, b) 

return i; 

end function GCD 


EXAMPLE 28 


We intend to prove the correctness of this function, but we will need one ad- 
ditional fact first, namely, 


(V integers a, b, q, (а = qb + r) > (gcd(a, Б) = ged(b, r)) | (5) 


To prove (5), assume that a = qb + т and suppose that c divides both a and 5 so 
that a = qıc and b = 4с. Then 


r — a — qb = qc — 4с = c(qi = qq) 


so that c divides r as well. Therefore anything that divides a and 5 also divides 5 
and r. Now suppose d divides both b and r so that b = q4d and = 44. Then 


a = qb + т = даза + quad = d(qqs + q4) 


so that 4 divides a as well. Therefore anything that divides b and r also divides a 
and b. Because (a, b) and (b, ғ) have identical divisors, they must have the same 
greatest common divisor. 


Prove the correctness of the Euclidean algorithm. 

Using function GCD, we will prove the loop invariant Q: gcd(i, j) = gcd(a, b) 
and evaluate О when the loop terminates. We use induction to prove Q(n): 
gcd(i,, ja) = gcd(a, b) for all n = 0. Q(0) is the statement 


gcd(ip, jo) = ged(a, Б) 


which is true because when we first get to the loop statement, i and / have the val- 
ues a and b, respectively. 


Assume Q(k):  gced(i,, j,) = ged(a, b) 
Show O(k + 1): ged(ij. i. +1) = ged(a, Б) 


By the assignment statements within the loop body, we know that 


lee] — Jk 
Jk — Fk 
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Then 


Bcd igs fpei) = gcd( ji, ry) 
= ged(i, jx) by (5) 
= gcd(a, b) by the inductive hypothesis 


О is therefore a loop invariant. At loop termination, ged(i, j) = gcd(a, b) andj = 0, 
so gcd(i, 0) = gcd(a, b). But gcd(i, 0) is 1, so i = gcd(a, b). Therefore function 
GCD is correct. ө 


SPECIAL INTEREST PAGE 


Making Safer Software 


Proof of correctness seeks to verify that a given com- 
puter program or segment of a program meets its speci- 
fications. As we have seen, this approach relies on 
formal logic to prove that if a certain relationship (the 
precondition) holds among the program variables before 
a given statement is executed, then after execution an- 
other relationship (the postcondition) holds. Because of 
the labor-intensive nature of proof of correctness, its use 
is typically reserved for critical sections of code in im- 
portant applications. 

The B method is a set of tools that does two things: 


1. Supports formal project specification by means 
of an abstract model of the system to be devel- 
oped. This support includes both automatic gen- 
eration of lemmas that must be proven in order 
to guarantee that the model reflects the system 
requirements and automatic proof tools to prove 
each lemma or flag it for human verification 
assistance. 

2. Translates the abstract model into a code-ready 
design, again using lemmas to ensure that the 
design matches the abstract model. The final 
level can then be translated into code, often us- 
ing the Ada programming language, described 
by its proponents as “the language designed for 
building systems that really matter.” 


One of the most interesting applications of proof 
of correctness, based on the B method, is the develop- 
ment of software for the Paris Météor train. This is part 
of the Paris metro train system designed to carry up to 
40,000 passengers per hour and per direction with an 
interval between trains as low as 85 seconds on peak 
hours. The safety-critical part of the software includes 
the running and stopping of every train, opening and 


ө Chapter 


closing of doors, electrical traction power, routes, 
speed of trains, and alarms from passengers. By the 
end of the project, 27,800 lemmas had been proven, 
with 92% proven automatically (with no human inter- 
vention). But here is the amazing part: the number of 
bugs in the Ada code found by testing on the host com- 
puter, the target computer, on site, and after the system 
was put into operation was—0. Zero, nada, none. Very 
impressive indeed. 

Other formal method systems have been used for 
critical software projects, such as 


* Development of a left ventricular assist device 
that helps the heart pump blood in those with 
congestive heart failure. The eventual goal is an 
artificial heart. 

* "Conflict detection and resolution algorithms" 
for safety in air traffic control 

* Development of the Tokeneer ID Station soft- 
ware to perform biometric verification of a hu- 
man seeking access to a secure computing en- 
vironment. Tokeneer is a hypothetical system 
promoted by NSA (National Security Agency) as 
a challenge problem for security researchers. 


Formal Verification of Large Software Systems, Yin, X., 
and Knight, J., Proceedings of the NASA Formal 
Methods Symposium, April 13-15, 2010, Washington 
D.C., USA. 

http://libre.adacore.com/academia/projects-single/echo 

http://shemesh. larc.nasa.gov/fm/fm-atm-cdr.html 

*Météor: A Successful Application of B in a Large Project,” 
Behm, P., Benoit, P., Faivre, A., and Meynadier, J., 
World Congress On Formal Methods in the Develop- 
ment of Computing Systems, Toulouse, France, 1999, 
vol. 1709, pp. 369—387. 
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SECTION 2.3 REVIEW 


TECHNIQUES MAIN IDEAS 
T 
wW Verify the correctness of a program segment that ° A loop invariant, proved by induction on the num- 
includes a loop statement. ber of loop iterations, can be used to prove correct- 
* Compute gcd(a, b) using Euclid's algorithm. ness of a program loop. 


* The classic Euclidean algorithm for finding the 
greatest common divisor of two positive integers is 
provably correct. 


EXERCISES 2.3 


1. Let Q: x? >x + 1 where x is a positive integer. Assume that О is true after the kth iteration of the follow- 
ing while loop; prove that Q is true after the next iteration. 


while (x > 5) and (x < 40) do 
x=xt+l1 
end while 


2. Let О: x! > 3" where x is a positive integer. Assume that О is true after the kth iteration of the following 
while loop; prove that Q is true after the next iteration. 
while (x > 10) and (x < 30) do 
x=xt+2 
end while 


In Exercises 3—6, prove that the pseudocode program segment is correct by proving the given loop invariant О 
and evaluating О at loop termination. 


3. Function to return the value of x! for x = 1. 


Factorial (positive integer x) 
Local variables: 
integers i, j 
i=2 
j=l 
while і # x + 1 do 
j-j*i 
і= 1+1 
end while 
//j now has the value x! 
return j 
end function Factorial 


Q:jJ= =l)! 
4. Function to return the value of x? for x = 1. 


Square (positive integer x) 
Local variables: 
integers i, 7 
ї = 1 
j=l 
while i 5 x do 
j^j*2i*1 
i-i-cl 
end while 
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//j now has the value x? 
return j 
end function Square 
gs 
5. Function to return the value of x” for x, y = 1. 


Power (positive integer x, positive integer y) 
Local variables: 
integers i, j 
i=l 
j=x 
while i = y do 
J5j*x 
i=it+l 
end while 
//j now has the value x” 
return j 
end function Power 


0:ј= х! 


6. Function to compute and write out quotient q and remainder r when x is divided by y, х = 0, у = 1. 


Divide (nonnegative integer x; positive integer y); 
Local variables: 
nonnegative integers q, r 
q=0 
r=x 
while > y do 
4=9 +1 
r=r-y 
end while 
//q and т are now the quotient and remainder 


write(^The quotient is” д “and the remainder is” r) 


end function Divide 


O:x=q*ytr 


For Exercises 7—12 use the Euclidean algorithm to find the greatest common divisor of the given numbers. 


7. (308, 165) 
8. (2420, 70) 

9. (735, 90) 

10. (8370, 465) 

11. (1326, 252) 

12. (1018215, 2695) 


13. Following is the problem posed at the beginning of this chapter. 


The nonprofit organization at which you volunteer has received donations of 792 bars of soap and 
400 bottles of shampoo. You want to create packages to distribute to homeless shelters such that 
each package contains the same number of shampoo bottles and each package contains the same 


number of bars of soap. How many packages can you create? 


Explain why the solution to this problem is the gcd(792, 400). 
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14. Concerning the question posed in Exercise 13: 


a. Use the Euclidean algorithm to find the number of packages. 


b. How many bottles of shampoo are in each package? 


c. How many bars of soap are in each package? 


In Exercises 15—21, prove that the program segment is correct by finding and proving the appropriate loop in- 


variant О and evaluating О at loop termination. 


15. Function to return the value x * у” forn = 0. 


Computation (integer x; integer у; nonnegative integer л) 
Local variables: 
integers i, j 
i-0 
j^x 
while i #7 do 
fairy 
i=it+l 
end while 
//j now has the value x * y" 
return j 
end function Computation 


16. Function to return the value x — y for x, у = 0. 


Difference (nonnegative integer x; nonnegative integer y) 
Local variables: 


integers i, j 
ї= 0 
j=x 
while i y do 
ger 
і= 1+1 
end while 
//j now has the value x — y 
return j 
end function Difference 


17. Function to return the value (x + 1)? forx = 1. 


IncrementSquare (positive integer x) 
Local variables: 
integers i, j 
i=1 
j=4 
while i ~ x do 
j=j+2i+3 
i=i+l 
end while 
//j now has the value (x + 1)? 
return j 
end function /ncrementSquare 
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18. Function to return the value 2" for n = 1. 


TwosPower (positive integer п) 
Local variables: 
integers 7, 7 
i=l 
j=2 
while i = n do 
j=j*2 
i=i+l 
end while 
//j now has the value 2” 
return j 
end function TwosPower 


19. Function to return the value x * n! forn > 1. 


AnotherOne (integer x; positive integer n) 
Local variables: 
integers i, j 
i=1 
ј = х 
while i = п do 
j=j*(i+1) 
i=it+l 
end while 
//j now has the value x * n! 
return j 
end function AnotherOne 


20. Function to return the value of the polynomial a,x" + а, х"! + +++ + ax + ay ata given value of x. 


Polly (real a,; .. 
Local variables: 
integers i, j 
i=n 
Ј = а, 
while i = 0 do 
/=/]*х + an 
і= 1-1 
end while 
//j now has value of the polynomial evaluation 
return j 
end function Polly 


. ; real ао; real x) 


21. Function to return the maximum value from the first n entries a[1], a[2], 
distinct integers. 


ArrayMax (integers n, a[1], a[2], ... , a[n]) 
Local variables: 
integers i, j 

i-l 


... a[n], n = 1, in an array of 
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j= a[l] 
while i 7 n do 
i=it+l 
if a[i] >j then j = a[i] 
end while 
//j now has the value of the largest array element 
return j 
end function ArrayMax 


22. Following are four functions intended to return the value a[1] + a[2] + --- + a[n] for n = 1 (the sum of 
the first n entries in an array of integers). For those that do not produce correct results, explain what goes 
wrong. For those that do produce correct results, do a proof of correctness. 


a. ArraySumA (integers n, a[1], a[2], ... , a[n]) 
Local variables: 
integers i, j 


ї= 0 
j^0 
while i = n do 
i=it+l 
j 7j + ali] 
end while 
//j now has the value a[1] + a[2] + --- + a[n] 
return j 


end function ArraySumA 


b. ArraySumB (integers n, а[1], a[2], ... , a[n]) 
Local variables: 
integers i, j 
i=l 
j=0 
while і = n do 
у=] + ali] 
і= 1+1 
end while 
//j now has the value a[1] + a[2] + --- + a[n] 
return j 
end function ArraySumB 


c. ArraySumC (integers n, a[1], a[2], ... , a[n]) 
Local variables: 
integers i, j 


ї= 0 
j^0 
while i = n do 
у=] + ali] 
і= 1+1 
end while 
//j now has the value a[1] + a[2] +--+ + a[n] 
return j 


end function ArraySumC 
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d. ArraySumD (integers n, a[1], a[2], ... , a[n]) 
Local variables: 
integers i, 7 
i=l 
j= а 
while і = n do 
j=j+aļi+ 1] 
і= 1+1 
end while 
//j now has the value a[1] + a[2] +--+: + a[n] 
return j 
end function ArraySumD 


Exercises 23—28 concern a variation of the Euclidean algorithm more suited to computer implementation. The 
original GCD algorithm relies on repeated integer divisions to compute remainders. The following variation, 
called the binary GCD algorithm, also uses divisions, but only by 2, plus subtraction and testing for parity (odd- 
ness or evenness). Given that the numbers are stored in the computer in binary form, these operations are simple 
and often done using built-in circuits. Testing for even/odd can be done using bitwise conjunction of N & 1, 
which results in 1 if and only if N is odd. Given an even N, division by 2 is easily accomplished by a 1-bit right 
shift operation, where all bits are shifted one place to the right (the rightmost bit disappears), and the leftmost 
bit is set to 0. (Multiplication by 2 is a 1-bit left shift.) Subtraction involves the 2’s complement. As a result, the 
binary gcd algorithm, which avoids regular division, runs faster than the Euclidean algorithm, even though it 
does more (but simpler) steps. The binary GCD algorithm relies on three facts: 


23. 


24. 


25. 


1. If both a and b are even, then gcd(a, b) = 2ged(a/2, b/2). 
2. If ais even and b is odd, then gcd(a, Б) = gced(a/2, Б). 
3. If a and b are both odd, and a = b, then gcd(a, b) = ged((a — b)/2, Б). 


To prove that if both a and b are even, then gcd(a, b) = 2gcd(a/2, b/2), let a and b be even integers. Then 
2 15 a common factor of both a and b, so 2 is a factor of gcd(a, b). Let 2c = gcd(a, b). Then 


a = n(2c) and b = m(2c) 
a/2 = nc and b/2 = mc 
so c|a/2 and c | b/2. Finish this proof by showing that c = gcd(a/2, b/2). 


To prove that if a is even and b is odd, then gcd(a, Р) = gced(a/2, Б), note that because b is odd, 2 is not a 
factor of b, hence not a factor of gcd(a, Б). Therefore all contribution to gcd(a, b) comes from b and a/2, 
and gcd(a, Б) = ged(a/2, b). Write an equation for gcd(a, b) when a is odd and b is even. 


We want to prove that if a and b are both odd, and a = b, then ged(a, b) = ged((a — by2, Б). If a and 
b are both odd and a = b, then gcd(a, Б) = gcd(a — b, Б) because, from the regular Euclidean algorithm, 
gcd(a, Б) begins with a = qb + r, 0 Sr < b and gcd(a — b, b) begins with a-b = (q — Dbt+r,0Sr<b. 
The next step in either case is to divide b by r, so the two final answers will be the same. Finish this proof 
by showing that gcd(a — b, b) = gcd((a — b)/2, b). 
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26. To find gcd(420, 66) using the binary GCD algorithm takes the following steps (compare with Example 27). 
You can do these steps in your head! 


420 

210 
105 
36 


66 
33 
33 
33 
33 
33 


3 


Fact | Save the 2 factor to multiply at the end 
Fact 2 
Fact 3 
Fact 2 
Fact 2 


Fact 3 [Because gcd(a, b) = gcd(b, а), it doesn’t matter whether the 
larger number is first or second or whether the even number is 
first or second. ] 


Fact 2 
Fact 2 
Fact 3 
Fact 3 


One number is now 0, so the other number is a factor in the gcd, therefore gcd(420, 66) = 2*3 = 6 (the 
factor of 2 comes from the very first step). 


Use the binary GCD algorithm to find gcd(24, 20). 


27. Use the binary GCD algorithm to find gcd(308, 165) [see Exercise 7]. 
28. Use the binary GCD algorithm to find gcd(2420, 70) [see Exercise 8]. 


SECTION 2.4 


NUMBER THEORY 


In Section 2.1 we proved several elementary number theory results, such as “The 
product of two even integers is even.” These proofs relied on basic definitions and 
the standard proof techniques (direct proof, proof by contraposition, and proof by 
contradiction). Now that we have some additional ammunition, we can prove more 
number theory results. Number theory is fun because conjectures can be stated 
easily—after all, only integers are involved—yet sometimes it can be quite dif- 
ficult to prove. As an extreme case, Fermat’s last theorem states that there are no 
positive integers x, y, and z for which 


п п 


ху = 2 


for any integer n > 2. (There аге solutions for n = 2, such as 3? + 4? = 5?) Pierre 
de Fermat stated this result around 1637 but—although many false “proofs” were 
published in the interim—it took until 1995 before a proof was found, using very 
complicated mathematics, by Dr. Andrew Wiles of Princeton University. We are 
interested in number theory, however, because of its usefulness in computer secu- 
rity (see Section 5.6). 


REMINDER 


A prime number is an 
integer > 1 that is divisible 
only by itself and 1. 
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The Fundamental Theorem of Arithmetic 


We'll start by expanding on a result we proved using the second principle of math- 
ematical induction in Example 23, Section 2.2: For every integer n = 2, n is a 
prime number or a product of prime numbers. 

In fact, a stronger statement can be made. 


* THEOREM THE FUNDAMENTAL THEOREM OF ARITHMETIC 


For every integer п = 2, n is a prime number or can be written uniquely (ignoring 
ordering) as a product of prime numbers. 


The new part is that there is only one way to factor a composite number 
into prime factors if we ignore the order in which we write the factors. Thus we 
consider 


2(3)(3) = 32)(3) 


to be the same factorization of 18. We intuitively accept the uniqueness idea— 
how else could you factor 18 into prime factors? A formal proof requires quite a 
bit of preparation, and it begins with revisiting the idea of the greatest common 
divisor of two positive integers. 

The Euclidean algorithm to find gcd(a, b) was given in Section 2.3. It turns out 
that if a and b are positive integers, then gcd(a,b) can always be written as a linear 
combination of a and 5; that is, 


gcd(a, b) = ia + jb for some integers i and j 


In Section 2.3 we learned, using the Euclidean algorithm, that gcd(420, 66) — 6. 
And 6 can be written as a linear combination of 420 and 66: 


6 — 3(420) — 19(66) e 


Although it's easy to verify in Example 29 that 3(420) — 19(66) indeed has the 
value 6, the coefficient values of 3 and —19 seem mysterious. They are not just 
pulled out of the air, however; in fact they are derived from the successive divi- 
sions done by the Euclidean algorithm. 


The successive divisions performed by the Euclidean algorithm in finding 
gcd(420, 66) can be written as follows (see Example 27 in Section 2.3): 


420 = 6:66 + 24 
66 = 2:24 + 18 
24= 1-18 + 6 


Ig = 3° ar 


* THEOREM 
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Rewriting the first three equations from the bottom up, 


6 — 24 — 1:18 
18 = 66 — 2-24 
24 = 420 — 6:66 


Now we use these equations in a series of substitutions: 


6 = 24 — 1:18 = 24 — 1: (66 — 2-24) (substituting for 18) 


— 3-24 — 66 
— 3- (420 — 6-66) — 66 (substituting for 24) 
= 3-420 — 19-66 
which reveals the linear combination of 420 and 66 that gives the value 6. e 


The Euclidean algorithm gives us a way to express gcd(a, b) as a linear combi- 
nation of a and b, but there is another way to characterize this linear combination. 


ON GCD(a, b) 
Given positive integers a and b, gcd(a, Б) is the linear combination of a and b 
that has the smallest positive value. 


To prove this result, we need to make use of the principle of well-ordering 
that we mentioned in Section 2.2, namely, that every collection of positive inte- 
gers that contains any members at all has a smallest member. The collection we 
have in mind consists of all positive linear combinations of a and b, and certainly 
such numbers exist (as a trivial example, 1-а + 1-5). By the principle of well- 
ordering, there is a least such number c = ia + jb where i and j are integers. The 
theorem claims that c = gcd(a, b), that is, that c | a, c | b, and c is the largest integer 
that divides both a and b. 

To prove that с |а, we'll use a proof by contradiction. Suppose c + a. Then 
when we divide a by c there is a nonzero remainder 


a=met+r with т an integer and 0 < r < c 
Rewriting this equation, 
r-—a- mc 


= а — m(ia + jb) 
(1 — mi)a — (mj)b 


which makes r a positive linear combination of a and b, that is, r is a member of 
our collection, but r < c, which is a contradiction because c was the least member 
of this collection. Therefore c | a. In the same way we can show that c | b. Conse- 
quently с is a common divisor of a and b; by Practice 11, c is the greatest common 
divisor of a and b, which completes the proof of the theorem. 
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PRACTICE 11 


a. Prove that if d is a positive integer such that d|a and d|b, then d|c, where c = ia + jb. 
b. Prove that if d |с, then c = d. m 


* DEFINITION RELATIVELY PRIME 
Two integers a and b are relatively prime if gcd(a, b) = 1. 


From the theorem on gcd(a, b), it follows that a and b are relatively prime if 
and only if there exist integers i and j such that 


ia +jb= 1 


PRACTICE 12 | The integers 21 and 16 are relatively prime. Find i and j such that i (21) + j(16) = 1. > 


Recall that a prime number is an integer р > 1 that is not divisible by any 
integers other than | and p. If a is an integer that is a multiple of a prime p, 
then clearly p | p and p | a, so gcd(a, р) = p. But if a is not a multiple of p, then 
gcd(a, p) = 1 because nothing else divides p. Therefore all integers are either 
multiples of p or are relatively prime to p. 

Suppose that p is a prime number that divides the product ab of integers a and b. 
Because р is “irreducible,” p must divide either a or b. More formally, if p does not 
divide a, that is, a is not a multiple of p, then a is relatively prime to p, gcd(a, p) — 1, 
and there exist integers i and j such that 


1 = ia + jp 
Multiplying this equation by b, 
b = (ia)b + (jp)b = i(ab) + Cjp)b 


Because p | ab, ab can be written as kp, where k is an integer, so the previous 
equation becomes 


b = i(kp) + (jp)b = (ik  jb)p ik + jb an integer 
so that p | b. This proves the following theorem. 


* THEOREM ON DIVISION BY PRIME NUMBERS 
Let p be a prime number such that p | ab where a and b are integers. Then either 
plaorp|b. 


PRACTICE 13 | Extend the theorem on division by prime numbers as follows: Let p be a prime num- 
ber such that p | аа... a, where each a;is an integer. Then p | a; for some j, 1 Sj = k. 
(Hint: You want to prove that this is true for every positive integer A—What proof technique 
should you use?) 


PRACTICE 14 


Section 2.4 Number Theory 147 


Find the unique factorization of 1176 as a product of primes. 


Finally we are ready to prove that the factorization of a composite number 
n > 2 into prime factors is unique (save for ordering). If n is a composite number, 
then л can be written as a product of primes: 
n= piP: P, where p, S p; =... S р. апі each p; is a prime number 
Now suppose that n can also be written as 


n = 4192 `` ds where q, = 45 = ++ = q, and each q; 15 a prime number 


Then 


Р\Рә``` Pr = did2^ 7^ ds 


We are assuming that these two representations are different, but they might still 
have some factors in common on both sides of the equation; let’s assume these 
have been divided out. Then 


P\|PiP2*** Py 


50 


р114192 `` "9 
By Practice 13, p, |q; for some i, 1 = i = s. However, q; is a prime number, divis- 


ible only by itself and 1, which would mean that p, = q;. This is a contradiction 
because we already eliminated common factors. 


To find the unique factorization of 825 as a product of primes, we can start by 
simply dividing 825 by successively larger primes (2, 3, 5, and so on). 


2 + 825 
825 = 3:275 = 35-55 = 3-5- 5-10 = 3- 9-1 


Similarly, 
455 = 5-7: 13 
From these factorizations we can see that gcd(825, 455) = 5. Decomposing two 


positive integers into their respective prime number factorizations is another way 
(besides the Euclidean algorithm) to determine their greatest common divisor. Ф 


PRACTICE 15 


Find ged(420, 66) by unique factorization into products of primes (see Example 27). 
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We have now completed the proof of the fundamental theorem of arithmetic. 
Note, however, that this is an existence result. It says that for any integer n = 2 
that is not prime, there exists a unique factorization as a product of primes. But 
this does not tell us 


a. how to decide whether n is prime. 
b. if is not prime, how to find the prime factors of n. 


Neither of these problems has an efficient algorithmic solution. The approach in 
Example 31 of dividing a number n by successively larger primes accomplishes 
both tasks—if there are prime factors of n, they will be discovered and if there 
are none, then и is prime. However, this approach becomes very labor intensive 
when 7 is large. 


More on Prime Numbers 


Anything with a title as imposing as the fundamental theorem of arithmetic must 
be fairly important. We can use this theorem to discover several more results 
about prime numbers. 

Given a positive integer п, suppose we test for prime factors by dividing n by 
successively larger primes. Clearly we can stop with the largest prime less than or 
equal to л, but in fact we can stop with the largest prime less than or equal to V/n. 
Ifn can be factored in a nontrivial way as n = st, then s and t cannot both be greater 
than Vn because then their product would be greater than n. Therefore one of s 
and t, let's say s, must be less than or equal to. Vn. By the fundamental theorem of 
arithmetic, s is either prime or can be written as a product of primes. In either case, 
there is a prime factor less than or equal to V/z, which proves the following theorem. 


* THEOREM ОМ SIZE OF PRIME FACTORS u 
If n is a composite number, then it has a prime factor less than or equal to Vn. 


Given n = 1021, let's find the prime factors of n or determine that л is prime. The 
value of V 1021 is just less than 32. So the primes we need to test are 2, 3, 5, 7, 11, 
13, 17, 19, 23, 29, 31. None divides 1021, so 1021 is prime. e 


How many prime numbers are there? An infinite number. 


* THEOREM ON INFINITY OF PRIMES (EUCLID) 
There is an infinite number of prime numbers. 


Proof: Assume that there is a finite number of primes, listed as ру, po, ..., py. 
Consider the number s = pip; · · : p, + 1. The integer s is greater than any of the 
primes рі, ... , рь, Which we assumed is the total list of primes, therefore s is not 
prime. Thus s is composite and, by the fundamental theorem of arithmetic, 5 can 
be factored as a product of (some of) the prime numbers. Suppose that p; is one of 
the prime factors of s, that is, s — p;(m) for some integer m. Then 


l =s = pp»: Pe = p(m) — рі ‘`` Pe = pm — pisc pjaDja ``" pg 
Therefore p;| 1, which is a contradiction. End of Proof 
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Although Euclid’s theorem says that there is always another prime number 
ahead as we march through the positive integers, the distribution of primes among 
the integers is erratic. Contrary to what one might think, the primes do not get 
farther and farther apart. Even among the small primes, 23 and 29 are farther 
apart than 29 and 31. 


In Example 11, we did a proof by contradiction that V2 is not a rational number. 
The same argument works for УЗ and V5 (2, 3, and 5 are all prime numbers). 

We can generalize this result from a single prime to any integer x that is the 
product of an odd number of primes. Assume that by the fundamental theorem of 
arithmetic, x = рур `` ‘рор (In this example, we are not using any exponents, so 
some of these primes could be identical; that is, 75 = 3*5*5). Again doing a proof 
by contradiction, assume that Vpip;: · ‘рор = a/b where a and b are integers, 
b 5 0, and a and b are relatively prime. Then 


a 


[yoy о оь = m 


а? = Pip2°* “Pab 


By the fundamental theorem, a can be written as a product of one or more primes, 
but a’ will add another factor of each of a’s primes, resulting in a product of an even 
number of primes. Similarly, 5? is the product of an even number of primes. Therefore 
Dipa *Pox+ 1b" is the product of an odd number and an even number (which gives 
an odd number) of prime factors. Contradiction: а? has an even number of prime 
factors while pip»: °рәк+1Ь” has an odd number of prime factors, yet by the fun- 


damental theorem, factorization is unique. e 


The search for prime numbers and information about primes has generated 
much interest. As of June 2013, the largest known prime number is MSIE y 
which is a number with 17,425,170 decimal digits. If we consider that for a reason- 
able type size it takes about 1 inch to print 10 digits, then it would take more than 
27 miles just to print out a number of this size! 

One of the oldest conjectures about prime numbers—still unsolved—is Gold- 
bach's conjecture, formulated in 1742: Every even integer greater than 2 is the 
sum of two prime numbers. 


Euler Phi Function 


Ф DEFINITION EULER PHI FUNCTION 
For n an integer, n = 2, the Euler phi function of n, ф(л), is the number of posi- 
tive integers less than or equal to n and relatively prime to л. (ф(и) is pronounced 
"fee" of n.) 
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Notice that n will never be relatively prime to л, so this definition could have 
been stated as, “the number of positive integers less than n and relatively prime to 
n," but it turns out to be convenient to include equality. 


| EXAMPLE 34 | The first few values of (7), together with the numbers that give those values, are 


the number 1) 

the numbers 1, 2) 

the numbers 1, 3) 

the numbers 1, 2, 3, 4) 

the numbers 1, 5) 

the numbers 1, 2, 3, 4, 5, 6) e 


SUL USUS 


PRACTICE 16 | If p is a prime number, prove that Ф (p) = p — 1. 


For small n, it is easy to compute q(r) by a brute-force approach of just try- 
ing values < n to find how many are relatively prime to л. But there is actually a 
formula to compute ф(л), which we'll derive now. 


Using the fundamental theorem of arithmetic, write the positive integer n in its 
factored form as a product of primes, where if the same prime p occurs m times, it 
18 written as p". Suppose, for example, that 


Mp Illo, ls 
3 


n —pipo»p 


To compute ф(л), we'll count all the positive integers = n, of which there are л, 
and throw out those that are not relatively prime to n, Now let 4;, 1 = i = 3, be 
defined as the collection of all positive integral multiples of p; that are = n; these 
numbers share a common factor of p; with n and so are not relatively prime to n. 
The integral multiples of p; that are = n are p; 2P; 3P; ... ‚ n. How many numbers 
are in this list? Exactly the number of times you can divide n by p; or n/p;. So, 
denoting the size of A; by | А], we know that |A,| = n/p,. If we combine А), 45, and 
Аз, that will be all integers = n that are not relatively prime to и. How many such 
integers are there? We can't just add |4;| + |A,| + |A3| = n/p, + n/p, + n/p, because 
there could be some numbers that appear in more than one of the three collec- 
tions, and would therefore be counted twice. Numbers that appear in both А; and 
Aj, i 7 j, are integral multiples of both p; and р, so there will be n/(p;p;) of them. 
We'll subtract those for all i, 7 combinations. But by doing this, we have subtracted 
any numbers in all three collections three times, which means they now are not 
counted at all, so we have to add back the л/(рур» рз) numbers that are in all three 
collections.” Therefore 


?This lengthy discussion is an instance of the principle of inclusion and exclusion, discussed in Chapter 4. 
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m (2 iS n n n n ) 
= 10 T 1 
i Di P2 Ps Pip Ррз  PiPs  PiP2P3 


(1 1 1 1 1 1 | 1 1 ) 
= 10 T 1 
Di Р Рз Рр Pops Dips PiP2P3 


— {1% = Ры» = [йү = [ү ЛЕ /@ a [д “Р ЕП (adding 
PiP2P3 fractions) 


ар (‘ Pi Dips = Wp; 2) (check this by multiplying out the 


PiP2P3 numerator) 
т (рі = Xp- ps — 1) 
PiP>P3 pi P2 P3 
= p" pp ps e(p)e(p)e(p) (1) 


Equation (1) expresses ф(л) in terms of the Euler phi function of its prime factors, 


which are known to us (see Practice 16). : 


Equation (1) in Example 34 gives the formula for ф(л) where n has 3 distinct 
prime factors. It is easy to extend this equation to the more general case where n 
has an arbitrary number of prime factors. If 


т 


= тә... т 
п = рі рэ? `` рр" 


Шеп 


m,—-ly,m;—l, 


p(n) = pr pp ! pE [e(p)o(py ++: Ф(р)] (2) 


| EXAMPLE 96 | For = 133848 = 2?- 3* - 11+ 13°, 


p(n) = 22-3- 13[9(2)9(3)(11)9(13)] = 4: 3- 13[1- 2-10-12] = 37440 ө 


Equation (2) requires that we know the prime factorization of n, so it does not 
avoid the difficulty we noted earlier of factoring large values of n. 


PRACTICE 17 | For n = 34: 5 · 72, compute ф(л). 
[| 
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SECTION 2.4 REVIEW 
TECHNIQUES * Two integers a and b are relatively prime if a linear 


Q) Write the ged(a, b) as a linear combination of a and b. 

* Test whether a given positive integer is prime or, if 
not, find its prime factorization. 

e Compute the Euler phi function ф(л) for a positive 


combination of a and b can be found that equals 1. 
* [fn is not prime, it has a prime factor no greater 
than Vn. 
* Aninfinite number of prime numbers exist. 
* There is no efficient algorithm to decide whether a 


шера positive integer п is prime, or to find the prime fac- 
MAIN IDEAS tors ofn ifn is not itself prime. | | 
* Given n written as a product of primes, there is a 
* Every integer = 2 is prime or can be uniquely fac- formula to compute ф(л), the number of positive 
tored into prime numbers (fundamental theorem of integers = n and relatively prime to n. 
arithmetic). 


EXERCISES 2.4 


Exercises 1—6 refer to Exercises 7-12 in Section 2.3 


Nn BW мю н 


. Write gcd(308, 165) as a linear combination of 308 and 165. 

. Write gcd(2420, 70) as a linear combination of 2420 and 70. 

. Write gcd(735, 90) as a linear combination of 735 and 90. 

. Write gcd(8370, 465) as a linear combination of 8370 and 465. 

. Write gcd(1326, 252) as a linear combination of 1326 and 252. 

. Write gcd(1018215, 2695) as a linear combination of 1018215 and 2695. 


For Exercises 7-12, test whether n is prime and, if not, find its decomposition as a product of primes. 


11. 
12. 


. n= 1729 
. n = 1789 
. n= 1171 
.n = 1177 
n = 8712 
n = 29575 


Exercises 13—18 refer to Exercises 7—12 in Section 2.3 


13. 
14. 
15. 
16. 
17. 
18. 
19. 


20. 


Find оса(308, 165) by unique factorization into products of primes. 

Find gcd(2420, 70) by unique factorization into products of primes. 

Find gcd(735, 90) by unique factorization into products of primes. 

Find gcd(8370, 465) by unique factorization into products of primes. 
Find gcd(1326, 252) by unique factorization into products of primes. 
Find gcd(1018215, 2695) by unique factorization into products of primes. 


The least common multiple of two positive integers a and b, lcm(a, b) 15 the smallest integer n such that 
a | n and b | n. Like the gcd(a, b), the lem(a, Б) can be found from the prime factorizations of a and b. 
Describe (in words) the ged and the lcm in terms of the prime factors of a and b. 


Prove that for any positive integers a and b, a · b = gcd(a, b) · lem(a, Б). (Hint: consider both a and b in 
their factored form as a product of primes.) 
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For Exercises 21—24, find the gcd and Іст of the two numbers given. 


29. 


30 


212 -3411,5 =2:3- 11* «13 

.a=2 S.T, bus. T 

83:58 1, 5o 35-4 T7 

des 114299 b= 5 «qr 

. Prove that for any positive integers a and b, gcd(a, b) = gcd(a, a + b). 
. Prove that gcd(n, n + 1) = 1 for all positive integers n. 


. Find an example where n | ab but n / a and n ¥ b. Does this violate the theorem of division by prime 
numbers? 

. The division of a full circle into 360? probably dates back to the early Persian calendar from around 
700 в.с. that used 360 days in a year, so one day represented a rotation of 1/360 of other stars around the 
North Star. But it was also chosen because it is divisible by so many factors, avoiding the need to deal with 
fractions. Find the distinct nontrivial (but not necessarily prime) factors of 360. 


Prove that there exist three consecutive odd positive integers that are prime numbers. 


. Prove that for any positive integer п, there exist п consecutive composite numbers. [Hint: Start with 
(n + 1)! + 2] 


For Exercises 31—34, find ф(л) together with the numbers that give those values. 


31 


32. 
33. 
34. 
35. 


36. 


37. 
38. 


n=8 
n=9 
п = 10 
п = 11 


By Practice 16, if p is prime then ф(р) = p — 1. Prove that this is an “if and only if” condition by proving 
that if (7) = n — 1 for a positive integer n > 1, then n is prime. 


For any prime number p and any positive integer k, ep") = p'ep). Although this result follows directly 
from Equation (2) in this section, give a direct proof using the definition of the Euler phi function. 


Compute ф(2°) and state the numbers being counted. (Hint: See Exercise 36.) 
Compute Ф(3?) and state the numbers being counted. (Hint: See Exercise 36.) 


For Exercises 39—42, compute (7). 


39. 
40. 


4 


— 


44. 
45. 
46. 


4T. 


п = 117612 = 2- . 112 
п = 233206 = 2 · 17 · 193 


ги = 1795625 = 5f - 13° -17 
42. 
43. 


n = 1,690,541,699 = 7*- 11? · 22? 

If p and q are prime numbers with p # q, then ф(ра) = «(p)e(q). Although this result follows directly 
from Equation (2) in this section, give a direct proof using the definition of the Euler phi function. 

Prove that if r and s are relatively prime, then ọ(rs) = ẹ(r) Ф(5). 

Prove that for n and m positive integers, ф(и") = п" '(n). 


Except for n = 2, all the values of ф(л) in Example 34 are even numbers. Prove that (7) is even for all 
n2. 


A particular class of prime numbers is known as Mersenne primes, named for a French monk and math- 
ematician of the seventeenth century who studied them. Mersenne primes are numbers of the form 2? — 1 
where p is a prime, but not all numbers of this form are primes. For example, 2! — 1 = 23 · 89 is not 


48. 


49. 


50. 


5 


— 


22: 


53. 
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prime. The largest known prime number as of June 2013 is 2°”***!°' — 1, a Mersenne prime. There hap- 


pens to be a particularly efficient algorithm for testing numbers of the form 2” — 1 for primality, which is 
why almost all of the largest known primes are Mersenne primes. In recent years, most of these Marsenne 
primes have been discovered (and verified) by the GIMPS (Great Internet Mersenne Prime Search) dis- 
tributed computing project, a worldwide group of volunteers who collaborate over the Internet to test for 
Mersenne primes. 


Find the first 4—the 4 smallest—Mersenne primes. 


Goldbach’s conjecture states that every even integer greater than 2 is the sum of two prime numbers. 
Verify Goldbach’s conjecture for 


an=8 
b. n= 14 
c. n=28 


A perfect number is a positive integer n that equals the sum of all divisors less than n. For example, 6 is a 
perfect number because 6 = 1 + 2 + 3. Perfect numbers are related to Mersenne primes (see Exercise 47) 
in that ifp is a prime and 27 — 1 is a prime, then 2" !(2? — 1) is a perfect number. (This result was proved 
by Euclid around 300 в.с.). For example, 6 = 2!(2? — 1). 

a. Prove that 28 is a perfect number by writing it as the sum of its divisors. 


b. Write 28 in the form 2" (2? — 1). 


a. Prove that 496 is a perfect number (see Exercise 49) by writing it as the sum of its divisors. 
b. Write 496 in the form 2" (2? — 1). 


. An algorithm exists to find all prime numbers less than some given positive integer n. This method, called 


the Sieve of Eratosthenes, was discovered by Eratosthenes, a student of Plato. To carry out this algorithm, 
list all integers from 1 through п — 1. Then make repeated passes through the list, on the first pass crossing 
out all multiples of 2 that are greater than 2. On the second pass cross out all multiples of 3 that are greater 
than 3. On the next pass, cross out all multiples of 5 that are greater than 5, and so forth for all primes less 
than V/n. The numbers remaining when this process terminates are the primes less than n. Use the Sieve 
of Eratosthenes to find all prime numbers less than 100. 


a. Compute the square of 11. 
b. Compute the square of 111. 


c. Prove that any n-digit number consisting of all 1’s, when squared, produces the number 
123 ... (n — D)n(n — 1) ... 321. A number that reads the same forward and backward is called a 
palindrome. 


Sudoku puzzles are popular number-based puzzles. A game con- 
sists of a 9 X 9 grid made up of nine 3 X 3 blocks. Each row 
and each column of the game must contain exactly one of the 9 
digits 1 — 9; furthermore, each 3 X 3 block must contain exactly 
one of the digits 1 through 9. Following is a sample puzzle, used 
by permission from Web Sudoku at http://www.websudoku.com, 
where you can generate puzzles at any of four levels of difficulty. 
Try completing this puzzle. 
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CHAPTER 2 REVIEW 


TERMINOLOGY 


absolute value (p. 107) 
basis step (p. 111) 
composite number (p. 107) 
contrapositive (p. 103) 
converse (p. 103) 
counterexample (p. 99) 
deductive reasoning (p. 99) 
direct proof (p. 101) 
divides (p. 107) 

Euclidean algorithm 


Euler phi function (p. 149) 
even number (p. 101) 
Fermat’s last theorem (p. 143) 


(p. 144) 


(p. 133) 


n factorial (p. 99) 


SELF-TEST 


first principle of mathematical 
induction (p. 111) 
fundamental theorem of arithmetic 


greatest common divisor (p. 133) 
inductive assumption (p. 112) 
inductive hypothesis (p. 112) 
inductive reasoning (p. 99) 
inductive step (p. 111) 

linear combination (p. 144) 

loop invariant (p. 130) 

loop rule of inference (p. 131) 


odd number (p. 101) 

partial correctness (p. 132) 

perfect square (p. 107) 

prime number (p. 107) 

proof by cases (p. 104) 

proof by contradiction (p. 104) 

proof by contraposition (p. 103) 

proof by exhaustion (p. 100) 

rational number (p. 105) 

relatively prime (p. 146) 

second principle of mathematical 
induction (p. 118) 

well-ordering principle (p. 119) 


number theory (p. 107) 


Answer the following true-false questions without looking back in the chapter. 


Section 2.1 


1. 


2. 


3. 


A conjecture can never be proved merely by prov- 
ing a finite number of cases. 

A proof by contradiction of P > О begins by as- 
suming both P and Q'. 

In the statement of the theorem, “twice an odd inte- 
ger is even," an existential quantifier is understood. 


. To prove the conjecture, “If Laramie is the capital, 


then Wyoming is the state," it 1s sufficient to prove, 
"If Wyoming is the state, then Laramie is the capital." 


. To prove, “А if and only if В,” requires a proof of 


A — B anda proof of B — A. 


Section 2.2 


1 


2. 


3. 


. Induction is an appropriate proof technique for 


proving a statement about all the positive integers. 

The basis step of an inductive proof requires prov- 
ing a property true forn = 1. 

If the truth of P(A + 1) depends on the truth of 
other previous values besides P(x), then the second 
principle of induction should be used. 


. The key to a proof by the first principle of induc- 


tion is to see how the truth of P at the value k + 1 
depends on the truth of P at value k. 


. The equation А? = k°(k+1)/4 is the inductive hy- 


pothesis in an inductive proof of the statement 


+++: +w =n + 1)?/4 


Section 2.3 


1. 


2. 


A loop invariant remains true until the loop is ex- 
ited, at which point it becomes false. 

Partial correctness of a loop statement in a program 
means that the loop behaves correctly for some in- 
put values but not for others. 


. The second principle of induction is used to prove 


loop invariants because the loop can be executed an 
arbitrary number of times. 


. If a loop statement has the form 


while (condition B) 
P 
end while 


then the loop invariant О will be B’. 


. When computing the gcd(42, 30) by the Euclidean 


algorithm, the computation of dividing 30 by 12 is 
carried out. 


Section 2.4 


1. 


2. 


gcd(a,b) can always be written as a linear combina- 
tion of a and b. 

Two integers a and b are relatively prime if there 
exist integers i and j such that ia + jb = p where p 
is a prime number. 


. Ifa positive integer л is not a prime number, then it 


has at least one prime factor > Vn. 


. ф(п) is a prime number for any integer n = 2. 
. Ф(р) = p — | for any prime number р. 
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ON THE COMPUTER 


For Exercises 1—5, write a computer program that pro- 
duces the desired output from the given input. 


1 


. Input: Number n of terms in a geometric progres- 


sion (see Exercise 27, Section 2.2), the initial term 
a, and the common ratio r 
Output: Sum of the first п terms using 


a. iteration 
b. formula of Exercise 27, Section 2.2 


. Input: Number п of terms in an arithmetic progres- 


sion (see Exercise 28, Section 2.2), the initial term 
a, and the common difference d 
Output: Sum of the first п terms using 


a. iteration 
b. formula of Exercise 28, Section 2.2 


. Input: Number п 


Output: Sum of the first n cubes using 


a. iteration, using only multiplication and addi- 
tion; output the number of multiplications and 
additions used 


b. formula of Exercise 8, Section 2.2, using only 
multiplication, addition, and division; output 
the number of multiplications, additions, and 
divisions used 


‚ Input: None 


Output: Table showing every integer л, 8 =n = 100, 
as the sum of 3s and 5s (see Example 24) 


. Input: Value for n 


Output: Value for q(n) 


. The formula 4" < n! is true for all n = N. Write a 


program to determine № and then prove the result 
by induction. 


. The formula 2” > n? is true for all n = N. Write a 


program to determine № and then prove the result 
by induction. 


Recursion, Recurrence Relations, 
and Analysis of Algorithms 


Chapter 


CHAPTER OBJECTIVES · ө 


Question: 


After studying this chapter, you will be able to: 


* Understand recursive definitions of sequences, collections of objects, and op- 
erations on objects. 

* Write recursive definitions for certain sequences, collections of objects, and 
operations on objects. 

* Understand how recursive algorithms execute. 

* Write recursive algorithms to generate sequences defined recursively. 

* Find closed-form solutions for certain types of recurrence relations. 

* Analyze algorithms by counting the number of executions of a basic unit of 
work, either directly or by solving a recurrence relation. 


You are serving on the city council's Board of Land Management, which is consider- 
ing a proposal by a private contractor to manage a chemical disposal site. The mate- 
rial to be stored at the site degrades to inert matter at the rate of 596 per year. The 
contractor claims that, at this rate of stabilization, only about one-third of the original 
active material will remain at the end of 20 years. 


Is the contractor's estimate correct? 


It is possible to check this estimate by doing some brute-force calculations: If 
there is this much initially, then there will be that much next year, so much the fol- 
lowing year, and so on through the 20 years. But a quick and elegant solution can 
be obtained by solving a recurrence relation; recurrence relations are discussed in 
Section 3.2. 

Section 3.1 explores recursion, which is closely related to mathematical in- 
duction (discussed in the previous chapter) and is important in expressing many 
definitions and even algorithms. Some sequences defined recursively can also 
be defined by a formula. Finding such a formula involves solving a recurrence 
relation; solution methods for several types of recurrence relations are devel- 
oped in Section 3.2. Recurrence relations are an important tool in the analysis of 
algorithms, which mathematically determines the amount of work a particular 
algorithm must do. Analysis of algorithms is the topic of Section 3.3. 
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SECTION 3.1 | RECURSIVE DEFINITIONS 


A definition in which the item being defined appears as part of the definition 
is called a recursive definition. At first this seems like nonsense—how can we 
define something in terms of itself? This works because there are two parts to a 
recursive definition: 


1. A basis, where some simple cases of the item being defined are explicitly 
given 

2. An inductive or recursive step, where new cases of the item being defined 
are given in terms of previous cases 


Part 1 gives us a place to start by providing some simple, concrete cases; 
part 2 allows us to construct new cases from these simple ones and then to con- 
struct still other cases from these new ones, and so forth. (This seems analogous 
to proofs by mathematical induction. In a proof by induction, there is a basis step, 
namely, to show that P(1)—or P at some other initial value—holds, and there is 
an inductive step where the truth of P(A + 1) is deduced from the truth of P at pre- 
vious values. This similarity is why the term inductive definition is sometimes 
used instead of recursive definition.) 

Recursion is an important idea that can be used to define sequences of ob- 
jects, more general collections of objects, and operations on objects. (The Prolog 
predicate in-food-chain of Section 1.5 was defined recursively.) Even algorithms 
can be recursive. 


Recursively Defined Sequences 


A sequence S (an infinite sequence) is a list of objects that are enumerated in 
some order; there is a first such object, then a second, and so on. S(k) denotes 
the kth object in the sequence. The list goes on forever, so a sequence therefore 
consists of 


S(1), S2), ... , S(&), ... 
Subscript notation is often used to denote the elements in a sequence, as in 
$1, $5, ... , Sy, ... 
The letter S is just a “dummy variable,” so a sequence could also be denoted by 
44, Ay, o.. y а... or Wis Wa, ce. 5 Wk eee 
and so forth.' 
A sequence is defined recursively by explicitly naming the first value (or the 


first few values) in the sequence and then defining later values in the sequence in 
terms of earlier values. 


! A more formal definition of a sequence is given in Chapter 5, Example 27. 
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| | Ew: | The sequence 5 is defined recursively by 


1. $(1) 22 
2. S(n) = 25(п — 1) forn = 2 


By statement 1, S(1), the first object in S, is 2. Then by statement 2, the second ob- 
ject in Sis S(2) = 2S(1) = 2(2) = 4. By statement 2 again, S(3) = 2S(2) = 2(4) = 8. 
Continuing in this fashion, we can see that S is the sequence 


24,8, 16, 32,... e 


A rule like that of statement 2 in Example 1, which defines a sequence value 
in terms of one or more earlier values, is called a recurrence relation. 


PRACTICE 1 | The sequence T is defined recursively as follows: 


1. 1) 21 
2. T(n) = T(n - 1) + 3 form z2 
Write the first five values in the sequence 7: ш 


The Fibonacci sequence of numbers, introduced in the thirteenth century by an 
Italian merchant and mathematician, is defined recursively by 


Fl) = 1 

Е(2) = 1 

F(n) = Еп — 2) + Е(п — 1) forn 2 
Here the first two values of the sequence are given, and the recurrence relation 
defines the nth value for n > 2 in terms of the two preceding values. It’s best to 


think of the recurrence relation in its most general form, which says that F at any 
value—except 1 and 2—15 the sum of F at the two previous values. ө 


PRACTICE 2 | Write the first eight values of the Fibonacci sequence. 


The Fibonacci sequence is famous because of its many interesting properties. 
Here is a small list (without proofs): 


a. Every positive integer can be written uniquely as a sum of | or more dis- 
tinct, nonconsecutive Fibonacci numbers. For example 11 = 3 + 8, where 
3 = F(4) and 8 = F(6). 

b. ged(F(p), F(q)) = Е(еса(р, q)). For example, if р = 6 and q = 9, then 
F(6) = 8, F(9) = 34, and gcd(8, 34) = 2. Also, ged(6, 9) = 3 and F(3) = 2. 

c. Every two consecutive Fibonacci numbers are relatively prime, that 
is, their greatest common divisor equals 1. As a result, the Euclidean 
algorithm to find gcd(a, Б) does the maximum amount of work when 
a and b are two consecutive Fibonacci numbers. 
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Other mathematical properties of the Fibonacci sequence are given in 
Example 3 and in the exercises at the end of this section. But it’s not only 
mathematicians who are interested in the Fibonacci sequence. Fibonacci num- 
bers often occur in nature. The number of petals on a daisy is often a Fibonacci 
number. Viewing a pine cone from its base, the seeds appear to be arranged in 
clockwise and counterclockwise spirals. Counting the number of each kind of 
spiral often gives two consecutive Fibonacci numbers (here 8 and 13). The same is 
true for seeds in flowers such as sunflowers, or for spirals on pineapples. 


And in the worlds of art and architecture, the golden ratio is thought to create 
aesthetically pleasing proportions. The golden ratio is 


[35/5 
2 


== 1.6180339 


and is the value approached by the ratio of two consecutive Fibonacci numbers 
F(n + 1)/F(n) for larger and larger values of n. 


| EXAMPLES | Prove that in the Fibonacci sequence 


F(n + 4) = 3F(n + 2) — F(n) for all n > 1 


Because we want to prove something true for all n = 1, it is natural to think 
of a proof by induction. And because the value of F(n) depends on both F(n — 1) 
and F(n — 2), the second principle of induction should be used. For the basis step 
of the inductive proof, we'll prove two cases, n — 1 and n — 2. For n — 1, by sub- 
stituting 1 for n in the equation we want to prove, we get 


F(5) = 3F(3) — FQ) 
or (using values computed in Practice 2) 
5 = 3(2) – 1 
which is true. For n = 2, 
F(6) = 3F(4) — F(2) 
Or 
8 = 3(3) — 1 


which is also true. Assume that for all r, 1 =r =k, 


F(r + 4) = 3F(r + 2) — Ер). 
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Now show the case for k + 1, where k + 1 = 3. (We've already proved the 
case for n = 1 and the case for n = 2.) Thus we want to show 


p 


F(k-- 1-4) £ 3F(k - 1-2) = Fk 1) 


or 


fo 


IK ae 8) = VAC ae 3) = JA ae Л) 
From the recurrence relation for the Fibonacci sequence, we have 


F(k + 5) = F(k + 3) + F(k + 4) (F at any value is the sum of F at 
the two previous values) 


and by the inductive hypothesis, with r = k — 1 and r = k, respectively, 
ARE ae э) = BCE ae 10) = IAE — Ш) 
and 
F(k + 4) = 3F(k + 2) — F(K) 
Therefore 
Е( + 5) = F(k + 3)  F(k + 4) 
= [уду чь Ш) = ух = db se ese 2) = (| 


= 3[F(k + 1) + F(k + 2)] — [F(k — 1) + Е] 
= 3F(k + 3) — F(A + 1) (using the recurrence relation again) 


This completes the inductive proof. ө 


PRACTICE 3 | In the inductive proof of Example 3, why is it necessary to prove л = 2 asa special case? | 


| EXAMPLE 4 | The formula 


F(n + 4) = 3F(n + 2) — F(n) foralln = 1 


of Example 3 can also be proved without induction, using just the recurrence rela- 
tion from the definition of Fibonacci numbers. The recurrence relation 


F(n + 2) = F(n) + F(n + 1) 
can be rewritten as 


Еп + 1) = F(n + 2) — F(n) (1) 
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Then 


Еп + 4) = Еп + 3) + F(n + 2) 


Sba 3p 2) ap ratr M) te d se 2) (rewriting F(n + 3)) 
= F(n + 2) + [F(n + 2) — F(n)]  F(n + 2) (rewriting F(n + 1) 
using (1)) 


hn 2) > Fn) ө 


Recursively Defined Sets 


The objects in a sequence are ordered—there is a first object, a second object, and 
so on. A set of objects is a collection of objects on which no ordering is imposed. 
Some sets can be defined recursively. 


In Section 1.1 we noted that certain strings of statement letters, logical connec- 
tives, and parentheses, such as (4 ^ B)' V С, are considered legitimate, while 
other strings, such as ^ ^ A"'B, are not legitimate. The syntax for arranging such 
symbols constitutes the definition of the set of propositional well-formed formulas, 
and it is a recursive definition. 


1. Any statement letter is a wff. 
2. If P and О are wffs, so are (P ^ О), (P V О), (P > О), (P^) and (P + Q)? 


Using the rules of precedence for logical connectives established in Section 1.1, 
we can omit parentheses when doing so causes no confusion. Thus we write 
(P V О) as P V О, or (P’) as P’; the new expressions are technically not wffs by 
the definition just given, but they unambiguously represent wffs. 

By beginning with statement letters and repeatedly using rule 2, any proposi- 
tional wff can be built. For example, А, B, and C are all wffs by rule 1. By rule 2, 


(A ^ B) and (C') 
are both wffs. By rule 2 again, 
(AA B) (C^) 
is a wff. Applying rule 2 yet again, we get the wff 
(KA ^ B)  (С'))) 
Eliminating some pairs of parentheses, we can write this wff as 


((4 ^ B) > C') ө 


?Sometimes there is a final rule added to the effect that there are no applicable rules besides those already 
given, which means that if something can’t be generated using the rules already given, then it does not belong 
to the set being described. We'll assume that when we stop writing rules, there are no more applicable rules! 
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PRACTICE 4 | Show how to build the wff ((4 V (B’)) — C) from the definition in Example 5. 


PRACTICE 5 | A recursive definition for the set of people who are ancestors of James could have the 


following basis: 


James's parents are ancestors of James. 


Give the inductive step. " 


Strings of symbols drawn from a finite "alphabet" set are objects that are 
commonly encountered in computer science. Computers store data as binary 
strings, strings from the alphabet consisting of 0s and 15; compilers view program 
statements as strings of tokens, such as key words and identifiers. The collection 
of all finite-length strings of symbols from an alphabet, usually called strings over 
an alphabet, can be defined recursively (see Example 6). Many sets of strings with 
special properties also have recursive definitions. 


| EXAMPLE 6 | The set of all (finite-length) strings of symbols over a finite alphabet A is denoted 


by A*. The recursive definition of A* is 


1. The empty string А (the string with no symbols) belongs to A*. 
2. Any single member of A belongs to A*. 
3. If x and y are strings in A*, so is xy, the concatenation of strings x and y. 


Parts 1 and 2 constitute the basis, and part 3 is the recursive step of this definition. 
Note that for any string x, xX = Ах = x. ө 


PRACTICE 6 | Ifx = 1011 апа у = 001, write the strings ху, ух, and ух\х. 


PRACTICE 7 | Give a recursive definition for the set of all binary strings that are palindromes, strings 
that read the same forward and backward. 


Suppose that in a certain programming language, identifiers can be alphanumeric 
strings of arbitrary length but must begin with a letter. A recursive definition for the 
set of such strings is 


1. Asingle letter is an identifier. 
2. If A is an identifier, so is the concatenation of А and any letter or digit. 


A more symbolic notation for describing sets of strings that are recursively 
defined is called Backus—Naur form, or BNF, originally developed to define the 
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programming language ALGOL. In BNF notation, items that are defined in terms of 
other items are enclosed in angle brackets, while specific items that are not further 
broken down do not appear in brackets. The vertical line | denotes a choice, with the 
same meaning as the English word or. The BNF definition of an identifier is 


«identifier ::= «letter | «identifier «letter | <identifier> <digit> 
«letter := а |Б |с|-- |2 
«digit» := 1|2|---|9 


Thus the identifier me2 is built from the definition by a sequence of choices such as 


<identifier> can be <identifier> <digit> 
which can be <identifier>2 
which can be <identifier> <letter>2 
which can be <identifier>e2 
which can be <letter>e2 
which can be me2 ө 


Аз a further connection between recursion and induction, there is a form of 
induction called structural induction that can be applied to recursively defined 
sets. Suppose we have a recursively defined set S and there is some property P(x) 
that may or may not hold for x a member of S. If we can prove 


1. Property P holds for all members of S described in the basis. 
2. If property P holds for some members of S, then it holds for new members 
of S constructed from these members using the recursive step. 


then property P holds for all members of S. 


ИШЕТ | Е A set S of strings is defined recursively by 


1. А belongs to S. 
2. If x belongs to S, so do 1x0 and 0х1. 


We can use structural induction to prove that every string in S consists of an 
equal number of 05 and 15. The basis, rule 1, identifies only a single string in S, 
namely №, which consists of an equal number of 05 and 15 (zero 05 and zero 15). 
Assume that string x consists of an equal number of 0s and 15. Using rule 2, the two 
new strings that can be constructed from x each add a single 1 and a single 0, so the 
number of 05 and the number of 15 has each been increased by | and they are still 
equal. By structural induction, every string in S has an equal number of 0s and 1s. 

Notice that not all strings with an equal number of 0s and 15 belong to S. For 
example there is no way to generate the string 1001 using the given rules. ө 


Ordinary mathematical induction proves properties about integer values, and 
the integers are ordered: 1, 2, ..., k, k + 1, .... A set, however, isn't necessarily 
ordered. If we consider the set S defined in Example 8, it looks like 
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10 01 
Zw ZN 
1100 0100 1010 0011 


and structural induction helps us deal with this “spread” of values in the set. 


Recursively Defined Operations 


Certain operations performed on objects can be defined recursively, as in Examples 
9 and 10. 


A recursive definition of the exponentiation operation a” on a nonzero real number 
a, where n is a nonnegative integer, is 


а= 
Da Соот e 


| EXAMPLE | A recursive definition for multiplication of two positive integers т and n is 


1. m(1) = m 
2. m(n) = m(n — 1) + mfornz2 e 


PRACTICE 8 | Let x be a string over some alphabet. Give a recursive definition for the operation x" 


(concatenation of x with itself n times) for n > 1. 


In Section 1.1, we defined the operation of logical disjunction on two statement 
letters. This definition can serve as the basis step for a recursive definition of the 
disjunction of л statement letters, n = 2: 


1. A, V A, defined as in Section 1.1 
2. A, V VA, = (АМ УА, 1) VA, forn > 2 (2) 


Using this definition, we can generalize the associative property of disjunction 
(tautological equivalence 2a) to say that in a disjunction of л statement letters, 
grouping by parentheses is unnecessary because all such groupings are equivalent 
to the general expression for the disjunction of n statement letters. In symbolic 
form, for any n with n = 3 and any p with 1 E p x n — 1, 


(Ay Vo V AL) V Mpi VV A) eed VV, 
This equivalence can be proved by induction on л. For n = 3, 


A, V (A, V Aj) = (A, А) V AG (by equivalence 2a) 
= Á VA, V A, (by equation (2)) 
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Assume that for n = Капд1=р=К— І, 
(Д.М УА) V (А, ММА) A VV AQ 
Then forn =k + l and1 Sp S k, 


(4 V: VA) V Aa Vi V Age 


= (ДМ V Ap) УКА, Ves VAD V Aa] (by equation (2)) 

s [4 V VA) V Mp1 М УА] МА (by equivalence 2a) 

= (А, V °°: V Ap V Аку, (by inductive hypothesis) 
= АММА (by equation (2)) 


Recursively Defined Algorithms 


Example 1 gives a recursive definition for a sequence S. Suppose we want to 
write a computer program to evaluate S(7) for some positive integer n. We can use 
either of two approaches. If we want to find S(12), for example, we can begin with 
S(1) = 2 and then compute 5(2), S(3), and so on, much as we did in Example 1, 
until we finally get to S(12). This approach no doubt involves iterating through 
some sort of loop. A pseudocode function S that uses this iterative algorithm 
follows. The basis, where n — 1, 1s handled in the first clause of the if statement; 
the value 2 is returned. The else clause, for n > 1, does some initializing and then 
goes into the while loop that computes larger values of the sequence until the 
correct upper limit is reached. You can trace the execution of this algorithm for a 
few values of n to convince yourself that it works. 


ALGORITHM 


S(positive integer л) 

//function that iteratively computes the value S(n) 
//for the sequence S of Example 1 

Local variables: 

integer і //loop index 

CurrentValue //current value of function S 


if n — 1 then 
return 2 
else 
i=2 


CurrentValue = 2 

while i <= n do 
CurrentValue = 2 * CurrentValue 
i=it+l 

end while 


//CurrentValue now has the value S(n) 
return CurrentValue 
end if 
end function 5 
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The second approach to computing S(m) uses the recursive definition of 5 
directly. Following is a version of the recursive algorithm, written again as a 
pseudocode function. 


ALGORITHM 


S(positive integer л) 
//function that recursively computes the value S(n) 
//for the sequence S of Example 1 


if n = | then 
return 2 
else 
return 2 * S(n — 1) 
end if 
end function S 


The body of this function consists of a single if-then-else statement. To un- 
derstand how the function works, let's trace the execution to compute the value 
of S(3). The function is first invoked with an input value of n — 3. Because n is 
not 1, execution is directed to the else clause. At this point, activity on computing 
S(3) must be suspended until the value of S(2) is known. Any known information 
relevant to the computation of 5(3) is stored within computer memory on a stack, 
to be retrieved when the computation can be completed. (A stack is a collection 
of data where any new item goes on top of the stack, and only the item on top of 
the stack at any given time can be accessed or removed from the stack. A stack is 
thus a LIFO—last in, first out—structure.) The function is invoked again with an 
input value of п = 2. Again, the else clause is executed, and computation of S(2) 
is suspended, with relevant information stored on the stack, while the function is 
invoked again with n = 1 as input. 

This time the first clause of the if statement applies, and the functional value, 2, 
can be computed directly. This final invocation ofthe function is now complete, and 
Its value of 2 1s returned to the second-to-last invocation, which can now remove 
any information relevant to the n = 2 case from the stack, compute S(2), and return 
the result to the previous (initial) invocation. Finally, this original invocation of 5 
is able to empty the stack and complete its calculation, returning the value of S(3). 

What are the relative advantages of iterative and recursive algorithms for doing 
the same task? In this example, the recursive version is certainly shorter because it 
does not have to manage a loop computation. Describing the execution of the recur- 
sive version makes it sound more complex than the iterative version, but all steps are 
carried out automatically. One need not be aware of what is happening internally 
except to note that a long series of recursive invocations can use a lot of memory 
by storing information relevant to previous invocations on the stack. If too much 
memory is consumed, a “stack overflow" can result. Besides using more memory, 
recursive algorithms can require many more computations and can run more slowly 
than nonrecursive ones (see Exercise 3 in On the Computer at the end of this chapter). 

Nonetheless, recursion provides a natural way to think about many problems, 
some of which would have very complex nonrecursive solutions. The problem of 
computing values for a sequence that has itself been defined recursively is well- 
suited to a recursive solution. Many programming languages support recursion. 
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PRACTICE 9 | Write the body of a recursive function to compute T(n) for the sequence T defined in 
Practice 1. 


In Example 10, a recursive definition was given for multiplying two positive 
integers m and n. A recursive pseudocode function for multiplication based on this 
definition follows. ө 


ALGORITHM 


Product(positive integer т; positive integer л) 
//Function that recursively computes the product of m and n 


if n = | then 
return т; 
else 
return Product(m, n — 1) + т 
end if 
end function Product 


A recursive algorithm invokes itself with “smaller” input values. Suppose a 
problem can be solved by solving smaller versions of the same problem, and the 
smaller versions eventually become trivial cases that are easily handled. Then 
pallid SI ТӘ prociem а recursive algorithm can be useful, even if the original problem was not stated 
from solutions to smaller recursively. 
versions of the problem. To convince ourselves that a given recursive algorithm works, we don’t have to 
start with a particular input and go down through smaller and smaller cases to the 
trivial case and then back up again. We did this when discussing the computation 
of S(3), but that was just to illustrate the mechanics of a recursive computation. 
Instead, we can verify the trivial case (like proving the base case in an induction 
proof) and verify that if the algorithm works correctly when invoked on smaller 
input values, then it indeed solves the problem for the original input values (this 
is similar to proving P(k + 1) from the assumption P(x) in an inductive proof). 


REMINDER 


Think of a recursive 
algorithm whenever you 


One of the most common tasks in data processing is to sort a list L of n items into 
increasing or decreasing numerical or alphabetical order. (The list might consist of 
customer names, for example, and in sorted order “Valdez, Juanita” should come 
after “Tucker, Joseph.") The selection sort algorithm—a simple but not particularly 
efficient sorting algorithm—is described in pseudocode in the accompanying box. 
This function sorts the first j items in L into increasing order; when the func- 
tion is initially invoked, j has the value л (thus, the first invocation ultimately sorts 
the entire list). The recursive part of the algorithm lies within the else clause; the 
algorithm examines the section of the list under consideration and finds the loca- 
tion i such that L[7] is the maximum value. It then exchanges Z[;] and Z[], after 
which the maximum value occurs at position j, the last position in the part of the 
list being considered. L[j] is now correct and should never change again, so this 
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process is repeated on the list L[1] through Z[ j — 1]. If this part of the list is sorted 
correctly, then the entire list will be sorted correctly. Whenever j has the value 1, 
the part of the list being considered consists of only one entry, which must be in 
the right place. The entire list is sorted at that point. ө 


ALGORITHM SELECTIONSORT 


SelectionSort(list L; positive integer /) 
//recursively sorts the items from 1 to j in list L into increasing order 


ifj — 1 then 
sort is complete, write out the sorted list 

else 
find the index 7 of the maximum item in L between 1 and j 
exchange L[i] and L[ j] 
SelectionSort(L, j — 1) 

end if 

end function SelectionSort 


Other recursive sorting algorithms are discussed in the exercises of Section 3.3. 


Now that we have sorted our list, another common task is to search the list for a 
particular item. (Is Juanita Valdez already a customer?) An efficient search tech- 
nique for a sorted list is the recursive binary search algorithm, which is described 
here in pseudocode. 


ALGORITHM BINARYSEARCH 


BinarySearch(list L; positive integer i; positive integer j; itemtype x) 
//searches sorted list L from L[i] to L[ /] for item x 


if > j then 
write(“not found") 
else 
find the index k midway between i and j 
if x = midpoint item L[k] then 
write( found") 
else 
if x < midpoint item L[k] then 
BinarySearch(L, i, k — 1, x) 
else 
BinarySearch(L, k + 1, j, x) 
end if 
end if 
end if 
end function BinarySearch 
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This algorithm searches the section of list L between L[i] and L[j] for item x; 
initially 7 and j have the values 1 and л, respectively. The first clause of the major if 
statement is the basis step that says x cannot be found in an empty list, one where 
the first index exceeds the last index. In the major else clause, the midpoint item in 
a section of the list must be found. (If the section contains an odd number of items, 
there is indeed a midpoint item; if the section contains an even number of items, 
it is sufficient to take as the “midpoint” item the one at the end of the first half of 
the list section.) Comparing x with the midpoint item either locates x or indicates 
which half of the list to search next, the half before the midpoint or the half after 
the midpoint. ө 


| EXAMPLE14 | Let’s apply the binary search algorithm to the list 


3, 7, 8, 10, 14, 18, 22, 34 


where the target item x is the number 25. The initial list is not empty, so the mid- 
point item is located and determined to have the value 10. Then x is compared with 
the midpoint item. Because x > 10, the search is invoked on the second half of the 
list, namely, the items 


14, 18, 22, 34 


Again, this list is nonempty, and the midpoint item is 18. Because x > 18, the 
second half of this list is searched, namely, the items 


22, 34 


In this nonempty list, the midpoint item is 22. Because x > 22, the search continues 
on the second half of the list, namely, 


34 


This is a one-element list, with the midpoint item being the only item. Because 
x « 34, a search is begun on the “first half" of the list; but the first half is empty. 
The algorithm terminates at this point with the information that x is not in the list. 

This execution requires four comparisons in all; x is compared, in turn, to 10, 
18, 22, and 34. e 


PRACTICE 10 | ш a binary search of the list in Example 14, name the elements against which x 15 
compared if x has the value 8. 


We have now seen a number of recursive definitions. Table 3.1 summarizes 
their features. 


Section 3.1 
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TABLE 3.1 


Recursive Definitions 


What Is Being Defined 


Characteristics 


Recursive sequence 


Recursive set 


The first one or two values in the sequence are known; later items in the sequence are 
defined in terms of earlier items. 


A few specific items are known to be in the set; other items in the set are built from 
combinations of items already in the set. 


Recursive operation 


A “small” case of the operation gives a specific value; other cases of the operation are 
defined in terms of smaller cases. 


Recursive algorithm 


SECTION 3.1 


REVIEW 


TECHNIQUES 


Generate values in a sequence defined recursively. 
Prove properties of the Fibonacci sequence. 


W) Recognize objects in a recursively defined collec- 


tion of objects. 

Give recursive definitions for particular sets of 
objects. 

Give recursive definitions for certain operations on 
objects. 

Write recursive algorithms to generate sequences 
defined recursively. 


EXERCISES 3.1 


For the smallest values of the arguments, the algorithm behavior is known; for larger 
values of the arguments, the algorithm invokes itself with smaller argument values. 


MAIN IDEAS 


* Recursive definitions can be given for sequences of 


objects, sets of objects, and operations on objects 
where basis information is known and new infor- 
mation depends on already known information. 
Recursive algorithms provide a natural way to 
solve certain problems by invoking the same task 
оп а smaller version of the problem. 


For Exercises 1—12, write the first five values in the sequence. 


1. S(1) = 10 

S(n) = S(n — 1) + 10 forn = 2 
2. C(1) = 5 

C(n) = 2C(n — 1) + 5 forn = 2 
3. 401) = 2 

A(n) = An = 1) =H forn = 2 
4. В(1) = 1 

В(п) = B(n — 1) + п? forn =2 
5. 5(1) = 1 


1 
S(n) = S(n — 1) + = forn = 2 


6. T(1) = 1 
T(n) = nT(n — 1) forn = 2 
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7. Р(1) = 1 

P(n) = п? Pn — 1) + (n – 1) forn 22 
8. A(1) = 2 

A(n) = пА(п — 1) +n forn = 2 
9. М(1) = 2 

М(2) = 2 

М(п) = 2M(n — 1) + M(n — 2) for n > 2 
10. D(1) = 3 

D(2) = 5 

D(n) = (n — 1)D(n — 1) + (п – 2)р(п – 2) for n > 2 
11. 1) = 2 

W(2) = 3 

Wn) = Wn — 1) (п — 2) forn > 2 
12. Т(1) = 1 

Т2) = 2 

13) =3 


Тп) = T(n — 1) + 2T(n — 2) + ЗТ(п — 3) forn > 3 
In Exercises 13—18, prove the given property of the Fibonacci numbers directly from the definition. 
13. F(n + 1) + F(n — 2) = 2F(n) for n = 3 
14. Кп) = 5F(n — 4) + 3F(n — 5) forn = 6 
15. F(n) = 3F(n — 3) + 2F(n — 4) forn = 5 
16. [F(n + DP = [Fm] + F(n — DF(n + 2) forn = 2 
17. F(n + 3) = 2F(n + 1) + Е(п) forn = 1 
18. F(n + 6) = 4F(n + 3) + F(n) forn = 1 


In Exercises 19—22, prove the given property of the Fibonacci numbers for all n = 1. (Hint: The first principle 
of induction will work.) 

19. F(I) + F(2) + +++ + F(n) = F(n + 2) – 1 

20. F(2) + F(4) + --- + F2n) = FQn+1)-1 

21. F(1) + F(3) + --- + FQn — 1) = F(2n) 

22. [FF + [FP +: + [EF = F(n)F(n + 1) 


In Exercises 23—26, prove the given property of the Fibonacci numbers using the second principle of induction. 


23. Exercise 17 
24. Exercise 18 
25. F(n) < 2” forn = 1 


3 п—1 
26. F(n) > Ө forn = 6 
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27. Write a pseudocode recursive algorithm for a function to compute F(n), the nth Fibonacci number. 


28. Walk through your recursive algorithm from Exercise 27 to compute F(6). 
a. How many times is the function invoked? 
b. How many times is F(4) computed? 
c. How many times is А(3) computed? 
d. How many times is А(2) computed? 


Exercises 29 and 30 concern a proof of correctness ofthe following iterative algorithm for a function to compute 
F(n), the nth Fibonacci number. 


F(positive integer л) 

//function that iteratively computes the value of 

//the nth Fibonacci number 

Local variables: 

positive integer 7 /Лоор index 

positive integers p, q, r //terms in Fibonacci sequence 


if n = 1 then 
return 1 
else 
ifn = 2 then 
return | 
else 
i=2 
р=1 //р = lagging term in Fibonacci sequence 
4 = 1 //q = leading term in Fibonacci sequence 
while i < n do 
r=p+q //form the next term as the 
//sum of the two previous terms 
p //bump up p 
q=r //oump up q 
і= 1+ 1 
end while 
//q now has the value F(n) 
return q 
end if 
end if 


end function F 


29.a. In the iterative Fibonacci algorithm, the condition В for loop continuation is i < n, so B’ is i = n, but 
what is the exact value of i when the loop terminates? 


b. When the loop exits, you want g = F(n); what do you want for the value of p at that point? 


30. a. Write the loop invariant О for the iterative Fibonacci algorithm. 
b. Prove that О is a loop invariant. 
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3]. The values p and q are defined as follows: 
р = 1+%5 and q= 1- V5 
2 2 
a. Prove that 1 + p = p? and 1 + д = 4. 


b. Prove that 


F(n) — 


c. Use part (b) to prove that 


di E 2 5 


is a closed-form solution for the Fibonacci sequence. 


У (1 + Уу ys(: - ssy 
2 


32. The Lucas sequence is defined by 


L(1)=1 
L(2) =3 
L(n) = L(n — 1) + L(n — 2) forn = 2 


a. Write the first five terms of the sequence. 
b. Prove that L(n) = F(n + 1) + F(n — 1) for n = 2 where F is the Fibonacci sequence. 


For Exercises 33—36, decide whether the sequences described are subsequences of the Fibonacci sequence, that 
is, whether their members are some or all of the members, in the right order, of the Fibonacci sequence.? 


33. The sequence A(7), where A(n) = 1 + (the sum ofthe first и terms of the Fibonacci sequence), n = 1. The 
first four values are 2, 3, 5, 8, which—so far—form a subsequence of the Fibonacci sequence. 


34. The sequence B(n), where B(n) = (n — 1)2" ? + 1, n = 1. The first four values are 1, 2, 5, 13, which—so 
far—form a subsequence of the Fibonacci sequence. 


35. The sequence C(), where C(7) is the number of ways in which и coins can be arranged in horizontal rows 
with all the coins in each row touching and every coin above the bottom row touching two coins in the row 
below it, n = 1. The first five values are 1, 1, 2, 3, 5, which—so far—form a subsequence of the Fibonacci 
sequence. 


@ өө eee Ф 6000 Goo odo 


nzl n=2 nea п=4 


$0000 deee odee собо coo 


36. The sequence D(n), where D(n) describes the number of ways to paint the floors on an n-story building 
where each floor is painted yellow or blue and no two adjacent floors can be blue (although adjacent floors 


Exercises 33—36 are taken from “Mathematical Recreations” by Ian Stewart, Scientific American, May 1995. 
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can be yellow), n = 1. The first four values are 2, 3, 5, 8, which—so far—form a subsequence of the 
Fibonacci sequence. For example, D(3) = 5 because a three-story building can be painted 


(Hint: think about a recursive expression for D(n + 1).) 


37 a. The original problem posed by Fibonacci concerned pairs of rabbits. Two rabbits do not breed until 
they are 2 months old. After that, each pair of rabbits produces a new pair each month. No rabbits ever 
die. Let R(n) denote the number of rabbit pairs at the end of n months if you start with a single rabbit 
pair. Show that R(n) is the Fibonacci sequence. 


b. Write 27 and 62 as the sum of distinct nonconsecutive Fibonacci numbers. 
38.a. The sequence of Catalan numbers is defined recursively by 


C(0) = 1 
Са) =1 


ОО 
k=1 


Compute the values of C(2), C(3), and C(4) using this recurrence relation. 


b. Frank and Jody are both candidates for president of the County Council. The number of votes cast equals 
2n, where n votes are cast for Frank and for Jody. Votes are counted sequentially. The ballot problem 
asks: In how many ways can the votes be counted so that Jody’s total is never ahead of Frank’s total? 
The answer, as it turns out, is С(л), the nth Catalan number. For example, if = 5, one possible counting 
sequence that meets this requirement is 


FFJJFJFFJJ 


Using п = 3, write down all the satisfactory counting sequences and compare the result to the Catalan 
number C(3). 


39. A sequence is recursively defined by 


S(1) = 2 
SQ) = 2 
5(3) = 6 


S(n) = 3S(n — 3) forn = 3 


Prove that $(л) is an even number for n = 1. 
40. A sequence is recursively defined by 


Т(5) = 6 
T(6) = 10 
T(n) = 2T(n — 2) + 2 forn = 7 


Prove that Т(л) = 2n forn = 7. 
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41. A sequence is recursively defined by 


5(0) = 1 
SQ) = 1 
5(п) = 2S(n — 1) + S(n — 2) forn = 2 


a. Prove that S(7) is an odd number for n = 0. 
b. Prove that S(n) < 6S(n — 2) юги = 4. 


42. A sequence is recursively defined by 


T(0) = 1 
Т(1) =2 
T(n) = 2T(n — 1) + T(n — 2) forn = 2 


Prove that (n) = (2)" for n = 0. 


43. Write a recursive definition for a geometric progression with initial term a and common ratio r (see 
Exercise 27, Section 2.2.). 


44. Write a recursive definition for an arithmetic progression with initial term a and common difference d (see 
Exercise 28, Section 2.2.). 


45. In an experiment, a certain colony of bacteria initially has a population of 50,000. A reading is taken every 
2 hours, and at the end of every 2-hour interval, there are 3 times as many bacteria as before. 


a. Write a recursive definition for A(n), the number of bacteria present at the beginning of the mth time 
period. 


b. Atthe beginning of which interval are there 1,350,000 bacteria present? 


46. An amount of $500 is invested in an account paying 1.296 interest compounded annually. 
a. Write a recursive definition for P(n), the amount in the account at the beginning of the mth year. 
b. After how many years will the account balance exceed $570? 


47. A set T of numbers is defined recursively by 


1. 2 belongs to 7. 
2. If x belongs to 7, so does x + 3 and 2 * x. 


Which of the following numbers belong to 7? 
a. 6 b. 7 c. 19 d. 12 


48. A set M of numbers is defined recursively by 


1. 2 апа 3 belong to M. 
2. If x and y belong to M, so does x * y. 


Which of the following numbers belong to M? 
a. 6 b. 9 c. 16 d. 21 e. 26 f. 54 g. 72 h. 218 


49. 


50. 


5 


— 


52. 


53. 
54. 


55. 
56. 
57. 
58. 
59. 
60. 
61. 
62. 


63. 
64. 
65. 
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A set S of strings of characters is defined recursively by 


1. a and b belong to S. 
2. If x belongs to S, so does xb. 


Which of the following strings belong to S? 
a.d b. ab c. aba d. aaab e. bbbbb 


A set W of strings of symbols is defined recursively by 


1. a, b, and c belong to W. 
2. If x belongs to W, so does a(x)c. 


Which of the following strings belong to W? 
a. a(b)c b. a(a(b)c)c c. a(abc)c d. a(a(a(a)c)c)c e. a(aacc)c 


. A set S of integers is defined recursively by 


1. 0 and 3 belong to S. 
2. If x and y belong to S, so does x + y. 


Use structural induction to prove that every integer in S is a multiple of 3. 


A set T of strings is defined recursively by 


1. pqq belongs to 7. 
2. If x and y belong to T, so do pxqq, qqxp, and xy. 


Use structural induction to prove that every string in T has twice as many q's as p's. 
Give a recursive definition for the set of all unary predicate wffs in x. 


Give a recursive definition for the set of all well-formed formulas of integer arithmetic, involving integers 
together with the arithmetic operations of +, —, *, and /. 


Give a recursive definition for the set of all odd integers. 

Give a recursive definition for the set of all strings of well-balanced parentheses. 

Give a recursive definition for the set of all binary strings containing an odd number of Os. 
Give a recursive definition for the set of all binary strings containing an even number of 15. 
Give a recursive definition for the set of all binary strings ending with 0. 

Give a recursive definition for the set of all binary strings with an equal number of 0s and 1s. 
Use BNF notation to define the set of positive integers. 


Use BNF notation to define the set of decimal numbers, which consist of an optional sign (+ or —), 
followed by one or more digits, followed by a decimal point, followed by zero or more digits. 


Give a recursive definition for x^, the reverse of the string x. 
Give a recursive definition for |x|, the length of the string x. 


Give a recursive definition for the factorial operation n! for n = 1. 
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66. Give a recursive definition for the addition of two nonnegative integers m and n. 

67.a. Write a recursive definition for the operation of taking the maximum of n integers а, ..., a,, n = 2. 
b. Write a recursive definition for the operation of taking the minimum of л integers a), ..., a,, n = 2. 

68.a. Give a recursive definition for the conjunction of n statement letters in propositional logic, n > 2. 


b. Write a generalization of the associative property of conjunction (tautological equivalence 2b of 
Section 1.1) and use induction to prove it. 


69. Let A and В|, B», ..., D, be statement letters. Prove the finite extension of the distributive equivalences of 
propositional logic: 
AN (ВІ ЛВ, A+++ ^B) = (АМ ВІ) (АМ Bj) A+++ ^(AV B,) 
and 
AA(B, VB, У · · VBS (АХВ) М (ANB) У · V(AA^B,) 


for n = 2. 


70. Let В|, B5, ..., В, be statement letters. Prove the finite extension of De Morgan’s laws: 


(B, МВ, V= VBy BY, ABA AB’ 


п 


апа 


(В, ЛВ, A ABY & B' oV B'NV VB 


n 
for n = 2. 


In Exercises 71—76, write the body of a recursive function to compute S(n) for the given sequence S. 
71.1,3,9,27, 81, ... 
72. 2, 1, 1/2, 1/4, 1/8, ... 
73. 1, 2, 4, 7, 11, 16, 22, ... 
74. 2, 4, 16, 256, ... 
75. a, b, a + b, a + 2b,2a + 3b,3a + Sb, ... 


76.p,p —q.p + q,p — 29р + 2q,p — 34, ... 
77. What value is returned by the following recursive function Mystery for an input value of n? 


Mystery (positive integer л) 
ifn — 1 then 
return 1 
else 
return Mystery(n — 1) + 1 
end if 
end function Mystery 


78. The following recursive function is initially invoked with an i value of 1. L is a list (array) of 10 integers. 
What does the function do? 
g(list L; positive integer i; integer x) 
if > 10 then 
return 0 
else 


79. 
80. 
81. 


82. 


83. 


84. 


85. 


86. 


87. 


88. 
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if L[ i] = х then 
return 10 
else 
return g(L, i + 1, х) 
end if 
end if 


end function g 

Informally describe a recursive algorithm to reverse the entries in a list of items. 

Informally describe a recursive algorithm to compute the sum of the digits of a positive integer. 
Informally describe a recursive algorithm to compute the greatest common divisor of two positive integers 


a and b where a > b. (Hint: The solution is based on the Euclidean algorithm, discussed in Section 2.3. In 
particular, make use of expression (5) on page 134.) 


The famous Towers of Hanoi puzzle involves 3 pegs with n disks of varying sizes stacked in order from 
the largest (on the bottom) to the smallest (on the top) on 1 of the pegs. The puzzle requires that the disks 
end up stacked the same way on a different peg; only one disk at a time can be moved to another peg, and 
no disk can ever be stacked on top of a smaller disk. Informally describe a recursive algorithm to solve the 
Towers of Hanoi puzzle. 


Simulate the execution of algorithm SelectionSort on the following list L; write the list after every 
exchange that changes the list. 


4, 10, 76,2, 5 


Simulate the execution of algorithm Se/ectionSort on the following list L; write the list after every 
exchange that changes the list. 


9, 0,2, 6,4 
The binary search algorithm is used with the following list; x has the value *Chicago." Name the elements 
against which x is compared. 
Boston, Charlotte, Indianapolis, New Orleans, Philadelphia, San Antonio, Yakima 


The binary search algorithm is used with the following list; x has the value “flour.” Name the elements 
against which x is compared. 


butter, chocolate, eggs, flour, shortening, sugar 


Do a proof of correctness for the iterative function given in this section to compute S(n) of Example 1, 
S(n) = 2". 

The Online Encyclopedia of Integer Sequences (OEIS) was originated and maintained for many years by 
Neil Sloane, a mathematician at AT&T who has also written several books about sequences. The OEIS 
Foundation now manages the database, which contains more than 200,000 sequences of integers that have 
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been submitted and studied by many people. (See oeis.org). There is even a YouTube movie about the OEIS! 
Recaman 5 sequence (number A005132 in the OEIS catalog) is a recursive sequence defined as follows: 


a(l)= 1 
Forn > 1, 
a(n — 1) — n if that number is positive and not already in the sequence, 
a(n) = otherwise 
a(n—1)-*n 


a. Confirm that the first few terms of this sequence are 1, 3, 6, 2, 7, 13. 


b. It has been conjectured that every nonnegative integer will eventually appear in this sequence. Find the 
index of this sequence at which the following numbers appear: 10, 12, 23. 


SECTION 3.2 | RECURRENCE RELATIONS 


We developed two algorithms, one iterative and one recursive, to compute a 
value S(n) for the sequence S of Example 1. However, there is a still easier way to 
compute S(n). Recall that 


S(1) = 2 (1) 
S(n) = 2S(n — 1) forn = 2 (2) 
Because 
5(1)=2=2! 
S(2) = 4= 2° 
S3) = 8 = 2? 
S(4) = 16 = 2f 


and so on, we can see that 
S(n) = 2” (3) 


Using Equation (3), we can plug in a value for n and compute S(n) without hav- 
ing to compute—either explicitly, or, through recursion, implicitly—all the lower 
values of S first. An equation such as (3), where we can substitute a value and get 
the output value back directly, is called a closed-form solution to the recurrence 
relation (2) subject to the basis step (1). Finding a closed-form solution is called 
solving the recurrence relation. 

Recurrence relations can be used to describe a variety of things, from chemi- 
cal degradation (see the opening problem for this chapter) to the amount in an 
interest-bearing account, from the growth of species to the spread of a computer 
virus. Clearly, it is nice to find a closed-form solution to a recurrence relation 
whenever possible. 


Linear First-Order Recurrence Relations 


Expand, Guess, and Verify 
One technique for solving recurrence relations is an "expand, guess, and verify" 
approach that repeatedly uses the recurrence relation to expand the expression for 
the nth term until the general pattern can be guessed. Finally the guess is verified 
by mathematical induction. 
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| EXAMPLE 15 | Consider again the basis step and recurrence relation for the sequence 5 of 


Example 1: 
S(1) = 2 (4) 
S(n) = 2S(n — 1) forn = 2 (5) 
REMINDER Let’s pretend we don’t already know the closed-form solution and use the expand, 


guess, and verify approach to find it. Beginning with S(1), we expand by using the 
Е P recurrence relation repeatedly. Keep in mind that the recurrence relation is a recipe 

and “n — 1" in the recur- à ; 
ee Think efit that says 5 at any value can be replaced by two times S at the previous value. We 
e аа чаа apply this recipe to S at the values n, n — 1, n — 2, and so оп: 
S(n) = 2S(n — 1) 

= 22500 2) = 27S = 2) 

= 2? [2S(n — 3)] = 2?8(n — 3) 


Don’t get hung up on “n” 


times S at the previous 
value.” 


By looking at the developing pattern, we guess that after k such expansions, the 
equation has the form 


S(n) = 2^$(п — k) 


This expansion of S values in terms of lower S values must stop when n — k = 1, 
that is, when k = n — 1. At that point, 


S(n) = 2" !S[n — (n — 1)] 
= 291601) = 2" 1(2) = 2" 

which expresses the closed-form solution. 

We are not yet done, however, because we guessed at the general pattern. 
We now confirm our closed-form solution by induction on the value of n. The 
statement we want to prove is therefore S(n) = 2” for n = 1. 

For the basis step, S(I) = 2'. This is true by equation (4). We assume that 
S(k) = 2) Then 

S(k + 1) = 2S(K) (by equation (5)) 


= Ж) (by the inductive hypothesis) 
= 2+1 


This proves that our closed-form solution is correct. ө 


PRACTICE 11 | Find a closed-form solution for the recurrence relation, subject to the basis step, for 
sequence 7: 
1. 70d) = 1 
2. T(n) = T(n – 1) + 3 forn = 2 
(Hint: Expand, guess, and verify.) и 
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A Solution Formula 

Some types of recurrence relations have known solution formulas. A recurrence 
relation for a sequence S(n) is linear if the earlier values of S appearing in the 
definition occur only to the first power. The most general linear recurrence rela- 
tion has the form 


S(n) = fi(n)S(n — 1) + (и) (п — 2) + +++ + (п) (п — К) + g(n) 


where the /;'s and g can be expressions involving n. The recurrence relation has 
constant coefficients if the f;s are all constants. It is first-order if the nth term 
depends only on term n — 1. Linear first-order recurrence relations with constant 
coefficients therefore have the form 


S(n) = cS(n — 1) + g(n) (6) 


Finally, a recurrence relation is homogeneous if g(7) = 0 for all n. 

We will find the solution formula for equation (6), the general linear first- 
order recurrence relation with constant coefficients, subject to the basis that S(1) 
is known. We will use the expand, guess, and verify approach. The work here is a 
generalization of what was done in Example 15. Repeatedly applying equation (6) 
and simplifying, we get 


S(n) = cS(n — 1) + g(n) 
= с[с5(п — 2) + g(n — 1)] + g(n) 
= с28(п — 2) + cg(n — 1) + g(n) 


с? [cS(n — 3) + g(n — 2)] + eg(n — 1) + g(n) 
= Sn — 3) + с^ g(n — 2) + cg(n — 1) + g(n) 


After k expansions, the general form appears to be 


S(n) = c*S(n — k) + с !g(n — (k — 1)) + +++ + eg(n — 1) + g(n) 


If the sequence has a base value at 1, then the expansion terminates when 
n—k=1lork=n — 1, at which point 


S(n) = c" 1S(1) + c" ?g(2) + +++  eg(n — 1) + g(n) 
= PISA) + h?g) + +++ + clg(n — 1) + den) (7) 


We сап use summation notation to write part of this expression more compactly. 
The uppercase Greek letter sigma, X, stands for summation. The notation 


q 
У (expression) 
i=p 
says to substitute into the expression successive values of i, the index of 
summation, from the lower limit p to the upper limit g, and then sum the results. 
(See Appendix B for further discussion of summation notation.) Thus, for example, 


YQi-1-1 F3-45-4-- Qn- 1) 
ї=1 


Section 3.2 Recurrence Relations 183 


In Example 14, Section 2.2, we proved by induction that the value of this summa- 
tion is n°. 
In summation notation, Equation (7) becomes 


Sin) = ~IS) + д 
i=2 


Induction can be used, much as was done in Example 15, to verify that this for- 
mula is the solution to recurrence relation (6) (see Exercise 26). 
Therefore, the solution to the recurrence relation (6) is 


S(n) = c" 'S(1) + Dc" (0) (8) 
i=2 
This is not yet a closed-form solution, however, because we must find an expres- 
sion for the summation. Usually it is either trivial to find the sum or we found its 
value in Section 2.2 using mathematical induction. (If we can’t find an expression 
for the summation, we are really no better off than before. We must iterate through 
the summation to find the desired value as opposed to iterating through the recur- 
rence relation to get the desired value.) 

The work we’ve done here gives a general solution—Equation (8)—once and 
for all for any recurrence relation of the form shown in (6); this work need not be 
repeated. All that is necessary is to match your problem to equation (6) to find the 
value for c and the formula for g(n) and then plug these results into the expression 
in (8). The g(n) in Equation (6) is the usual notation for a function of л; although 
we will study functions formally in Chapter 5, you can think of g(n) as giving a 
"recipe" for what to do with its argument n. If, for example, 


g(n) — 2n 
then g doubles whatever its argument value is: 
g(3) = 2(3) = 6 g(27) = 2(27) = 54 and g(i) = 2i 


This last value, 2i, would be used in Equation (8) if о(и) = 2n. 


|. EXAMPLE 6 | The sequence S(n) of Example 15, 


5(1) = 2 
S(n) = 25(п — 1) forn = 2 


is a linear, first-order, homogeneous recurrence relation with constant coefficients. 
In other words, it matches equation (6) with c = 2 and g(n) = 0. Because g(n) = 0, 
the g function evaluates to 0 no matter what the argument is. From formula (8), the 
closed-form solution is 


би) = 2"-1Q) + So = 7 


і=2 


which agrees with our previous result. e 


You now have a choice of two alternative ways to solve a linear, first-order recur- 
rence relation with constant coefficients. Table 3.2 summarizes these approaches. 
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TABLE 3.2 


To Solve Recurrence Relations of the Form S(n) = cS(n — 1) + g(n) Subject to Basis S(1) 
Method Steps 


Expand, guess, verify 1. Repeatedly use the recurrence relation until you can guess a pattern. 
2. Decide what that pattern will be when n — k = 1. 
3. Verify the resulting formula by induction. 


Solution formula 1. Match your recurrence relation to the form 
S(n) = cS(n — 1) + о(п) to find с and g(n). 
2. Use с, g(n), and S(1) in the formula 


Sn) = c^ -'S(1) + Sc” gi) 
i-2 


3. Evaluate the resulting summation to get the final expression. 


| EXAMPLE 17 | Find a closed-form solution to the recurrence relation 


S(n) = 2S(n — 1) + 3 forn = 2 
subject to the basis step 
5(1) = 4 
We'll use the solution formula method. Comparing our recurrence relation 
S(n) = 2S(n — 1) + 3 
with the general form S(n) = cS(n — 1) + g(n), we see that 
c=2 g(n) = 3 


The fact that g(n) = 3 says that g has a constant value of 3 no matter what the value 
of its argument; in particular, g(7) = 3. Substituting into the general solution form 


S(n) = с" 1s(1) + о, 


ї=2 
we get 


S(n) = 2^- (4) + У 2-43) 
і=2 


= 27-107) Ке 
i-2 


= 2"t14 3[2"72 + a7 3 +--+ 21 4 2°] 
zat 312? >= {| (from Example 15, Section 2.2) 
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REMINDER 


So the value of S(5), for example, is 2° + 3(2^ — 1) = 64 + 3(15) = 109. 


When expanding, be sure — — Alternatively. by the expand, guess, and verify technique, we expand 


to pick up all the pieces 


of Ше ыр т S(n) = 2 S(n = 1) db 5 
; + 
ү = 25а — 2) + 3] + 3 = P5(n— 2) +2343 


=7 Pe- 3) ol вза оз 


The general pattern seems to be 
Sun) SOS ОО оз 
which, when n — k = 1 ork = n — 1, becomes 
S(n) = 2-191) + 22-2 -8 4 2973.3 a 3 
= 27 (4) + 3772 4- 23 $e 42274241] 
= СРИ (from Example 15, Section 2.2) 
Finally, we must prove by induction that S(m) = 2"*! + 3[2" ! — 1]. 
Base case: л = 1: S(1) = 4 = 22 + 3[2° — 1], true 


Assume S(k) = 2**! + 3[2*! — 1] 
Show S(k + 1) = 2*? + 3[2* — 1] 


S(k + 1) = 25(Ё) + 3 (by the recurrence relation) 
= 20! + 3/2" 1 — 1) + 3 (by the inductive hypothesis) 
= 2243-2643 (multiplying out) 
= 2k? + 3D* — 1] e 


Practice 12 | Rework Practice 11 using Equation (8). 


|. EXAMPLE 18 | ‚БИЙ Find a closed — form solution to the recurrence relation 


T(n) = T(n — 1) + (n + 1) forn = 2 
subject to the basis step 
Т1) = 2 


Using the solution formula method and comparing the recurrence relation 
with the general form from Equation (6), S(n) = cS(n — 1) + g(n), we find 
that c = 1 and g(n) = n + 1. We'll substitute into the solution equation (8) 


S(n) = c"^!S(1) + У c" ei), where g(i) will be + 1. 
i=2 
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ma) = M E ++ mE ze di 
=2+ x +1) 

і=2 
= 2+ (@ 4E s (n) 


(ns NG d 2) : : 
= 2 1 (from Practice 7, Section 2.2) 
ө 


Consider the problem of reading data from a computer disk drive.* The circular 
drive is organized as a series of concentric tracks, divided into sectors. Each sector 
contains a block of data (Figure 3.1). 


Figure 3.1 


The time to read a particular block of data into memory has three components: 


1. Seek time—the time to position the read head over the proper track. This 
time varies depending on the relative position of the read head and the 
proper track when the read request is generated. In the best case, the read 
head is already over the proper track and the seek time is 0. At the worst 
case, assuming there are п tracks, the read head might be over track 1 and 
have to move to track n, which would be л — 1 units, where a unit is the 
distance between adjacent tracks. We can assume that the seek time would 
be some multiple of the number of units. 

2. Latency time—the time for the proper sector to rotate underneath the read 
head. This time also varies, depending on whether the correct sector is just 
coming under the read head (minimum latency time) or whether it has just 
gone by and a full rotation must occur (maximum latency time). 

3. Transfer time—the time to read a block once it is positioned under the read 
head, usually a constant amount of time for all blocks. 


The problem is to find the average seek time, actually the average number А(л) of 
units. The assumptions are that there are n tracks, the read head is positioned over 
some track 7, and the read head is equally likely to have to move to any track j. 


‘This example is based on work found in “Research Problem for Undergraduate Students which Spans 
Hardware Issues, Mathematical Methods and Programming: Average Seek Time of a Computer Disk," by 
Jan Plaza, http://faculty.plattsburgh.edu/jan.plaza/teaching/papers/seektime.html 
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Table 3.3 shows the number of units in going from one track to another, where 
a row is the source track and a column is the destination track. For example, if the 
read head is currently over track 3 and must end up over track n, then n — 3 units are 
required, as shown by the entry in row 3, column л. The entry in row п, column 3 is 
the same because it takes the same number of units to move in the opposite direction. 


TABLE 3.3 
Destination 
Source track 1 2 3 n-1 n 
track 
1 0 1 2 n-2 | n-1 
2 1 0 1 n-3 mm 
3 2 1 0 i= dl = © 
= 1 п=2 = 8 0 = 4 0 1 
п п = 1 n-2 n-3 1 0 


Table 3.3 illustrates the n” different possible track moves. We find the average 
number A(n) of units for these п? cases by computing the total number of units 
shown in the table, T(n), and then dividing by n’. To compute T(n), note that 
T(n) = T(n — 1) + (the total of the last row plus the last column) and that the last 
row plus the last column contribute 


Ga = D 


2 
= (n — l)n 


AL 4E 2 473 49e» se (p = Ile 2| | (using Practice 7, Section 2.2) 


so that 
KGa) = (о — I) se @ = Wye 


The base case is 7(1) = 0 (no seek time for a 1-track disk). This is a linear, 
first-order recurrence relation with constant coefficients. We can solve it using 
Equation (8), where c = 1 and g(n) = (n — 1)и. The solution is 


T(n) = 0 + SE = 
RE 


оа 2234-34 4-5co- (mp — [у 
minc 
i 3 


Therefore the average number of units is 


(from Exercise 19, Section 2.2) 


(n — Dn(n + D, , mD-—mtm-n п-п r-l n 1 
qu = 


А pam 
v» 3 3g зю w 3 m 


Because the best case is 0 and the worst case is n — 1, we might have expected the 


average case to be close to n/2, but in fact it is slightly less than n/3. e 
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| EXANMPLE20 | Not every recurrence relation fits the pattern of Equation (6). Consider the recur- 


rence relation 


Т1) = 1 
T(n) = 2nT(n — 1) forn = 2 


Although this is a linear, first-order recurrence relation, it does пої have constant 
coefficients. Equation (8) does not apply. To find a closed-form solution, we have 
to go back to the expand, guess, and verify technique. 
T(n) = 2nT(n — 1) 
= 2n[2(n — 1)T(n — 2)] = 2?n(n — 1)T(n — 2) 
= 2^nin = [йл = Jre = 3)) = 2nin = 1n = Эу = 3) 


In general, it seems that 
T(n) = 2*n(n — 1)(п — 2)... (n — (Е — D)T(n — k) 
When n — k = 1, thenk = n — 1 and 
T(n) = 2" n(n — 1)(n — 2)... (DTA) = n(n — 1)(n — 2) ... (1) = 2" In! 


This is our guess at a closed-form solution, which we verify by induction on л. 


Base case, T1): 7(1) = 2" 1! = 2*1) = 1, true 
Assume 7(k): Tot 
Show T(k + 1): T(k + 1) = 2k + 1)! 


T(k + 1) = 2(k + 1)T(A) (by the recurrence relation) 
= 2k + 028 (by the inductive hypothesis) 
= 2k + 1)! 
Therefore our closed-form solution guess was correct. e 


Linear Second-Order Recurrence Relations 


In a first-order recurrence relation, the nth term depends only on the previous 
term. In a second-order recurrence relation, the nth term depends on the two 
previous terms. Linear second-order homogeneous recurrence relations with con- 
stant coefficients therefore have the form 


S(n) = e4S(n — 1) + e3S(n — 2) (9) 
The Fibonacci sequence is an example (Exercise 37 asks for a solution): 


ЕП) = 1 
ЕО) = 1 
F(n) = F(n — 1) + F(n — 2) forn > 2 
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In such a sequence, we must have two “base case” values, that is, two known 
values of the sequence in order to generate subsequent values. 

We'd like to find a general solution formula for recurrence relations like (9). 
If we drop the second term, we of course have a linear first-order homogeneous 
recurrence relation with constant coefficients: 


S(n) = с|5(п — 1) 
From Equation (8), we know that the solution to this recurrence relation has the form 
S(n) = ct 1s(1) 
Let's express this solution as 
S(n) = pr^! (10) 
where r (that is, cj) is the solution (root) of the linear equation 
t—c,=0 (11) 
and р (that is, S(1)) satisfies equation (10) for the initial condition л = 1: 
Ше а ' =p =p 
This viewpoint suggests a way in which we might conjecture a solution to 
equation (9). Since we now have two terms in the equation itself, let’s add a second 
term to (10) and represent a potential solution as 
S(n) = pri + qr! (12) 
where r; and r, are two distinct roots of (extending (11) to a quadratic equation) 
Ё - сі со = 0 (13) 
The p and q will have to be chosen to satisfy the two initial conditions: 


S( = pr +45 =p+q 
S(2) = pri! + qr} ' = pr, + qr» 


or, simplifying, 


р = 5(1) 


14 
pr, + qr, = 50) (14) 


Of course, this is just a wild leap of speculation on our part, so we must now verify 
that Equation (12) is a closed-form solution to recurrence relation (9). 
We are trying to prove that 
(п) = pri ' + qr ' 


(where 7, r2, p, and q are as described) is a solution to 


S(n) = с|5(п — 1) + с,(п — 2) 
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for all л = 1. The “for all n = 1” phrase suggests a proof by mathematical induc- 
tion. Because S(n) has to “reach back” two values to compute the current value, 
we Should use the second principle of induction. 


Base Cases: 


When л = 1, the proposed solution gives 
SQ) = pri tan '-ptq 
When n = 2, the proposed solution gives 
SQ) = pri ! + 4% ' = pr, + qr 


Both are trivially true because we chose p and q to meet these very conditions. 

Assume that for all r, 1 rk, S(r) = pri | qr; |. Show that 
S(k + 1) = pri + qr. Before we proceed, note that because ғ and r, are solutions 
of the equation £ — cıt — c; = 0, it is true that 


rn-—cn-cc20 or 1 = сү + с, аз) 
ч! ог № = с + с, 
Now 
S(k + 1) = с|5(К) + eS(k — 1) (by the recurrence relation) 


с\(рг | + qr 1) + eprE? + аг?) (by the inductive 
hypothesis, applied twice) 


—— 2 k-2 
= pri (c? + сүү) + gry (e; + сү) 
k-2,2 


= pri 272 + дрі 2р2 (by Equation (15)) 
= pri + qr 
which is the desired result. This confirms that Equation (12) is a solution to 
Equation (8). 
The key to the solution is the quadratic equation 
t 2 [6 1Ї um C5 = 0 


which is called the characteristic equation of the recurrence relation 


S(n) = c,S(n — 1) + с,5(п — 2) 


ИШИ Solve the recurrence relation 


S(n) = 2S(n — 1) + 3S(n — 2) forn = 3 
subject to the initial conditions 


SQ) = 3 
502) = 1 
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In this recurrence relation, c, = 2 and c; = 3. To find the closed-form solution, we 
form the characteristic equation 


=й == 
which has roots r; = 3, r, = — 1. Equation (12) gives the solution form: 
SO cues ce cibi s 
where from Equation (14) p and q satisfy 


pt+q=3 
PO) al) = | 


Solving this system of equations results in p = 1, = 2. Therefore the closed-form 
solution is 


(п) = 37! + 2-1)! e 


Practice 13 


a. Using the base cases and the recurrence relation, compute the first five terms of the sequence 
S(n) of Example 21. 
b. Check that the closed-form solution formula in Example 21 produces the correct first five terms. ш 


Practice 14 | Solve the recurrence relation 


T(n) = 6T(n — 1) —5T(n — 2) юги = 3 
subject to the initial conditions 


Та) = 5 
ТО) = 13 a 


Although it would seem at this point that we have the solution method in 
hand for any linear second-order homogeneous recurrence relations with constant 
coefficients, such is not the case. Consider the system of Equation (14): 


pt*tq-S) 
pry + qr, = SQ) 


We can solve the first equation for p— 


p-S0)-q 
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—and then substitute in the second equation to solve for q: 


[S(1) — q]ri + qr; = SQ) 
409 = т) = S) = Sry 
SQ) — Sn 
q= nn | 


Now what happens if the characteristic equation 2 — cıt — c; = 0 happens 
to have one repeated root, that is, rı = r2? Oops—we can’t solve this system of 
equations. The solution form when the characteristic equation has a repeated root 
r looks like 


S(n) = р"! + q(n — Ir"! forall n = 1 
where р and q satisfy the equations 


p =S) 
рг + qr = S(2) 


This сап be proved by induction in a manner similar to the distinct roots сазе 
(see Exercise 44). 


| EXAMPLE2? | Solve the recurrence relation 


S(n) = 8S(n — 1) — 16S(n — 2) forn = 3 


subject to the initial conditions 


SQ) = 1 
S(2) = 12 
In this recurrence relation, c, = 8 and c; = — 16. To find the closed-form solution, 
we form the characteristic equation 
2 – 8t+ 16=0 
(t—4y = 0 


which has a repeated root r = 4. The solution is 
S(n) = p4^-'! + Шш — 1/4"! 
where 


р=1 
p(4) + q(4) = 12 


Solving this system of equations, p = 1 and = 2, so the solution is 


S(n) = 4'7! + 2(n — 14^! = Qn — 147! ө 
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Table 3.4 summarizes the solution steps for a linear second-order homoge- 
neous recurrence relation with constant coefficients: 


TABLE 3.4 


To Solve Recurrence Relations of the Form S(n) = c,S(n — 1) + c,S(n — 2) 
Subject to Initial Conditions S(1) and S(2) 


1. Solve the characteristic equation t? — c't — c? = 0 


2. If the characteristic equation has distinct roots г; and г, the solution is 
Sn) = pri ' + qrg! 
where 
P +q = S(1) 
pry + qr = SQ) 
3. If the characteristic equation has a repeated root r, the solution is 


S(n) = pr^! + q(n — 1)r^ ^! 
where 


р = S(1) 
pr + qr = S(2) 


The proofs for case 2 and case 3 are unchanged if the roots of the characteristic 
equation turn out to be complex numbers. In other words, the solution formulas still 
work. 


Divide-and-Conquer Recurrence Relations 


Still another recurrence relation form occurs when the value of 5(и) depends not 
on the previous term or on the two previous terms, but on the value halfway back 


in the sequence, 5 (3) 


| EXAMPLE 23 | Consider the sequence with the following values: 


S(1) = 2, S(2) = 4, S(4) = 8, S(8) = 16, 5(16) = 32, ... 


We are looking at only selected terms of the sequence, namely S(7) where n is a 


power of 2. For these terms, we can see that S(n) — 2S 3 e 


Such recurrence relations will occur in the analysis of certain "divide- 
and-conquer” algorithms, algorithms that solve a problem by breaking it into 
smaller versions, each half the size of the original (see the next section). Hence 
such recurrence relations are called divide-and-conquer recurrence relations. 
The general form is 


S(n) — Ө + g(n) forn = 2, n = 2” (16) 
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where с is a constant and g can be an expression involving л. It’s convenient to 
look only at values of n that are powers of 2 because then cutting n in half over 
and over always results in an integer. As we will see in the next section, this is not 
a significant restriction. 

To solve a divide-and-conquer recurrence relation, we go back to the expand, 
guess, and verify approach. Also, the solution will involve the logarithm function; 
for a review of the logarithm function and its properties, see Appendix C. 


| EXAMPLE 24 | Solve the recurrence relation 


C(n) = 1 + 45) forn 2 2, п = 2" 
subject to the basis step 
C(1) = 1 


Expanding, we get 


Il 
= 
| 


diet) 
C) 


and the general term seems to be 


бй = + 45) 


The process stops when n/2^ = 1 or 2* = n, which means k = log, n. (We'll omit 
the base 2 notation from now on—log n will mean log; n. See Appendix C for a 
brief discussion of logarithms.) Then 


C(n) = log n + С( = 1 + logn 


Now we will use induction to verify that C(n) = 1 + log n for all n = 1, 
n = 2". This is a somewhat different form of induction, because the only values of 
interest are powers of 2. We still take 1 as the basis step for the induction, but then 
we prove that if our statement is true for a value Ё, it is true for 2k. The statement 
will then be true for 1, 2, 4, 8, ... , that is, for all nonnegative integer powers of 2, 
which is just what we want. 

For the base case, 


С(1) =1+log1 = 1 + 0 = 1, true 
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Assume that C(k) = 1 + log k. Then 


Ck) = 1+ C(K) (by the recurrence relation) 
=1+1+logk (by the inductive hypothesis) 
= 1 + 1052 + log k (log 2 = 1) 
= 1 + log 2k (property of logarithms) 
This calculation completes the inductive proof. ө 


We'd like to find a closed-form solution for (16) subject to the basis that S(1) 
is known. We could use the expand, guess, and verify approach to find the general 
solution, but instead we will do a transformation on (16) to convert it to a first- 
order recurrence relation with constant coefficients, use the solution formula we 
already have for such a recurrence relation, and then reverse the transformation. 
Figure 3.2 shows this round-about approach. 


B. First-order 
recurrence relation, 


constant coefficients Solution to B 


A. Divide & conquer Solution to A 


Figure 3.2 recurrence relation 


Equation (16) assumes that n = 2" with л > 2. From this it follows that 
m = log n and m > 1. Substituting 2" for n in equation (16) results in 


SQ") = aga" T) + 22") (17) 
Now, letting 7(m) represent S(2") in Equation (17), we get 
T(m) = cT(m — 1) + g(2”) form > 1 (18) 


Equation (18) is a linear, first-order equation with constant coefficients; from 
Equation (8), we obtain the solution 


m 


Тт) = c"! T) + У ст 8025) (19) 
i-2 


subject to the basis condition that 7(I) is known. Because Equation (18) holds for 
m = 1, we know that 


T(1) = cT(0) + g(2) 
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Making this substitution in (19) results in 

T(m) = c"T(0) + Yee) (20) 
Now reversing the substitution 7(m) = S(2”), (20) becomes 

S(2") = c™S(2°) + Sere) 


Finally, letting 2” = n or m = log n, we get 


logn 
S(n) = Sd) + У сй -:0(27) (21) 

i=l 
REMINDER Equation (21) thus represents the solution for the recurrence relation (16). As 
In the summation part before, to use this general solution you need only match your recurrence relation 
of the general solution to (16) to determine c and g(n), then substitute into Equation (21). Again as be- 


formula, c is raised to the 
(log n) — i power, not 
(log n) — 1 


fore, о(и), gives a recipe for what to do with an argument n; in Equation (21), the 
argument is 2'. If you can evaluate the resulting summation, you will then have a 
closed-form solution. Table 3.5 outlines the solution steps. 


TABLE 3.5 


To Solve Recurrence Relations of the Form S(n) = es(7) + g(n) for n = 2, 
n = 2" Subject to Initial Condition S(1) 


1. Match your recurrence relation to the form 


S(n) = es?) + gín) 


to find c and g(n). 


2. Use c, g(n) and S(1) in the formula 


logn | 
S(n) = c"S(1) + У, с(9)-'9(2/) 
i=] 


3. Evaluate the resulting summation to get the final expression. 


| EXAMPLEZS | The recurrence relation 


С) = 1 


C(n) 214 45) гп 2 2, п = 2" 


matches Equation (16), with c = 1 and g(n) = 1. Because g(n) = 1, the g function 
evaluates to 1 no matter what the argument is. The solution, according to formula 
(21), is 


log n 
posset) dE 5 pose cu 
i=1 


1 + (log n(1) = 1 + logn 


C(n) 


which agrees with our previous result from Example 24. ө 
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| EXANPLE26 | Solve the recurrence relation 


т) = 3 
Т(п) = Ө + 2n 


This is a match for Equation (16), where c = 2 and g(n) = 2n. Therefore 
g(2') = 2(2'). Substituting into Equation (21)—the solution of Equation (16)— 
gives the following result, where we use the fact that 2^?" = n. 
log n 
T(n) = ОЕ 1) E D жешсп бу) 


i=l 


log n 


= 2198 (3) Jt pu 
i=1 


= n(3) + (2%"* log п 
= 3n + (2*5"- 2)logn 
= Зп + 2nlogn ө 


Practice 15 | Show that the solution to the recurrence relation 


sa) =1 


S(n) = 2s( +) + forn =2,n=2" 


is 2n — 1. (Hint: See Example 15 in Section 2.2 and note that 29" = n.) и 


SECTION 3.2 REVIEW 


TECHNIQUES * Solve divide-and-conquer recurrence relations by 


| using a solution formula. 
W) Solve recurrence relations by the expand, guess, 8 


and verify technique. 
: А : МАІМ ІрЕА 
W) Solve linear, first-order recurrence relations with 
constant coefficients by using a solution formula. * Certain recurrence relations have closed-form 
* Solve linear, second-order homogeneous recurrence solutions. 


relations with constant coefficients by using the 
characteristic equation. 


EXERCISES 3.2 


In Exercises 1—12, solve the recurrence relation subject to the basis step. 


1. SQ) = 5 
S(n) = S(n — 1) + 5 forn = 2 
2. BA) = 5 


Bin) = 3B(n — 1) юги = 2 


| 198 Recursion, Recurrence Relations, and Analysis of Algorithms 


3. Е@)=2 
F(n) = 2F(n — 1) + 2" forn = 2 
4. T1) =1 


T(n) = 2T(n — 1)+ 1 юги = 2 

(Hint: See Example 15 in Section 2.2.) 
5. A(1)=1 

A(n) = A(n — 1) +n forn = 2 

(Hint: See Practice 7 in Section 2.2.) 
6. 5(1) = 1 

S(n) = S(n — 1) + 2n — 1) forn = 2 

(Hint: See Example 14 in Section 2.2.) 
7. T(1) = 1 

T(n) = T(n — 1) + п? forn = 2 

(Hint: See Exercise 7 in Section 2.2.) 
8. Рр(1) = 2 

P(n) = 2P(n — 1) + n2" forn 2 2 

(Hint: See Practice 7 in Section 2.2.) 


9. FA) = 1 

F(n) = nF(n — 1) forn = 2 
10. 5(1) = 1 

S(n) = nS(n — 1) + n! forn = 2 
I1. 4(1) = 1 


A(n) = 2(n — 1)A(n — 1) forn = 2 
(Hint: 0! is defined to equal 1.) 
12. P(1) = 2 
P(n) = 3(n + 1)P(n — 1) юги = 2 
13. At the beginning of this chapter the contractor claimed: 


The material to be stored at the chemical disposal site degrades to inert matter at the rate of 5% per 
year. Therefore only about one-third of the original active material will remain at the end of 20 years. 


a. Write a recurrence relation Т(л) for the amount of active material at the beginning of year n. Assume 
that 7(1) — X, a specific but unknown amount. 


b. Solve the recurrence relation. 


c. Compute 7(21) to check the contractor's claim; note that the end of 20 years is the beginning of the 
215% year. 


14. A colony of bats is counted every 2 months. The first four counts are 1200, 1800, 2700, and 4050. 
a. Assuming that this growth rate continues, write a recurrence relation for the number of bats at count л. 
b. Solve the recurrence relation. 
c. What will the 12th count be? 


15. Spam e-mail containing a virus is sent to 1,000 e-mail addresses. After 1 second, a recipient machine 
broadcasts 10 new spam e-mails containing the virus, after which the virus disables itself on that 
machine. 


16. 


17. 


18. 


19. 


20. 


2 


— 


22. 
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a. Write a recurrence relation for the number of e-mails sent at the start of the nth second. 
b. Solve the recurrence relation. 
c. How many e-mails are sent at the end of 20 seconds (that is, at the beginning of the 21st second)? 


Total natural gas consumption in the state of New Jersey was 614,908 million cubic feet in 2008 and 
653,459 million cubic feet in 2010. 


a. Assuming a constant annual percentage growth rate r, write a recurrence relation (in terms of r) for the 
total natural gas consumption in New Jersey in year n. 


b. Solve the recurrence relation (in terms of r). 

c. Using the given data, compute the value of r. 

d. What will be the total natural gas consumption in New Jersey in the year 2020? 

A loan of $5,000 is charged a 1296 annual interest rate. An $80 payment is made each month. 
a. Write a recurrence relation for the loan balance remaining at the beginning of month л. 


b. Solve the recurrence relation. (See Exercise 27 of Section 2.2 for the formula for the sum of a geometric 
sequence.) 


c. How much is left of the loan balance at the beginning of the 19th month? 


In an account that pays 396 annually, $1,000 is deposited. At the end of each year, an additional $100 is 
deposited into the account. 


a. Write a recurrence relation for the amount in the account at the beginning of year л. 


b. Solve the recurrence relation. (See Exercise 27 of Section 2.2 for the formula for the sum of a geometric 
sequence.) 


c. What is the account worth at the beginning of the 8th year? 


The shellfish population in a bay is estimated to have a count of about 1,000,000. Studies show that 
pollution reduces this population by about 2% per year, while other hazards are judged to reduce the 
population by about 10,000 per year. 


a. Write a recurrence relation for the shellfish population at the beginning of year n. 


b. Solve the recurrence relation. (See Exercise 27 of Section 2.2 for the formula for the sum of a geometric 
sequence.) 


c. What is the approximate shellfish population at the beginning of year 10? 


A certain protected species normally doubles its population each month. The initial population is 20, 
but by the beginning of the next month, 1 specimen has died of an infection. In successive months, the 
infection kills 2, then 4, then 8, and so forth. 


a. Write a recurrence relation for the size of the population at the beginning of month и. 
b. Solve this recurrence relation. 
c. What is the size of the population at the beginning of month 7? 


. A computer virus that spreads by way of e-mail messages is planted in 3 machines the first day. Each 


day, each infected machine from the day before infects 5 new machines. By the end of the second day, a 
software solution has been found to counteract the virus, and 1 machine is clean at that point. Each day 
thereafter, 6 times as many machines are clean as were clean the day before. 


a. Write a recurrence relation for the total number of infected machines on day n. 
b. Solve this recurrence relation. 

c. How many days will it be before the effects of the virus are completely gone? 
This problem concerns the Towers of Hanoi puzzle (see Exercise 82 in Section 3.1). 


a. Based on the recursive algorithm of Exercise 82 in Section 3.1, find a recurrence relation M(n) for the 
number of disk moves required to solve the Towers of Hanoi puzzle for n disks. 
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b. Solve this recurrence relation. (Hint: See Exercise 15 in Section 2.2.) 


с. Go through the steps of the solution algorithm for п = 3 and record the number of disk moves required. 
Compare this number with the result from part (b) with n = 3. 


d. The mythical origin of the Towers of Hanoi puzzle concerns 64 golden disks that a group of monks are 
moving from one tower to another. When their task is complete, the world will end. Assuming that the 
monks can move 1 disk per second, calculate the number of years to complete the task. 


23. Early members of the Pythagorean Society defined figurate numbers to be the number of dots in certain 
geometrical configurations. The first few triangular numbers are 1, 3, 6, and 10: 


AAA 


Find and solve a recurrence relation for the nth triangular number. (Hint: See Practice 7 in Section 2.2.) 


24. The first few square numbers (see the previous Exercise) are 1, 4, 9, and 16: 


NES 


Find and solve a recurrence relation for the nth square number. (Hint: See Example 14 in Section 2.2.) 
25. The first few pentagonal numbers (see Exercise 23) are 1, 5, 12, and 22: 


o € 


Find and solve a recurrence relation for the nth pentagonal number. (Hint: See Exercise 28 of Section 2.2 
for the formula for the sum of an arithmetic sequence.) 


26. Use induction to verify that equation (8) of this section is the solution to the recurrence relation (6) subject 
to the basis condition that S(1) 1s known. 
In Exercises 27—34, solve the recurrence relation subject to the initial conditions. 
27.T(1) = 5 
TQ) = 11 
T(n) = 5T(n — 1) — 67(n — 2) forn = 3 
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28 А(1)=7 

A(2) = 18 

A(n) = 6A(n — 1) — 8A(n — 2) forn = 3 
29. S(1) =4 

S(2) = -2 

S(n) = —S(n — 1) + 2S(n — 2) for n = 3 
30. P(1) = 5 

P2) = 17 

P(n) = 7P(n — 1) – 12P(n — 2) for n = 3 
31. F(1) = 8 

F(2) = 16 

F(n) = 6F(n — 1) — 5F(n — 2) forn = 3 
32. T(1) = -1 

ТО) =7 

T(n) = —AT(n — 1) – 3T(n — 2) forn = 3 
33. (1) = 3 

B(2) = 14 

B(n) = 4B(n — 1) — 4B(n — 2) forn = 3 
34. F(1) = —10 

F(2) = 40 


F(n) = —10F(n — 1) — 25F(n — 2) fr n = 3 


In Exercises 35 and 36, solve the recurrence relation subject to the initial conditions; the solutions involve 
complex numbers. 


35. A(1) = 8 
AQ) =8 
A(n) = 2A(n — 1) —2A(n — 2) forn = 3 
36. S(1) = 4 
S(2) = -8 


S(n) = —4S(n — 1) — 5S(n — 2) for n = 3 


37. Solve the Fibonacci recurrence relation 


Е(1) = 1 
Е(2) = 1 
F(n) = Еп — 1) + F(n — 2) forn > 2 


Compare your answer with Exercise 31 of Section 3.1. 
38. Find a closed-form solution for the Lucas sequence 


L(1) = 1 
L(2) = 3 
L(n) = L(n — 1) + L(n — 2) forn = 3 
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39. Houses in a new development go on sale initially for an average price of $200,000. At the beginning of 
month 2, the average sale price has risen to $250,000. At the beginning of each succeeding month, the 
average price increase is half what it was the previous month. 

a. Write and solve a recurrence relation for M(n), the average sale price at the beginning of month n. 
b. At the beginning of which month is the average price within $2,000 of $300,000? 

40. A contaminated soil site is tested monthly for the presence of a particular microorganism. Initially, 
950 microorganisms per cubic foot of soil are found; at the beginning of month 2, there are 1,000 organ- 
isms per cubic foot. Left untreated, the growth rate of this microorganism increases by 25% per month. 


a. Write and solve a recurrence relation for O(n), the number of organisms present per cubic foot at the 
beginning of month n. 


b. At the end of what month does the number of organisms first exceed 5,000 per cubic foot? 
4 


— 


. Prove that the number of binary strings of length n with no two consecutive Os is given by the Fibonacci 
sequence term (n + 2). (Hint: Write a recurrence relation; consider strings of length n that end in 1 and 
those that end in 0.) 


42. a. Find a recurrence relation for the number of binary strings of length n that have two consecutive 15. 
b. How many binary strings of length 4 have two consecutive 15? What are these strings? 


43. Consider the recurrence relation S(n) = c,S(n — 1) as a linear second-order homogeneous recurrence 
relation with constant coefficients where c; = 0. Solve this recurrence relation using its characteristic 
equation, and prove that the solution is the same as that of Equation (8). 


44. Prove that 
5(л) = pr^! афт — Yr! 
where 
p = S0) 
pr + qr = S(2) 


is a solution to the recurrence relation S(n) = c,S(n — 1) + c,S(n — 2) for all n = 1 ifr is a repeated root 
of the characteristic equation. 


In Exercises 45—48, solve the recurrence relation subject to the basis step. (Hint: See Example 15 in Section 2.2, 
and note that 2^?" = и.) 


45. P(1) = 1 


P(n) = (s) +3 гл 22, п = 2" 


46. T(1) = 3 
T(n) = д») + п forn 2 2, п = 2" 
47. 51) = 1 


S(n) = (2) + п forn 2 2, п = 2" 
48. Р(1) = 1 


Р(п) = (s) + п? їогп 2 2, п = 2" 
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SECTION 3.3 | ANALYSIS OF ALGORITHMS 


The General Idea 


Often more than one algorithm can perform the same task. Because we assume that 
all these algorithms perform correctly, we need some other basis of comparison to 
decide which algorithm to use in a given situation. Several criteria could be used 
to judge which is the “best” algorithm. We might ask, for example, which is easi- 
est to understand, which makes the most efficient use of machine memory when 
implemented as computer code, or which runs most efficiently. One might expect 
that to judge whether an algorithm is “running efficiently” means standing by with 
a stopwatch while the computer code executes. But the stopwatch approach may tell 
us more about the speed of the processor than it does the inherent efficiency of the 
algorithm. Even timing the code for competing algorithms on the same processor 
and using the same input data can give a misleading picture of what might happen 
when the input data set is larger. 

Instead, we evaluate the time efficiency of an algorithm by estimating the 
number of operations it must perform. We count only the operations that are 
basic to the task at hand, not “housekeeping” operations that make just a small 
contribution to the total work required. 

The study of the efficiency of algorithms, that is, the number of operations 
they perform, is called analysis of algorithms. Various techniques for algorithm 
analysis have been developed. Sometimes a rather straightforward analysis can be 
done simply by inspecting the algorithm. 


Following is an algorithm to write out, for each student on a grade roster, the sum 
of m quiz grades minus the lowest grade. The outer loop goes through each of the 
n students; the inner loop goes through the quizzes for the current student. For each 
student, the successive quiz grades are added and the lowest grade is ultimately 
subtracted from the sum of all the grades. These additions and subtractions seem 
fundamental to how the algorithm works, so we will count the work contributed by 
these arithmetic operations. 


for i = 1 to n do 
low = roster[i].quiz[1] 
sum = roster[i].quiz[1] 


for j — 2 to m do 
sum = sum + roster [i].quiz| j] IIA 
if roster[i].quiz[ j] < low then 
low = roster[i].quiz| j] 


end if 
end for 
sum = sum — low /IS 
write( "Total for student", i, “is”, sum) 
end for 


Subtraction occurs at the line marked //S, which is executed once for each 
pass through the outer loop (once for each student), a total of n times. Addition, 
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however, occurs at the line marked //A. This is done within the inner loop, which 
executes m — 1 times for each student, that is, for each of the n passes of the outer 
loop. The total number of additions is therefore n(m — 1). The total number of 
arithmetic operations is n + n(m — 1) = nm. Of course the value of this expression 
depends on л (the number of students) and m (the number of quizzes). The quanti- 
ties n and m measure the amount of input data; virtually any algorithm's work will 
change with the input size. 

The algorithm also does some “housekeeping” work. Values are assigned 
to variables, comparisons are done (to find the lowest quiz grade for each stu- 
dent), and for the loop indices 7 and j have to be incremented. But the number of 
times these operations are done also depends on the number of times through the 
loops, so their effect might be to multiply the лт result by some constant factor. In 
comparing algorithms А and В, we're usually looking for bigger differences than 
just a constant multiple, which is why we ignore the housekeeping details. e 


Now suppose the task is to search a sorted list of n words or numbers for a 
particular target value x. We already have one algorithm to perform this task, the 
binary search algorithm from Section 3.1. Another algorithm for the same task, the 
sequential search algorithm, simply compares x with each entry in the list in turn 
until either x is found or the list is exhausted. (This algorithm actually works on any 
list, sorted or not.) A pseudocode description of the sequential search algorithm is 
given in the following box. 


ALGORITHM 


SEQUENTIALSEARCH 


SequentialSearch(list L; integer n; itemtype x) 
//searches a list L of n items for item x 
Local variable: 
integer і //marks position in the list 
i=1 
while L[i] Æ x and i < n do 
і= 1+ 1 
end while 
if L[i] = x then 
write(“Found’’) 
else 
write(“Not found") 
end if 
end function SequentialSearch 


Both binary search and sequential search work by comparing elements from 
the list with the target value x until a match is found. In fact it is difficult to imag- 
ine how any possible search algorithm could avoid such comparisons, so these 
comparisons will be the basic operation to count in analyzing these two algorithms. 


EXAMPLE 28 
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The sequential search algorithm does the maximum amount of work (number 
of comparisons) when x is the last item in the list or when x does not appear in 
the list at all. In either case, all elements are compared to x, so n comparisons are 
done. This is the “worst case” for this algorithm, and the work depends on the size 
n of the input (the length of the list). The minimum amount of work is done when 
x is the very first item in the list; only one comparison is made. This is the “best 
case” for this algorithm. (In the algorithm of Example 27, the same number of 
arithmetic operations is always done; there is no best or worst case.) 

There are many possibilities between the best case and the worst case. If x falls 
in the exact middle of the list, the search would require roughly 1/2 comparisons. 
It would be helpful to obtain some measure ofthe "average" amount of work done. 
This measure would require some way to describe the average list being searched 
and the average relationship of a target item x to that list. Exercises 35 and 36 
in this section explore some aspects of average case analysis for the sequential 
search algorithm. For most algorithms, however, average behavior is very difficult 
to determine. To compare the efficiency of algorithms, therefore, we often content 
ourselves with the worst-case count of the number of operations required. 


Given a long string of text characters, can we find the first instance of a particular 
substring or “pattern” within the text? This problem has a number of important 
applications, such as 


* looking for specific strings in an HTML document that might be governed 
by a style rule of a Cascading Style Sheet. 


* using the UNIX grep command to search a file for a specified string, or the 
"Find" command in any text editor or word processor. 


* looking for specific gene sequences within a strand of DNA. 


DNA is a long molecule that is basically a chemically bonded chain of smaller 
molecules called nucleotides. There are four nucleotides, abbreviated as А, C, С, 
and 7. Thus a section of DNA might be represented as the sequence 


...JTAATCATGGICATAGCTGTTTICCTGTGTGAAATITG... 


DNA is stored within the cells of living organisms on chromosomes; various 
sections of these chromosomes are identified as genes. Genes, through their 
DNA "instructions," create proteins that control specific functions or traits with- 
in the organism (hair color, blood type, and so on). Thus our entire genetic code 
requires only four symbols! *Mapping the human genome", that is, determining 
the entire DNA sequence of humans, was a huge scientific undertaking, essen- 
tially completed in 2003, although the work of identifying specific genes and 
their particular function is ongoing. It is known, for example, that the disease 
cystic fibrosis is caused by a mutation in a particular gene (DNA sequence) that 
is composed of about 230,000 nucleotides. 

The most intuitive (although not the most efficient) string pattern-matching 
algorithm compares the pattern (a string of length m) against the text (a string of 
length п with n > m), starting with the first character of the text and marching 
through the pattern. 
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Р! Р, Ру. ce 


If all m characters match, the pattern has been found. If there is a mismatch at some 
point between the text and the pattern, the pattern slides over one character in the 
text, and the matching process begins again. 


papi cp Тууу А 


P, P, P3 ++ ЈР) 


т 


The last segment of text where the pattern could possibly occur is the last т 
elements of the text. This segment begins at T ,-m+1, as shown below. 


T, T, Т; 555 ИЕА Тао соо d m Е И 


IP. P, ++ Р» 


For example, if the text is 23 characters long and the pattern is 5 characters long, 
then the last piece of text that can possibly hold the pattern is 71975975,T55T55. 

The unit of work in this algorithm is comparisons between a text character and 
a pattern character. The best case occurs when the pattern is found as the first m 
characters of the text, which requires m comparisons. The worst case is when the 
pattern does not occur in the text at all, and the pattern “window” slides all the way 
over to T, „+. From each of those n — m + 1 starting points, the pattern fails 
to be found, but the worst case is if the pattern almost matches and only the last 
character fails. For example, consider the text and pattern shown: 


Шех SIILI ЛИЛИ 
Pattern: ТТТТТ5 


This example requires m comparisons (the first m — 1 are matches and only the 
mth comparison fails) at each of the n — m + 1 starting positions, making the total 
number of comparisons m(n — m + 1). e 


Analysis Using Recurrence Relations 


In this section we will analyze algorithms that are defined recursively. Because 
much of the activity of a recursive algorithm takes place “ош of sight" in the 
many invocations that can occur, an analysis using the direct counting technique 
of Example 27 won't work. Analysis of recursive algorithms usually involves 
solving a recurrence relation. 
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We can recast the sequential search algorithm from an iterative version 
(repeating some action over and over in a loop) to a recursive version. The base 
case says that you check whether you have run off the end of the list and if not, 
you check for the target x at the first position in the list. If you find it, fine, if not, 
then you invoke the algorithm again on the rest of the list. Here is pseudocode 
for the recursive function to search the list from L[i] to L[n]; the function is 
invoked initially with i = 1. 


ALGORITHM SEQUENTIALSEARCHRECURSIVE 


SequentialSearchRecursive(list L; integer i, n; itemtype x) 
//searches list L from L[i] to L[n] for item x 
if > n then 
write(“not found") 
else 
if L[;] = x then 
write(“found”) 
else 
SequentialSearchRecursive(L, i + 1, n, x) 
end if 
end if 
end function SequentialSearchRecursive 


Figure 3.3 gives a visual representation of the recursive sequential search algorithm. 
Each time the algorithm is invoked, the new list to be searched is only 1 element shorter 
than the previous list, so in the worst case the algorithm has to work quite hard. 


Search if fail, 
here; search here 
i | | 
Search if fail, 
here; search here 
4 —— ^ Н 
Search if fail, 
here; search here 
B 3 


Search if fail, 
here; search here 
4 rn 


CJ 


Search if fail, 
here; search here 
tn 


= 


Search if fail, 
here; notin list 


4 
Figure 3.3 = 
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Let C(n) represent the maximum number of comparisons required for an 
n-element list. This is a symbolic expression for an answer we are assuming we 
don’t already know, but will learn by solving the recurrence relation. By this no- 
tation, C(n — 1) symbolically represents the maximum number of comparisons 
required to search the rest of the list after the first position. The recurrence relation is 


Cd) =1 (1 comparison to search a l-element list) 
C(n) = 1+ C(n — 1) forn=2 (1 comparison against the first element, 
then however many comparisons 
are required for the rest of the list) 


This is a first-order, linear recurrence relation with constant coefficients. By 
Equation (8) in Section 3.2, the solution is 


C(n) (1 (1) + yaa) =1+(@-l=n 


This agrees with our previous analysis for the worst case. e 


Now let's do a worst-case analysis of the binary search algorithm. Recall that binary 
search is a recursive algorithm that operates on a list that is sorted in increasing 
order. It first does one comparison of the target with the midpoint value of the list. 
Ifthis comparison fails, then the process is repeated on the right half or the left half 
of the list, depending on whether the target value is greater than or less than the 
midpoint value. Figure 3.4 illustrates one possible worst-case path. 


Search if fail, test 


here; for smaller or larger 
гт 
Search if fail, test 
here; for smaller or larger 
гт 
Search if fail, test 
here; for smaller or larger 
A 
Search if fail, 
here; not in list 
4 
Figure 3.4 I 


Binary search is a divide-and-conquer algorithm, where the problem is 
decomposed recursively into significantly smaller subproblems. If the original list 
is n elements long, then half the list is at worst n/2 elements long. (In the 8-element 
list of Example 14, for instance, when 10 is the midpoint value, the right “half” of 
the list has 4 elements but the left “half” has only 3.) Cutting the list in half makes 
much faster progress than reducing the list by one element, as in sequential search, 
so we expect the worst case of binary search to require less work. 


Practice 16 
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Let C(n) stand for the maximum number of comparisons required to do a 
> n n 
binary search on an n-element list. Then С 5 stands for the maximum number 


of comparisons required to search a list half that size. If we are to keep cutting the 
list in half, it is convenient to consider only the case where we get an integer value 
each time we cut in half, so we will assume that n = 2" for some m > 0.The recur- 
rence relation for C(7) is 


C(1) = I (1 comparison to search a 
n 1-element list) 

Cin) =14C (2) Rory = Эш = 2 (1 comparison against the middle 
element, then however many 
comparisons are required for half 
the list) 


This recurrence relation was solved in the previous section (Examples 24 and 25). 
The solution is 


С(п) = 1 + logn ө 


By the preceding example, the maximum number of comparisons required 
to do a binary search on an n-element ordered list, with n = 2”, is 1 + log n. 
In Example 14, п was 8, and four comparisons (1 + log 8) were required in the 
worst case (x not in the list). A sequential search would require eight comparisons. 
Because 


1+ logn < n forn =2",n24 


binary search is almost always more efficient than sequential search. However, 
the sequential search algorithm does have one big advantage—if the list being 
searched is unsorted, the sequential search algorithm works, but the binary search 
algorithm does not. If we first sort the list and then use the binary search algo- 
rithm, we must then consider the work involved in sorting the list. Exercises 13—34 
at the end of this section ask you to count the operations required to sort a list by 
several different algorithms. 


Fill in the following table for the worst-case number of comparisons required for 
sequential search and binary search on a list of the indicated size. 


n Sequential Search Binary Search 


Although we have computed the work for binary search only on a list of size n 
where n is a power of 2, this gives us a range for the work required for values of n 
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that fall between powers of 2. That’s why limiting n to powers of 2 in our analysis 
is not particularly significant. 


Upper Bound (Euclidean Algorithm) 


The Euclidean algorithm, as presented in Section 2.3, uses a while loop to do 
successive divisions to find gcd(a, b) for positive integers a and b, a> b. To 
analyze the Euclidean algorithm, we must first decide on the operation we are 
counting. Because the Euclidean algorithm does repeated divisions, we'll take the 
division operation as our unit of work. Because a > b, we can take a as a measure 
of the size of the input values (which we usually denote by л). We want to find £E(a), 
where this denotes the amount of work (the number of divisions) required to find 
gcd(a, Б) in the worst case. 

A recursive version of the Euclidean algorithm can also be written (see 
Exercise 81, Section 3.1); the key to the recursive version is to recognize that 
gcd(a, b) involves finding gcd(b, r), where г is the remainder on dividing a by 
b. We've just seen a case where the operations of a recursive algorithm (binary 
search) could be expressed neatly as a recurrence relation where the input size gets 
halved after each operation. A recurrence relation would express E(a) in terms of 
Е at smaller values. But what are these smaller values? To find gcd(a, b) we find 
gcd(b, r), so it is clear that the input size is getting smaller, but in what way? Con- 
sider Example 27 of Section 2.3, where to find gcd(420, 66) the following divisions 
were performed: 


6 2 1 3 
66)420 24)66 18)24 6)18 
396 48 18 18 

24 18 6 0 


Here the successive values being divided are 420, 66, 24, 18. The change from 
420 to 66 is much larger than cutting in half, while the change from 24 to 18 is less. 

In fact, we won't find a recurrence relation or an exact expression for E(a). But 
we will at least find an upper bound for E(a). An upper bound is a ceiling on the 
amount of work an algorithm does; the algorithm can require no more steps than 
the upper bound, but it may not require that many. 

To find this upper bound, we will show that if i > j and i is divided by j with 
remainder r, then r < 1/2. There are two cases: 


1. Ifj x i/2, then r < i/2 because r < j. 
2. Ifj > 12, theni = 1 *j + (i — j); in other words, the quotient is 1 and r is 
i — j, which is < 1/2. 


In the Euclidean algorithm, the remainder r at any step becomes the dividend 
(the number being divided) two steps later. So the successive dividends are at least 
halved every two divisions. The value a can be halved log n times, therefore at 
most 2 log п divisions are done. Thus 


E(a) = 21oga (1) 


The value of 2 log a for a — 420 is almost 18, whereas it took only 4 divisions 
to find ged(420, 66). Evidently this upper bound estimate is rather loose, like 
saying that every student in this class is under 12 feet in height. An improved (that 
is, lower) upper bound is derived in Exercises 37—40 at the end of this section. 


SPECIAL INTEREST PAGE 
Of Trees ... and Pancakes 


Of Trees ... 


Mapping the evolutionary “tree of life" has been 
the subject of research since Charles Darwin. Until 
recently, this research sought to find similarities be- 
tween species based on structural properties such 
as skeletons, but today scientists search for similari- 
ties in DNA and other genetic evidence. This field of 
research, called phylogenetics, can involve aligning 
the molecular sequences of many thousands of species, 
and the work becomes an enormous computational 
problem. Researchers at the University of Texas have 
developed a software package called SATé (and a new 
and improved SATé-ID)— Simultaneous Alignment 
and Tree Estimation—that uses a divide-and-conquer 
algorithm. Huge data sets are divided into small data 
sets, alignments are found for the small sets, and then 
the results are combined to determine an overall align- 
ment (and a likely tree) for the full data set. The re- 
sulting full alignment isn't foolproof, and the software 
repeats this process many times, creating new align- 
ments and trees. A statistical “maximum likelihood” 
method selects the best result by comparison with 
known answers. This approach has been proven to 
produce results comparable to other, slower methods, 
or to produce more accurate results in the same amount 
of time. 

Sequence alignment and evolutionary tree-building 
tools have applications to areas other than tracing the 
path of historical evolution. For example, the Centers 
for Disease Control use them to detect how a newly 
emerging virus differs from previous viruses in order 
to plan the best counterattack. 


http://www.tacc.utexas.edu/news/feature-stories/2012/ 
tree-of-life 
http://www.ncbi.nlm.nih.gov/pubmed/22139466 


ө Chapter 


.. and Pancakes 


A problem posed in the American Mathematical 
Monthly in 1975 by Jacob Goodman concerned a 
waiter in a café where the cook produced a stack of 
pancakes of varying sizes. The waiter, on the way to 
delivering the stack to the customer, attempted to ar- 
range the pancakes in order by size, with the largest 
on the bottom. The only action available was to stick 
a spatula into the stack at some point and flip the en- 
tire stack above that point. The question is: What is the 
maximum number of flips ever needed for any stack 
of n pancakes? This number, P,, is known as the nth 
pancake number. 

Here's a fairly simple algorithm to arrange the pan- 
cakes. Put the spatula under the largest pancake, and flip. 
This puts the largest pancake on top. Put the spatula at 
the bottom of the unordered section (in this case at the 
bottom) and flip. This puts the largest pancake on the 
bottom, where it belongs. Repeat with the rest of the pan- 
cakes. Each pancake therefore requires two flips, which 
would give a total of 27 flips required. But the last two 
pancakes require at most one flip; if they are already 
in order, no flips are needed, and if they are out of or- 
der, only one flip is needed. So this algorithm requires 
at most 2(n — 2) + 1 = 2n — 3 flips in the worst case, 
which means that P, = 2n — 3. Are there other algo- 
rithms that require fewer flips in the worst case? 

A faculty member at Harvard University posed 
this question to his class; several days later, a sopho- 
more from the class came to his office with a better 
algorithm. This algorithm, which requires at most 
(5n + 5)/3 flips, was published in the journal Discrete 
Mathematics in 1979. The authors were William Gates 
(the student) and Christos Papadimitriou. 

Yes, THAT William Gates. 


п, 


From SCHNEIDER/GERSTING. /nvitation to Computer 
Science, 6/E. © 2013 South-Western, a part of Cengage 
Learning, Inc. Reproduced by permission. 
www.cengage.com/permissions 
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SECTION 3.3 REVIEW 


TECHNIQUE * Analysis of recursive algorithms often leads to re- 
currence relations. 

* Lacking an exact expression for the number of op- 
erations an algorithm performs, it may be possible 
to find an upper bound. 


* Do a worst-case analysis of an algorithm either 
directly from the algorithm description or from a 
recurrence relation. 


MAIN IDEAS 


* Analysis of an algorithm estimates the number of 
basic operations that the algorithm performs, which 
is dependent on the size of the input. 


EXERCISES 3.3 


1. Modify the algorithm of Example 27 so that in addition to dropping the student’s lowest quiz grade, the 
highest quiz grade is counted twice (like the old version, your new algorithm should do no operations 
besides addition and subtraction). 


2. What is the total number of arithmetic operations done in the algorithm of Exercise 1? 


3. The following algorithm adds all the entries in a square n X n array A. Analyze this algorithm where the 
work unit is the addition operation. 


sum — 0 
for i = 1 to n do 
forj = 1 ton do 
sum = sum  A[i, j] 
end for 
end for 
write (“Total of all array elements is", sum) 


4. The following algorithm adds all the entries in the “upper triangular" part of a square n X n array A. 
Analyze this algorithm where the work unit is the addition operation. 


sum — 0 
for = 1 ton do 
forj = kto n do 
sum = sum + A[k, j] 
end for 
end for 
write (“Total of all upper triangular array elements is”, sum) 


5. Analyze the following algorithm where the work unit is the output statement. Assume that n = 2” for 
some positive integer m. 


integer j, k 
for k = 1 ton do 
у=, 
while j = 2 do 
write j 
j^j2 
end while 
end for 
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6. Analyze the following algorithm where the work unit is the output statement. (Hint: One of the exercises 
in Section 2.2 might be helpful). 


integer i 
real d, x; 
for i = 1 ton do 
d = 1.0/i; 
х= |; 
while x > 0 do 
write x 
x=x-d; 
end while 
end for 


Exercises 7 and 8 involve л! = n(n — 1)(n – 2) ·· · 1. 


7. a. Write the body of an iterative function to compute n! for n = 1. 


a. 
b. Analyze this function where the work unit is the multiplication operation. 


8. a. Write a recursive function to compute л! for n = 1. 
b. Write a recurrence relation for the work done by this function where multiplication is the unit of work. 
c. Solve the recurrence relation of part b. 
d. Compare your answer in part c to your result in Exercise 7b. 
Exercises 9 and 10 involve evaluating a polynomial a,x" + a, x" ! + +++ + ag for a specific value of x. 


9. A straightforward algorithm to evaluate a polynomial is given by the following function: 


Poly(real a,, real a, .,, ... , real ao, real c, integer n) 

/levaluates polynomial a,x" + a, ix" +: + do forx =c 
Local variables: 

integer i 

real sum = а, 

real product = 1 


fori = 1 to n do 
product = product * с 
sum = sum + a; * product 
end for 
return sum 
end function Poly 


a. Walk through this algorithm to compute the value of 2x? — 7x? + 5x — 14 for x = 4. 


b. The algorithm involves both additions and multiplications; analyze this algorithm where those operations 
are the work units. 


10. An alternative to the polynomial evaluation algorithm in Exercise 9 is an algorithm called Horner 5 
method. Horner’s method relies on an alternative expression for a polynomial, for example 


2x3 — 72 + 5x — 14 = —14 + x(5 + х(—7 + x(2))) 
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Horner(real a,, real a, .,, ... , real ap, real c, integer п) 
//evaluates polynomial ax" + a, x" ! + -+a forx=c 
//using Horner's method 

Local variables: 

integer i 

real result = a, 


for i = | to n do 
result = result * c + a 
end for 
return result 
end function Horner 


n—i 


а. Walk through this algorithm to compute the value of 2x? — 7x? + 5x — 14 for x = 4. 
b. Analyze this algorithm where addition and multiplication operations are the work units. 


с. In evaluating a polynomial of degree n = 98 for some value of x, how many operations have been saved 
by using Horner's method over the method of Exercise 9? 


— 
— 


. For the algorithm of Example 27, count the total number of assignments and comparisons done in the best 
case (least work) and the worst case (most work); describe each of these cases. 


12. a. Write a function to convert a binary string Б„Б„_ |... bibo to its decimal equivalent. 
b. Test your function on the binary string 10011 


c. Describe the worst case for this algorithm and find the number of multiplications and additions done in 
this case. 


d. Describe the best case for this algorithm and find the number of multiplications and additions done in 
this case. 


Exercises 13 and 14 relate to a recursive sorting algorithm called BubbleSort. 


13. Algorithm BubbleSort works by making repeated passes through a list; on each pass, adjacent elements 
that are out of order are exchanged. At the end of pass 1, the maximum element has “bubbled up” to the 
end of the list and does not participate in subsequent passes. The following algorithm is called initially 
with = n. 


BubbleSort(list L; integer j) 
//recursively sorts the items from 1 to j in list L into increasing order 


ifj — 1 then 
sort is complete, write out the sorted list 
else 
fori = 1 toj —1 do 
if L[i] > L[i + 1] then 
exchange L[i] and Ці + 1] 
end if 
end for 
BubbleSort(L, j — 1) 
end if 
end function BubbleSort 


14. 
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Walk through algorithm BubbleSort to sort the list 5, 6, 3, 4, 8, 2. 


. Write a recurrence relation for the number of comparisons of list elements done by this algorithm to sort 


an n-element list. 
Solve this recurrence relation. 


In algorithm BubbleSort, suppose we include exchanges of list elements as a work unit, in addition to 
comparisons between list elements. 


Describe the worst case and find the number of comparisons and exchanges done in this case. 


. Describe the best case and find the number of comparisons and exchanges done in this case. 


c. Assume that on the average exchanges between elements must be done about half the time. Find the 


number of comparisons and exchanges done in this case. 


Exercises 15—18 refer to the recursive algorithm SelectionSort of Section 3.1. 


15. 


16. 


17. 
18. 


In one part of algorithm SelectionSort, the index of the maximum item in a list must be found. This requires 
comparisons between list elements. In an n-element (unsorted) list, how many such comparisons are needed 
in the worst case to find the maximum element? How many such comparisons are needed in the average case? 


Defining the basic operation as the comparison of list elements and ignoring the amount of work required 
to exchange list elements, write a recurrence relation for the amount of work done by selection sort on an 
n-element list. (Hint: Use the result from Exercise 15.) 

Solve the recurrence relation of Exercise 16. 


Assume that the exchange of L[i] and L[j] takes place even if i = j. Write an expression for the total 
number of comparisons and exchanges done to sort an n-element list. 


Exercises 19—24 relate to a recursive sorting algorithm called MergeSort, which is described as follows: A one- 
element list is already sorted; no further work is required. Otherwise, split the list in half, sort each half using 
MergeSort (this is the recursive part), and then merge the two halves back into one sorted list. 


19. 


20. 


21. 


22. 


23. 


24. 


The merge part of algorithm MergeSort requires comparing elements from each of two sorted lists to see 
which goes next into the combined, sorted list. When one list runs out of elements, the remaining elements 
from the other list can be added without further comparisons. Given the following pairs of lists, perform 
a merge and count the number of comparisons to merge the two lists into one. 


а. 6, 8,9 and 1,4,5 
b. 1,5, 8 and 2, 3, 4 
с. 0, 2, 3, 4, 7, 10 and 1, 8, 9 


Under what circumstances will the maximum number of comparisons take place while merging two sorted 
lists? If the lengths of the lists are r and s, what is the maximum number of comparisons? 

Write a recurrence relation for the number of comparisons between list elements done by algorithm 
MergeSort in the worst case. Assume that n = 2”. 

Solve the recurrence relation of Exercise 21. 

Use the results of Exercises 18 and 22 to compare the worst-case behavior of SelectionSort (counting 
comparisons and exchanges) and MergeSort (counting comparisons) for n = 4, 8, 16, and 32 (use a 
calculator or spreadsheet). 

Use the results of Exercises 14 and 22 to compare the worst-case behavior of BubbleSort (counting 
comparisons and exchanges) and MergeSort (counting comparisons) for n = 4, 8, 16, and 32 (use a 
calculator or spreadsheet). 


Exercises 25—34 relate to a recursive sorting algorithm called QuickSort, which is described as follows: A 
one-element list is already sorted; no further work is required. Otherwise, take the first element in the list, call it the 
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pivot element, then walk through the original list to create two new sublists, / and L}. L, consists of all elements 
that are less than the pivot element апа L, consists of all elements that are greater than the pivot element. Put the 
pivot element between L, and L,. Sort each of L1 and L2 using QuickSort (this is the recursive part). Eventually 
all lists will consist of 1 element sublists separated by previous pivot elements, and at this point the entire original 
list is in sorted order. This is a little confusing, so here is an example, where pivot elements are shown in brackets: 


23. 
26. 
27. 
28. 
29. 


30. 
31. 


32. 
33. 


34. 


Original list: 6, 2, 1, 7, 9, 4, 8 

After Ist pass: 2, 1, 4, [6], 7,9, 8 

After 2nd pass: 1, [2], 4, [6], [7], 9, 8 

After 3rd pass: 1, [2], 4, [6], [7], 8, [9] Sorted 

Illustrate QuickSort as above using the list 9, 8, 3, 13. 

Illustrate QuickSort as above using the list 8, 4, 10, 5, 9, 6, 14, 3, 1, 12, 11. 

How many comparisons between list elements are required for pass 1 of QuickSort in the example list? 
How many comparisons between list elements are required for pass 1 of QuickSort on an n-element list? 


Suppose that for each pass, each pivot element splits its sublist into two equal-length lists, each approxi- 
mately half the size of the sublist (which is actually very difficult to achieve). Write a recurrence relation 
for the number of comparisons between list elements in this case. 


Solve the recurrence relation of Exercise 29. 


Suppose that for each pass, each pivot element splits its sublist (which has k elements) into one empty list 
and one list of size k — 1. Write a recurrence relation for the number of comparisons between list elements 
in this case. 


Solve the recurrence relation of Exercise 31. 


Unlike the situation described in Exercise 29 where each pivot element splits the sublist in half for the next 
pass, the situation described in Exercise 31 can easily occur. Describe a characteristic of the original list 
that would cause this to happen. 


Exercise 29 describes the best case of QuickSort and Exercise 31 describes the worst case of QuickSort 

with respect to comparisons between list elements. 

a. To which sorting algorithm (SelectionSort, BubbleSort, MergeSort) is the best case of QuickSort 
comparable in the number of comparisons required? 


b. To which sorting algorithm (SelectionSort, BubbleSort, MergeSort) 1s the worst case of QuickSort 
comparable in the number of comparisons required? 


Exercises 35 and 36 refer to algorithm SequentialSearch. It is not hard to do an average case analysis of the 
sequential search algorithm under certain assumptions. Given an n-element list and a target value x for which 
we are searching, the basic operation is a comparison of list elements to x, hence an analysis should count how 
many times such an operation is performed “on the average." The definition of “average” is shaped by our 
assumptions. 


35. 


Assume that x is in the list and is equally to be found at any of the n positions in the list. Fill in the rest of 
the table giving the number of comparisons for each case. 


Position at Which x Occurs | Number of Comparisons 
1 1 
2 
3 
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Find the average number of comparisons by adding the results from the table and dividing by n. (Hint: See 
Practice 7 of Section 2.2—we told you that you should remember this!) 


36. Find the average number of comparisons under the assumption that x is equally likely to be at any of the 
n positions in the list or not in the list. 


Exercises 37—40 concern a better upper bound for the number of divisions required by the Euclidean algorithm 
in finding gcd(a, b). Assume that a and b are positive integers with a > b. 


37. Suppose that m divisions are required to find gcd(a, b). Prove by induction that for m = 1, it is true that 
a > Ет + 2) and b > F(m + 1), where F(n) is the Fibonacci sequence. (Hint: To find gcd(a, b), after the 
first division the algorithm computes рса(р, r).) 

38. Suppose that m divisions are required to find gcd(a, b), with m = 4. Prove that 

3 


ml 
(3) <F(m+2)<a 


(Hint: Use the result of Exercise 37 here and Exercise 26 of Section 3.1.) 

39. Suppose that m divisions are required to find gcd(a, b), with m = 4. Prove that m < (log; 5 a) — 1. 

(Hint: Use the result of Exercise 38.) 

40. a. Compute gcd(89, 55) and count the number of divisions required. 

b. Compute the upper bound on the number of divisions required for gcd(89, 55) using Equation (1). 

c. Compute the upper bound on the number of divisions required for gcd(89, 55) using the result of 
Exercise 39. 

d. The eighteenth-century French mathematician Gabriel Lamé proved that an upper bound on the number 
of division done by the Euclidean algorithm to find gcd(a, b) where a > b is 5 times the number of 
decimal digits іп b. Compute the upper bound on the number of divisions required for gcd(89, 55) using 
Lamé's theorem. 
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SELF-TEST 


Answer the following true-false questions without looking back in the chapter. 


Section 3.1 
1. A sequence defined by 


1) = 7 


5(п) = 35(п—1)+2_ forn=2 


contains the number 215. 
2. Acollection T of numbers is defined recursively by 


1. бапа 8 belong to T 
2. If X and Y belong to 7, so does X + 2Y 


Every even number > 18 belongs to 7. 
3. Recursive algorithms are valuable primarily because 
they run more efficiently than iterative algorithms. 
4. [n the recursive algorithm SelectionSort, changing 
one line of the algorithm to 


“find the index 7 of the minimum item in L 
between | апау” 


sorts the list L in decreasing order. 
5. In applying the binary search algorithm to the list 


2, 5, 7, 10, 14, 20 


where x = 8 is the target item, x is never compared 
to 5. 


Section 3.2 

1. A closed-form solution to a recurrence relation 15 
obtained by applying mathematical induction to the 
recurrence relation. 


ON THE COMPUTER 


For Exercises 1—7, write a computer program that 
produces the desired output from the given input. 


1. Input: Binary string 
Output: Message indicating whether the input 
string is a palindrome (see Practice 7) 
Algorithm: Use recursion. 


2. Input: String of characters x and a positive integer n 
Output: Concatenation of n copies of x 
Algorithm: Use recursion. 


2. S(n) = 2S(n — 1) + 3S(n — 2) + 5n is a linear, 
first-order recurrence relation with constant 
coefficients. 


n 


3. S(n) = c" !S(1) + У eigi) is a closed-form 

i=2 
solution to any linear first-order recurrence relation 
with constant coefficients. 

4. The solution to the recurrence relation S(7) = 
c,S(n — 1) + с,5(и — 2) involves solving the char- 
acteristic equation 2 — cıt — c, = 0. 

5. Divide-and-conquer algorithms lead to recurrence 
relations that are not first-order. 


Section 3.3 

1. Analysis of an algorithm generally finds the amount 
of work done in the worst case because it is too 
difficult to analyze an average case. 

2. |n the worst case, the string pattern-matching 
algorithm requires n +m comparisons, where 
n — the text size and m — the pattern size. 

3. Binary search is more efficient than sequential 
search on a sorted list of more than three elements. 

4. The recursive version of the sequential search 
algorithm is a divide-and-conquer algorithm. 

5. An upper bound for the Euclidean algorithm gives 
a ceiling on the number of divisions required to find 
gcd(a, b). 


(Some programming languages provide built- 
in string manipulation capabilities, such as 
concatenation.) 


3. Input: Positive integer n 
Output: nth value in the Fibonacci sequence using 


a. iteration. 
b. recursion. 
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Now insert a counter in each version to indicate the 
total number of addition operations done. Run each 
version for various values of n and, on a single graph, 
plot the number of additions as a function of n for each 
version. 


4. 


Input: Two positive integers a and b with a — b 
Output: gcd(a, b) using 

a. the iterative version ofthe Euclidean algorithm 
b. arecursive version of the Euclidean algorithm 


. Input: Unsorted list of 10 integers 


Output: Input list sorted in increasing order 
Algorithm: Use the recursive selection sort of 
Example 12. 


. Input: Sorted list of 10 integers and an integer x 


Output: Message indicating whether x is in the list 
Algorithm: Use the binary search algorithm of 
Example 13. 


7. Input: Text string, pattern string 


Output: Location of beginning of pattern string in 
text string, or a message that the pattern string is 
not found within the text string 

Algorithm: See Example 28. 


. The value (1 + V5, known as the golden ratio, 


is related to the Fibonacci sequence by 
Кт +1) 1+%5 
Fm 2 
Verify this limit by computing F(n + 1)/F(n) for 


n — 10, 15, 25, 50, and 100 and comparing the 
result with the golden ratio. 


n— oo 


. Compare the work done by sequential search and bi- 


nary search on an ordered list of n entries by comput- 
ing п and 1 + log п for values of n from 1 to 100. 
Present the results in graphic form. 
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| CHAPTER OBJECTIVES | ө 


Question: 


After studying this chapter, you will be able to: 


Use the notation of set theory. 

Find the power set of a finite set. 

Find the union, intersection, difference, complement, and Cartesian product 
of sets. 

Identify binary and unary operations on a set. 

Prove set identities. 

Recognize that not all sets are countable. 

Apply the multiplication principle and the addition principle to solve counting 
problems. 

Use decision trees to solve counting problems. 

Use the principle of inclusion and exclusion to find the number of elements in 
the union of sets. 

Use the pigeonhole principle to decide when certain events must occur. 

Use the formulas for permutations and combinations of r objects, with and 
without repetition, from a set of n distinct objects. 

Find the number of distinct permutations of objects that are not all distinct. 
Generate all permutations of n distinct objects, and all combinations of r out 
of п distinct objects. 

Find the probability of an event given that all outcomes are equally likely, that 
a probability distribution has been assigned to the outcomes, or that another 
event has already occurred. 

Compute the expected value of a quantity with an assigned probability 
distribution. 

Use the binomial theorem to expand (a + bY". 


You survey the 87 computer users who subscribe to your electronic newsletter in 
preparation for the release of your new software product. The results of your survey 
reveal that 68 have a Windows-based system available to them, 34 have a Linux 
system available, and 30 have access to a Mac. In addition, 19 have access to both 
Windows and Linux systems, 11 have access to both Linux systems and Macs, and 
23 can use both Macs and Windows. 


How many of your subscribers have access to all three types of systems? 
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This is an example of a counting problem; you want to count the number of ele- 
ments in a certain collection or set—the set of all subscribers with access to all 
three systems. A formula that easily solves this counting problem is developed in 
Section 4.3. 

Set theory is one of the cornerstones of mathematics. Many concepts in math- 
ematics and computer science can be conveniently expressed in the language of 
sets. Operations can be performed on sets to generate new sets. Although most sets 
of interest to computer scientists are finite or countable, there are sets with so many 
members that they cannot be enumerated. Set theory is discussed in Section 4.1. 

It is often of interest to count the number of elements in a finite set. This may 
not be a trivial task. Section 4.2 provides some ground rules for counting the num- 
ber of elements in a set consisting of the outcomes of an event. Counting the ele- 
ments in such a set can be made manageable by breaking the event down into a se- 
quence of subevents or into disjoint subevents that have no outcomes in common. 
Some specialized counting principles appear in Section 4.3. Section 4.4 provides 
formulas for counting the number of ways to arrange objects in a set and to select 
objects from a set, as well as algorithms to generate all the possible arrangements 
or selections. Section 4.5 discusses the binomial theorem, an algebraic result that 
can also be viewed as a consequence ofthe counting formulas. Finally, Section 4.6 
extends "counting" to the more general idea of probability. 


SECTION 4.1 | SETS 


Definitions are important in any science because they contribute to precise commu- 
nication. However, if we look up a word in the dictionary, the definition is expressed 
using other words, which are defined using still other words, and so on. Thus, we 
have to have a starting point for definitions where the meaning is taken to be under- 
stood; our starting point in this discussion will be the idea of a set, a term that we 
will not formally define. Instead, we will simply use the intuitive idea that a set is a 
collection of objects. Usually all of the objects in a set share some common property 
(aside from that of belonging to the same set!); any object that has the property is a 
member of the set, and any object that does not have the property is not a member. 
(This is consistent with our use of the word set in Section 3.1, where we talked about 
the set of propositional well-formed formulas, the set of all strings of symbols from 
a finite alphabet, and the set of identifiers in some programming language.) 


Notation 


We use capital letters to denote sets and the symbol € to denote membership 
in a set. Thus a € A means that object a is a member, or element, of set 4, and 
b ¢ A means that object b is not an element of set A. Braces are used to indicate 
a set. 


|. EXAMPLE: | If A = (violet, chartreuse, burnt umber}, then chartreuse є A and magenta ¢ 4. ө 


No ordering is imposed on the elements in a set; therefore (violet, chartreuse, 
burnt umber} is the same as {chartreuse, burnt umber, violet}. Also, each element 
of a set is listed only once; it is redundant to list it again. 
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Two sets are equal if they contain the same elements. (In a definition, “if” 
really means “if and only if”; thus two sets are equal if and only if they contain 
the same elements.) Using predicate logic notation, 


А = B means (Vx)[(x € 4—»5 xe B)^(xe B—xe A)] 


In describing a particular set, we have to identify its elements. For a finite set (one 
with п elements for some nonnegative integer л), we might do this by simply listing 
all the elements, as in set А of Example 1. Although it is impossible to list all ele- 
ments of an infinite set (one that is not finite), for some infinite sets we can indicate 
a pattern for listing elements indefinitely. Thus, we might write (2, 4, 6, ...} to 
express the set S of all positive even integers. (Although this is a common prac- 
tice, the danger exists that the reader will not see the pattern that the writer has in 
mind.) S can also be defined recursively by giving an explicit member of S and then 
describing other members of S in terms of already known members. For example, 


1. 2є5 
2. Ifn eS, then (п + 2) є 5 


But the clearest way to describe this particular set 5 is to describe the 
characterizing property of the set elements in words and write 


S = {х |х is a positive even integer} 


read as “the set of all x such that x is a positive even integer.” 
We’ve now given three ways to describe a set: 


1. List (or partially list) its elements. 
2. Use recursion to describe how to generate the set elements. 
3. Describe a property P that characterizes the set elements. 


Later in this section we’ll see that there are sets for which the first approach won’t 
work; often the second approach is difficult to use. The third method is usually 
the best choice. 

The notation for a set S whose elements are characterized as having property 
Р is {х | P(x)}. Property P here is a unary predicate; this term was introduced in 
Chapter 1. For any given x, P(x) is either true or false. In fact, the formal logic 
notation of Chapter 1 again comes to the rescue to clarify what we mean by a 
characterizing property of a set’s elements: 


S = {x|P(x)} means (Vx)[(x є S— P(x)) ^ (Р(х) > x e S)] 


In words, every element of S has property P and everything that has property P 
is an element of S. 


PRACTICE 1 | Describe each of the following sets by listing its elements. 


а. {x|x is an integer and 3 < x = 7) 
b. {x|x is a month with exactly 30 days} 
c. {x|x is the capital of the United States} и 


РВАСТІСЕ 2 | Describe each of ће following sets by giving a characterizing property. 


a. (1,4,9, 16} 


b. {the butcher, the baker, the candlestick maker} 
Cs 1122. Sk Oy 7 TU у п 
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It is convenient to name certain standard sets so that we can refer to them 
easily. We will use 


N = set of all nonnegative integers (note that 0 є N) 
Z = set of all integers 

Q = set of all rational numbers 

R = set of all real numbers 

C = set of all complex numbers 


Sometimes we will also want to talk about the set with no elements (the empty set, 
or null set), denoted by С or { }. For example, if S = {x|x € N and x < 0), then 
S = ©. Note that Ø, the set with no elements, is not the same as {0}, which is a 
set with a single element where the single element is the empty set. 


| EXAMPLE2 | Suppose that a set А is described as 
A = {x| Ey) = {0, 1, 2} and x = у?)} 


Because y is not a free variable here, this is still of the form А = {х | P(x)}. The 
members of A can be found by letting y assume each of the values 0, 1, and 2 and 
then taking the cube of each such value. Therefore A = {0, 1, 8}. For 


В = (x|x e Nand (3y)(v € N and x = y)} 


choosing y = 0 gives x = 0; choosing y = 1 givesx = 0 or 1; choosing y = 2 gives 
x = 0, 1, or 2; and so on. In other words, В consists of all nonnegative integers 
that are less than or equal to some nonnegative integer, which means that B — N. 
For the set 


С = {x|x e Nand (Vy (vy eNx x y) 


0 is the only nonnegative integer that is less than or equal to every nonnegative 
integer, so C = {0}. ө 


РВАСТ!СЕ 3 | Describe each set. 


а. А = {х|хє Nand (Му)(ує {2, 3, 4,5} >x =y)} 
b. В = {x|(y)(z)(y e {1,2} andz e (2,3) andx = у + z)) п 


Relationships Between Sets 


For А = {2, 3, 5, 12} and В = {2, 3, 4, 5, 9, 12}, every member of A is also a 
member of В. When this happens, А is said to be a subset of В. 


PRACTICE 4 | Complete the definition: A is a subset of B if 


(Vxy(xeA— +) " 
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If A is a subset of B, we denote the relationship by A C B. If A C B but A # B (there 
is at least one element of B that is not an element of A), then А is a proper subset 
of B, denoted by A C B. 


PRACTICE 5 | Use formal logic notation to define A C B. 


АЕ 7915) 
R= 47O} 
@ = 17 9 1520} 


Then the following statements (among others) are all true: 
BEE ISee 
BCA ti, 9» (Є 5 


BCA ҮЛ СД 
АФС ОСС 


REMINDER 


Be sure you understand 


the difference between , | 
the symbols є (element o) The last statement (Ø C C) is true because the statement (Vx)(x € © — x € C) is 


and C (subset of). true because x € @ is always false. ө 


PRACTICE 6 | Let 


A = {x|x e Nand x = 5} 
B = (10, 12, 16, 20} 
C= {x|Gy)(v e Nand x = 2y)} 


Which of the following statements are true? 


а BCC g. {12} eB 

b. BCA h (12. elu 

CA cc i. (x|xe Nandx < 20} ¢ B 

d. 26€ С j 5CA 

e {11,12,13)CA К (rcm 

f 411,12,13}сС я н 


Suppose that В = {x | Р(х)} and that А С В. Because every element of A is 
also an element of B, and P is a property characterizing all elements of B, then 
every element in A also has property P(x). The elements of A “inherit” property 
Р. In fact, to prove that A С В, we pick an arbitrary x є A and show that P(x) 


| 226 Sets, Combinatorics, and Probability 


holds. If A is a proper subset of B, A’s elements will usually have some additional 
characterizing property not shared by all elements of B. (This is the same notion 
of “inheritance” that prevails when a child type, or subtype, or derived type is de- 
fined in an object-oriented programming language. The child type inherits all of 
the properties and operations from the parent type with the addition of specialized 
local properties or operations as needed.) 


Let 

B = {x|x is a multiple of 4} 
and let 

A = {х |х is a multiple of 8} 


Then we have А С В. To prove it, let x є A; note that x is a completely arbitrary 
member of A. We must show that x satisfies the characterizing property of B; 
in other words, we must show that x is a multiple of 4. Because we have x € A, x 
satisfies the characterizing property of A; that is, x is a multiple of 8 and thus 
we can write x = т · 8 for some integer m. This equation can be written as 
x—m:2:40rx = Е: 4, where К = 2m, so kis an integer. This shows that x is a 
multiple of 4, and therefore x € В. 

There are numbers (like 12) that are multiples of 4 but not multiples of 8, so 
A С B. Another way to describe А is 


A = {x|x = Е: 4and k is an even number} 
In this form it is clear that A's elements have inherited the characterizing property 


of B—being a multiple of 4—but that there is an additional restriction that makes 
A less general than В. ө 


PRACTICE 7 | Let 


A = {x|x e Randx — 4x + 3 =0} 
В = {x|xeNand1 sx = 4) 


Prove that А С В. п 


We know that А апа В are equal sets if they have the same elements. We can 
restate this equality in terms of subsets: A = В if and only if 4 C B and B с A. 
Proving set inclusion in both directions is the usual way to establish the equality 
of two sets. 


REMINDER 


To find gS), start with Ø. 
Then add sets taking 1 ele- 
ment from S at a time, then 
2 elements at a time, then 
3 at a time, and so forth. 


PRACTICE 8 | For A = (1,2, 3}, what is (4)? 
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We will prove that {x |x є N and x? < 15} = {x|x e Nand 2x < 7}. 

Let A = {x|x e Nand x? < 15} and B = {x|x € N and 2x < 7). To show that 
A = В, we show А С В and В С A. For A С В, we must choose an arbitrary 
member of A—that is, anything satisfying the characterizing property of A—and 
show that it also satisfies the characterizing property of B. Let x є A. Then х is a 
nonnegative integer satisfying the inequality х2 < 15. The nonnegative integers 
with squares less than 15 are 0, 1, 2, and 3, so these integers are the members of A. 
The double of each of these nonnegative integers is a number less than 7. Hence, 
each member of A is a member of B, and A С В. 

Now we show В C А. Any member of В is a nonnegative integer whose 
double is less than 7. These numbers are 0, 1, 2, and 3, each of which has a square 
less than 15, so B C A. e 


Sets of Sets 


For a set S, we can form a new set whose elements are all of the subsets of S. This 
new set is called the powerset of S, (S). 


For S = (0, 1}, e(S) = (2, {0}, {1}, (0, 133. Note that the members of the power 
set of a set are themselves sets. e 


For any set S, (S) will always have at least (Z and S itself as members, since 
@ C S and S C S are always true. 


PRACTICE 9 


In Practice 8, A has 3 elements and (4) has 8 elements. Try finding g(S) for 
other sets S until you can guess the answer to the following practice problem. 


If S has n elements, then (5) has elements. (Does your answer work for 
n — 0,too?) 


There are several ways we can show that for a set 5 with n elements, (S) will 
have 2" elements. The following proof uses induction. For the basis step of the 
induction, we let л = 0. The only set with 0 elements is (Z. The only subset of (Z 
is Ø, so 9((2) = {Ø}, aset with 1 = 2° elements. We assume that for any set with 
k elements, the power set has 2^ elements. 


PRACTICE 10 
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Given that (2x — y, x + у) = (7, — 1), solve for x and у. 


Now let S have k + 1 elements and put one of these elements, call it x, aside. 
The remaining set has k elements, so by our inductive assumption, its power set has 
2* elements. Each of these elements is also a member of (5). The only members of 
@(5) not counted by this procedure are those including element x. All the subsets in- 
cluding x can be found by taking all those subsets not including x (of which there are 
2^) and throwing in the x; thus, there will be 2^ subsets including x. Altogether, there 
are 2^ subsets without x and 2* subsets with x, or 2^ + 2* = 2 - 2* = 20! subsets. 
Therefore, (5) has 2‘*' elements. 

Analogy with the truth tables of Section 1.1 is another way to show that g(S) 
has 2” elements for a set S with n elements. There we had п statement letters and 
showed that there were 2” true-false combinations among these letters. But we can 
also think of each true-false combination as representing a particular subset, with 
T indicating membership and F indicating nonmembership in that subset. (For 
example, the row of the truth table with all statement letters F corresponds to the 
empty set.) Thus, the number of true-false combinations among n statement letters 
equals the number of subsets of a set with л elements; both are 2”. 


Binary and Unary Operations 


By itself a set is not very interesting until we do something with its elements. For 
example, we can perform several arithmetic operations on elements of the set 
Z. We might subtract two integers, or we might take the negative of an integer. 
Subtraction acts on two integers; it is a binary operation on Z. Negation acts on 
one integer; it is a unary operation on Z. 

To see exactly what is involved in a binary operation, let’s look at subtraction 
more closely. For any two integers x and y, x — y produces an answer and only one 
answer, and that answer is always an integer. Finally, subtraction is performed on 
an ordered pair of numbers. For example, 7 — 5 does not produce the same result 
as 5 — 7. An ordered pair is denoted by (x, y), where x is the first component of the 
ordered pair and y is the second component. Order is important in an ordered pair; 
thus, the sets (1, 2} and (2, 1} are equal, but the ordered pairs (1, 2) апа (2, 1) are 
not. You are probably familiar with ordered pairs used as coordinates to locate a 
point in the plane. The point (1, 2) is different from the point (2, 1). Two ordered 
pairs (x, y) and (и, v) are equal only when it is the case that x = u and y = v. 


PRACTICE 11 


Let S = (3, 4}. List all the ordered pairs (x, y) of elements of S. 


We will generalize the properties of subtraction on the integers to define a 
binary operation ° on a set S. The symbol ° is merely a placeholder; in any spe- 
cific discussion, it will be replaced by the appropriate operation symbol, such as 
a subtraction sign. 
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DEFINITION BINARY OPERATION 
о is a binary operation on a set 5 if for every ordered pair (x, y) of elements of S, 
x ? y exists, is unique, and is a member of S. 


In other words, if ° is a binary operation on S, then for any two values x and 
y in S, x ° y produces one and only one answer, and that answer belongs to S. 
That the value x ° y always exists and is unique. It is described by saying that 
the binary operation » is well-defined. The property that x ° y always belongs to 
S is described by saying that S is closed under the operation ». Uniqueness does 
not mean that the result of a binary operation occurs only once; it means that for 
a given x and y, there is only one result. For subtraction, there are many x and y 
values such that x — y — 7, but for a given x and y, like x — 5 and y — 2, there is 
only one answer for x — y. 


Addition, subtraction, and multiplication are all binary operations on Z. For 
example, when we perform addition on the ordered pair of integers (x, y), x + y 
exists and is a unique integer. ө 


SOVATA: The logical operations of conjunction, disjunction, implication, and equivalence 
are binary operations on the set of propositional wffs. If P and Q are propositional 
у, then P ^ Q, P V О, P — О, and P < О are unique propositional wffs. e 


A candidate ° for an operation can fail to be a binary operation on a set S in 
any of three ways: (1) There are elements x, y € S for which x » y does not exist; (2) 
there are elements x, y € S for which x ° y gives more than one result; or (3) there 
are elements x, y € S for which x ° y does not belong to 5. 


| EXAMPLE | Division is not a binary operation on Z because x + 0 does not exist. ө 
| EXAMPLE 10 | Define x ° y on N by 


m Jis = 5 
YT oriras 


Then, by the first part of the definition for °, 5 ° 1 = 1, but by its second part, 
5 ° 1 = 0. Thus, ° is not well-defined on N because the result of 5 ° 1 is not 
unique. e 


Subtraction is not a binary operation on N because N is not closed under subtrac- 
tion. (For example, 1 — 10 є №.) e 
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For # to be a unary operation on a set S, it must be true that for any x € S, 
x# is well-defined and S is closed under #; in other words, for any x € S, x# exists, 
is unique, and is a member of S. We do not have a unary operation if any of these 
conditions is not met. 


Let x£ be defined by x£ = —x so that х# is the negative of x. Then £ is a unary 
operation on Z but not on N because N is not closed under 7. e 


The logical connective of negation is a unary operation on the set of propositional 
wits. If P is a propositional wff, then P’ is a unique propositional wff. ө 


From these examples it is clear that whether ° (or #) is a binary (or unary) 
operation can depend not only on its definition but also on the set involved. 


PRACTICE 12 | Which of the following candidates are neither binary nor unary operations on the given 
sets? Why not? 


х°у=х- у; S= set of all positive integers 

хоу =x + у; 5 = set of all positive rational numbers 

xoy—-x5S-R 

x° y = maximum of x and y; 5 = № 

x* = Vx; S = set of all positive real numbers 

x* = solution to equation (x^? = x; S = C п 


њоро сь 


So far, all our binary operations have been defined by means of а descrip- 
tion or an equation. Suppose S is a finite set, 5 = {x,, X2, ... , x,}. Then a binary 
operation ° on S can be defined by ann X n table, where element i, j (ith row and 
jth column) denotes x; ° x;. 


ИШЕ? ЛҮ Let S = (2, 5, 9}, and let » be defined by the table 


о 


о л № 

о O го | 
сл о mj|o 
о mM оо 


Thus, 2° 5 = 2 and 9 ° 2 = 5. Inspecting the table, we see that ° is a binary 
operation on S. e 


Operations on Sets 


Most of the operations we have seen operate on numbers, but we can also op- 
erate on sets. Given an arbitrary set S, we can define some binary and unary 
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operations on the set g(S). S in this case is called the universal set or the 
universe of discourse. The universal set defines the context of the objects 
being discussed. If S — Z, for example, then all subsets will contain only integers. 

A binary operation on g(S) must act on any two subsets of S to produce a 
unique subset of S. There are at least two natural ways in which this can happen. 


Let S be the set of all students at Silicon U. Then the members of (S) are sets of 
students. Let 4 be the set of computer science majors, and let B be the set of busi- 
ness majors. Both А and B belong to (S). A new set of students can be defined 
that consists of everybody who is majoring in either computer science or business 
(or both); this set is called the union of А and B. Another new set can be defined 
that consists of everybody who is majoring in both computer science and business. 
This set (which might be empty) is called the intersection of A and B. ө 


DEFINITION UNION AND INTERSECTION OF SETS 
Let A, B є g(S). The union of A and B, denoted by A UB, is {x|x e A orx € Bj. 
The intersection of A and B, denoted by AM B, is (x| xe Aand x € В}. 


Let 4 = (1,3, 5, 7, 9) and В = (3,5, 6, 10, 11}. Here we may consider А and В 
as members of g(N). Then 4 U B = (1,3,5,6, 7, 9, 10, 11} and AM B = {3,5}. 
Both A U B and A N B are members of g(N). ө 


PRACTICE 13 | Let A, B € g(S) for any set S. Is it always the case that 4 N B C AU B? 


We can use Venn diagrams (named for the nineteenth-century British math- 
ematician John Venn) to visualize the binary operations of union and intersection. 
The shaded areas in Figures 4.1 and 4.2 illustrate the set that results from perform- 
ing the binary operation on the two given sets. 


S S 


AUB ANB 
Figure 4.1 Figure 4.2 


We will define one unary operation on g(S). 
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DEFINITION COMPLEMENT OF A SET 
For a set A € o(S), the complement of A, 4’, is {x|x € Sand x € A}. 


PRACTICE 14 | Illustrate A’ in a Venn diagram. 


In Section 1.1 we discussed the use of logical connectives to formulate Web search 
queries. А query such as 


"used cars" AND (Ford OR Buick) AND NOT trucks 


is asking the search engine to return a set of pages (or more properly, a set of links 
to pages). If 


U — set of used car pages 
F — set of Ford pages 

B — set of Buick pages 

T — set of truck pages 


then 
TX ECT MT 


represents the set of Web pages that is the desired result of the query. ө 


Another binary operation on sets A and В in (S) is set difference: 4 — В = 
{x|x € A and x € B}. This operation can be rewritten as A — В = (x|x € A and 
x € B’} and, finally, as A — B = A(B'. 


PRACTICE 15 | Illustrate А — B in a Venn diagram. 


Two sets A and B such that AM B = © are said to be disjoint. Thus, 4 — B 
and В — А, for example, are disjoint sets. 


|. ExawPLE 18 | 18 Let 


A = {x|x is an even nonnegative integer} 
В = {х|(Зу)(у € Nand x = 2y + 1)} 
C = {x|Gy)(y є Nand x = 4y)} 


be subsets of N. Because B represents the set of nonnegative odd integers, A and B 
are disjoint sets. Also, every nonnegative integer is either even or odd, so 4 U B = №. 
These two facts also tell us that A’ = B. Every multiple of 4 is an even number, 
so C is a subset of A, from which it follows that A U C = A. C is in fact a proper 
subset of A, and A — C = í(x|(3y)(y € Nand x = 4y + 2)}. e 
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PRACTICE 16 | Let 


A = {1,2,3, 5, 10} 
В = {2,4,7,8, 9} 
C = {5, 8,10} 


be subsets of S = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. Find 
a. AUB БЕС c. B'TÓ0(4U C) п 


We will define one final operation using elements of (S). 


* DEFINITION CARTESIAN PRODUCT 
Let A and B be subsets of S. The Cartesian product (cross product) of A and B, 
denoted by A X B, is defined by 


AX B= {(x,y)|x eA andy EB} 


Thus, the Cartesian product of two sets A and B is the set of all ordered pairs 
whose first component comes from A and whose second component comes from B. 
The cross product is not a binary operation on g(S). Although it acts on an or- 
dered pair of members of g(S) and gives a unique result, the resulting set is not, 
in general, a subset of S. The elements are not members of S but ordered pairs of 
members of S. So the resulting set is not a member of g(S). The closure property 
for a binary operation fails to hold. 

Because we will often be interested in the cross product of a set with itself, we 
will abbreviate А Х A as A’; in general, we use A” to mean the set of all ordered 
n-tuples (x1, x», ... , X,) of elements of A. 


PRACTICE 17 | Let A = (1,2) and B = {3,4}. 
a. Find A X B. b. Find B X A. c. Find 42. d. Find АЎ. H 


Set Identities 


There are many set equalities involving the operations of union, intersection, 
difference, and complementation that are true for all subsets of a given set S. 
Because they are independent of the particular subsets used, these equali- 
ties are called set identities. Some basic set identities follow. The names and 
forms of these identities are very similar to the tautological equivalences of 
Section 1.1 (check back and compare). We will see in Chapter 8 that this 
similarity is not a coincidence. 
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Basic SET IDENTITIES 


la AUB=BUA Ib. A08 » B A (commutative 
properties) 
2a. (AUB)UC=AU(BUC) 2b. (ANB)NC=AN(BNC) (associative 
properties) 
За. AU(BNC) = (AUB)N(AUC) 3b. AN (BUC) = (A0 B)U (AC) (distributive 
properties) 
4a. АЧФ — A 4b. ANS=A (identity 
properties) 
5а. AUA'=S 5b. AN A’= Ø (complement 
properties) 


(Note that 2a allows us to write 4 U B U C with no need for parentheses; 2b allows 
us to write AN B N C) 


Let’s prove identity 3a. We might draw Venn diagrams for each side of the equa- 
tion and see that they look the same. However, identity 3a is supposed to hold for 
all subsets 4, B, and C, and whatever one picture we draw cannot be completely 
general. Thus, if we draw А and B disjoint, that’s a special case, but if we draw А 
and B not disjoint, that doesn’t take care of the case where A and B are disjoint. 
To do a proof by Venn diagrams requires a picture for each possible case, and the 
more sets involved (4, B, and C in this problem), the more cases there are. To avoid 
drawing a picture for each case, let’s prove set equality by proving set inclusion in 
each direction. Thus, we want to prove 


AU(BNC) C(AUB)N(AUC) 
and also 
(AUB)N(AUC)CAU(BNC) 


To show that 4 U (B N C) C (4 U B) N (4 UC), we let x be an arbitrary member 
of AU (B f1 C). Then we can proceed as follows: 


xeAU(B(OC)»5xeAorx e(Bf|C) 
эх eAor(xe Bandx e C) 
—(xeAorxe B)and(xeAorx e C) 
— x € (4 U B) and x e (AUC) 
>xE(AUB)N(AUC) 


To show that (4 U B)M(4 UC) C AU (BN C), we reverse the above argument. Ө 


PRACTICE 18 | Prove identity 4a. 
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Once we have proved the set identities in this list, we can use them to prove 
other set identities. Just as the tautological equivalences of propositional logic 
represent recipes or patterns for transforming wffs, the set identities represent 
patterns for transforming set expressions. And, as with tautologies, the set identity 
can be applied only when the set expression exactly matches the pattern. 


| EXAMPLE20 | We can use the basic set identities to prove 


[AU Bncyna4ugm]ncynancy-o 
for A, B, and C any subsets of S. In the following proof, the number to the right is 
that of the basic set identity used to validate each step. The first step uses identity 
2b because the expression 


[4U (Bh C] q^ UB €] 0 (62 €) 


matches the right side of 2b, A N (B N C) where A is [A О (B N C)], B is 
[4' U (B Г\ C)], and C is (BM Су 


[4U (Bn Cyn q4' U (BNIN BNC’) 
-((qaUgGncyjnp'ugncopnüancy (2) 


REMINDER —-([Bnc)uA4]n[8nc)UA4'pninc) (la twice) 

You must match the = [1 0)U (4 4)]n ( ГЇ (3a) 

pattern of a set identity in =[BNC)US]NBNC)' (5b) 

order to use it. In the set ES p 

identities, A, B, and C can = (B'1O) (f) (4a) 

represent any sets. = (5b) ө 


The dual for each set identity in our list also appears in the list. The dual is 
obtained by interchanging U and ( and interchanging S and (Z. The dual of the 
identity in Example 20 is 


[АС (8U C)]JU ([4' 1(BUU C)])U (BU C)) = 5 
which we could prove true by replacing each basic set identity used in the proof of 


Example 20 with its dual. Because this method always works, any time we have 
proved a set identity by using the basic identities, we have also proved its dual. 


PRACTICE 19 


a. Using the basic set identities, establish the set identity 
[(CON(AUB)UL[AUB)NC]=AUB 


(A, B, and C are any subsets of S.) 
b. State the dual identity that you now know is true. H 
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Table 4.1 summarizes the approaches to proving set identities. 


TABLE 4.1 
Method Comment 
Draw a Venn diagram Not a good plan because no one diagram fits all cases 


and it will not prove the general identity. 


Establish set inclusion Take an arbitrary member of one side and show it 


in each direction belongs to the other side, and conversely. 
Use already proved Be sure to match the pattern of the identity you want to 
identities use. 


Countable and Uncountable Sets 


In a finite set S, we can always designate one element as the first member, s;, 
another element as the second member, s, and so forth. If there are k elements in 
the set, then these can be listed in the order we have selected: 


$1, 82, -++ > Sk 


This list represents the entire set S. The number of elements in a finite set is the 
cardinality of the set, so this would be a set of cardinality k, denoted by |S] = k. 

If the set is infinite, we may still be able to select a first element s;, a second 
element s5, and so forth, so that the list 


$1, $2, 83, ... 


represents all elements of the set. Every element of the set will eventually appear 
in this list. Such an infinite set is said to be denumerable. Both finite and denu- 
merable sets are countable sets because we can count, or enumerate, all of their 
elements. Being countable does not mean that we can state the total number of 
elements in the set; rather, it means that we can say, “Неге is a first one,” “Неге 
is a second one,” and so on, through the set. There are, however, infinite sets that 
are uncountable. In an uncountable set, the set is so big that there is no way to 
count out the elements and get the whole set in the process. Before we prove that 
uncountable sets exist, let's look at some denumerable (countably infinite) sets. 


The set N is denumerable. 
To prove denumerability, we need only exhibit a counting scheme. For the set 
N of nonnegative integers, it is clear that 
Ооз 


is an enumeration that will eventually include every member of the set. ө 


PRACTICE 20 | Prove that the set of even positive integers is denumerable. 
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The set Q” of positive rational numbers is denumerable. 

We assume that each positive rational number is written as a fraction of 
positive integers. We can write all such fractions having the numerator 1 in one 
row, all those having the numerator 2 in a second row, and so on: 


11 1/2 1/3 14 15 
2A 202 2/5 OBL Ж 
ЭЛ 92 Ma 3 SS 
4/1 4/2 4/3 4/4 4/5 


To show that the set of all fractions in this array is denumerable, we will thread 
an arrow through the entire array, beginning with 1/1; following the arrow gives 
an enumeration of the set. Thus the fraction 1/3 is the fourth member in this 
enumeration: 


4/2 4/3 44 


Therefore the set represented by the array is denumerable. Note that our path 
through the array must “spread out" from one corner. If we begin to follow just the 
first row or just the first column, for example, we will never finish it to get on to 
other rows (or columns). 

To obtain an enumeration of ©“, we use the enumeration of the set shown 
but eliminate any fractions not in lowest terms. This avoids the problem of listing 
both 1/2 and 2/4, for example, which represent the same positive rational. The 
enumeration of Q* thus begins with 


ЙК ОЛ П WE}, AVAL, occ 


For example, we have eliminated 2/2, which reduces to 1/1. e 


PRACTICE 21 | What is the 11th fraction in the above enumeration? What is the I Ith positive rational? | 


Now let’s show that there is an uncountable (not countable) infinite set. The 
proof technique that seems appropriate to prove that set A does not have property 
B is to assume that A does have property B and look for a contradiction. The 
proof in Example 23 is a very famous proof by contradiction known as Cantor’s 
diagonalization method, after Georg Cantor, the nineteenth-century German 
mathematician known as the “father of set theory.” 
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We will show that the set of all the real numbers between 0 and 1 is uncountable. 
We will write such numbers in decimal form; thus any member of the set can 
be written as 


O 


Now let us assume that our set is countable. Therefore some enumeration of the 
set exists. A number such as 0.24999999 ... can be written in alternative form as 
0.2500000 ... (see Exercise 102 for an explanation of why these are alternative 
representations of the same number). To avoid writing the same element twice in 
our enumeration, we will choose (arbitrarily) to always use the former representa- 
tion and not the latter. We can depict an enumeration of the set as follows, where 
d; is the jth decimal digit in the ith number in the enumeration: 


0.414413 
0.4493 
0.d31d32d33 


We now construct a real number p = 0.рр›р; ... as follows: p; is always chosen 
to be 5 if d; z£ 5 and 6 if d; = 5. Thus p is a real number between 0 and 1. For 
instance, if the enumeration begins with 


0.342134... 
0257001 
0.546122... 
0.716525... 


then dj, = 3, dy = 5, d33 = 6, and dy, = 5, so ру = 5, po = 6, p; = 5, and p; = 6. 
Thus p begins with 0.5656 ... . 

If we compare p with the enumeration of the set, p differs from the first number 
at the first decimal digit, from the second number at the second decimal digit, from 
the third number at the third decimal digit, and so on. 


Therefore p does not agree with any of the representations in the enumeration. Fur- 
thermore, because p contains no 05 to the right of the decimal, it is not the alterna- 
tive representation of any of the numbers in the enumeration. Therefore p is a real 
number between 0 and 1 different from any other number in the enumeration, yet 
the enumeration was supposed to include all members of the set. Here, then, is the 
contradiction, and the set of all real numbers between 0 and 1 is indeed uncount- 
able. (You can see why this proof is called a “diagonalization method.”) ө 


SECTION 4.1 


TECHNIQUES 
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Although it is interesting and perhaps surprising to learn that there are 
uncountable sets, we are usually concerned with countable sets. A computer, of 
course, can manage only finite sets. In the rest of this chapter, we too, limit our 
attention to finite sets and various ways to count their elements. 


REVIEW 


MAIN IDEAS 


* Describe sets by a list of elements and by a  * Sets are unordered collections of objects that can 


characterizing property. 


be related (equal sets, subsets, etc.) or combined 


* Prove that one set is a subset of another. (union, intersection, etc.). 

* Find the power set of a set. * Certain standard sets have their own notation. 

e Check that the required properties for a binary ог ° The power set ofa set with n elements has 2" elements. 
unary operation are satisfied. * Basic set identities exist (in dual pairs) and can be 

W) Form new sets by taking the union, intersection, used to prove other set identities; once an identity is 
complement, and cross product of sets. proved in this manner, its dual is also true. 

QJ Prove set identities by showing set inclusion in ° Countable sets can be enumerated, and uncountable 
each direction or using the basic set identities. sets exist. 


* Demonstrate the denumerability of certain sets. 
* Use the Cantor diagonalization method to prove 
that certain sets are uncountable. 


EXERCISES 4.1 


1. 


Let S = (2, 5, 17, 27}. Which of the following expressions are true? 


а. 5є 5 b. 2+5є5 с. бє d. SES 
. Let B = {x|x € Q and —1 < x < 2}. Which of the following expressions аге true? 
a. 0EB b. -1eB c. -084€ В d. V2eB 
. How many different sets are described here? What are they? 
(2, 3, 4} Ø 
{x |x is the first letter of cat, bat, or apple} {x |x is the first letter of cat, bat, and apple} 
{x|xeNand2 =x = 4} {2, a, 3, b, 4, с} 
{a, b, с} {3,4,2} 


. How many different sets are described here? What are they? 


{x|x = F(n) ^n € {5, 6, 7}} [F(n) is a Fibonacci number] 

{x |x|24} [x divides 24] 

{1, 2, 3, 4} 

{5, 8, 13} 

{x|x ENAO <x = 4} 

{x |x € ф(5)} [Ф(и) is the Euler phi function] 

{12, 2, 6, 24, 8, 3, 1, 4} 

{x |x is a digit in the decimal equivalent of the Roman numeral MCCXXXIV} 


ы 


ы 


П. 
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. Describe each of the following sets by listing its elements: 


a. {x|x e Nand x? < 25} 

b. {x|x e Nand x is even and 2 € x < 11} 

c. {x|x is one of the first three U.S. presidents} 

d. {x|x eR and x = –1} 

e. {x|x is one of the New England states} 

f. {х|хє Zand|x|< 4} (|x|denotes the absolute value function) 


. Describe each of the following sets by listing its elements: 


а. {х|хє Nandx? — 5x + 6 = 0) 
b. {х|хє Rand x? = 7} 
c. {x| хє Nand x? — 2x — 8 = 0} 


. Describe each of the following sets by giving a characterizing property: 


а. {1, 2, 3,4, 5} 

b. {1, 3,5, 7,9, 11, ...} 

c. {Melchior, Gaspar, Balthazar} 

d. {0, 1, 10, 11, 100, 101, 110, 111, 1000, ...} 


. Describe each of the following sets: 


a {x|x e Nand (Aq\(q є (2, 3} and x = 2q)} 
b. {x|x € N and (3y)(3z)(» є (0, 1} and z E (3, 4} andy < x <z)} 
c. {х|х e Nand (Му)(у even > x # у)} 


. Given the description of a set A as A = (2, 4,8 ...}, do you think 16 є A? 
. What is the cardinality of each of the following sets? 


a. S = la, {a, {a}}} 

b. S= {{a}, (333 

с. $ = (y 

d. S = (a, 12), 2] 

e. 8 — {@, 10, 103, 10, {D, (00311) 


Let 
А = {2,5,7} 
B= {1,2,4,7,8} 
С= {7,8} 


Which of the following statements are true? 
a. 5 СА а. 7eB 

b. CCB e. {2,5} CA 

c. OEA f. OCC 


. Let 


А = {x| xeNand 1 «€ x < 50} 
В = {x| xeRand1<x< 50} 
С = {x| хє Zand|x|= 25} 


14. 


15; 


16. 
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Which of the following statements are true? 


a. ACB e. V3EB 
b. 17є4 f. {0,1,2} CA 
c. ACC в. eB 
d. —40 € C h. {x|x € Zand x? > 625} CC 
. Let 
R = {1,3, т, 4.1, 9, 10} Т = 11,3, 7} 
S = {11}, 3, 9, 10} U= {{1, 3, т}, 1} 


Which of the following statements are true? For those that аге пої, why not? 
а. SCR e. {1} CT 
Б. 1eER f (cs 


с. les g. TCR 

d ICU 

Let 
R= {1,3, т, 4.1, 9, 10} T= {1, 3, т} 
S= 1115,3,9, 10} U= {{1,3, T}, 1} 


Which of the following statements are true? For those that are not, why not? 
a. {leS e. TER 


b OCS f TCR 

с. TCU g. SC {1, 3, 9, 10} 
а. TEU 

Let 


А = (a, {а}, {{a}}} В = {а} C = {O, {а, {a}}} 


Which of the following statements аге true? For those that аге not, where do they fail? 


а. BCA f. (a, {a}} eA 
b. BEA g. {a, {a}} CA 
c. CCA h BCC 

d О.С i Ча} СА 
e. ec 

Let 


А= {00,40,40} B= C= D= {O, {Ot} 


Which of the following statements are true? For those that are not, where do they fail? 


a. CCA f. C=B 
‚СєА g. CCD 
.Be€A h. CeD 
‚ВСА i DCA 
.Bec 


о co c 


17. 


20. 
21. 


22. 


23. 
24. 
25; 


26. 


27. 
28. 
29; 
30. 
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Let 
A = {(x, y) |Œ, у) lies within 3 units of the point (1, 4)} 
and 
В = {(х,у)|(х 0 + (у — 4 = 25} 


Prove that A C В. 


. Let 


А = {x|x eRand x? — 4x + 3 < 0) 
and 
В = (x|xeRand0 «x < 6} 


Prove that 4 С В. 


. Program QUAD finds and prints solutions to quadratic equations of the form ах? + bx + с = 0. Program 


EVEN lists all the even integers from —2n to 2n. Let О denote the set of values output by QUAD and Е 
denote the set of values output by EVEN. 


a. Show that fora = 1, b = —2,c = —24, and n = 50, О С Е. 

b. Show that for the same values of a, b, and c, but a value for n of 2, Oz E. 
Let A = {x|cos(x/2) = 0) and B ={x|sin x = 0}. Prove that A C B. 

Which of the following statements are true for all sets А, В, and С? 

a. IfA C B and B C A, then A = B. d. Ge (2 


b. (21) =O e. ОСА 

c. {©} = {0} 

Which of the following statements are true for all sets А, В, апа С? 
a. CEA 

b. 42) = (OD 


c. If AC Band B C C, then AC C. 

d. If A # B and B = C, then A = C. 

e. If € Band B ¢ С, then A ¢ C. 

Prove that if 4 C B and B C C, then A C C. 
Prove that if A' C B', then B C A. 


Prove that for any integer n = 2, a set with n elements has n(n — 1)/2 subsets that contain exactly two 
elements. 


Prove that for any integer л > 3, a set with n elements has n(n — 1)(n — 2)/6 subsets that contain exactly 
three elements. (int: Use Exercise 25.) 


Find @(5) for S = {a}. 

Find g(S) for S = (a, bj. 

Find g(S) for S = (1, 2, 3, 4). How many elements do you expect this set to have? 
Find @(5) for S = {8}. 


31. 
32. 
33, 
34. 
35. 
36. 
37. 


3 


оо 


39; 


40. 


41. 


42. 
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Find (S) for S = {@, {GO}, (2, {Ott}. 

Find 0(0(5)) for S = {a, b}. 

What can be said about A if 9(4) = 10, {x}, {y}, {х,у}}? 

What can be said about A if ф(А) = 1, {а}, ( (a)? 

Prove that if g(A) = p(B), then A = B. 

Prove that if A C B, then g(A) C (В). 

Solve for x and y. 

a. (y, x + 2) = (5, 3) b. (2x, y) = (16, 7) с. (2x —y,x + у) = (-2, 5) 


‚а. Recall that ordered pairs must have the property that (x, у) = (и, v) if and only if x = u and y = v. 


Prove that {{х}, їх, y33 = {{u}, {u, v}} if and only if x = u and y = v. Therefore, although we know that 
(x, y) 7 (x, у}, we can define the ordered pair (х, y) as the set {{x}, (x, у}}. 


b. Show by an example that we cannot define the ordered triple (x, у, z) as the set { {х}, їх, у}, іх, y, z}}. 


Which of the following candidates are binary or unary operations on the given sets? For those that are not, 
where do they fail? 


а.хоу=х+1;$=\ 


Б. хоу= х +у- 1; 5 = № 
x — lifxis odd 

с. хоу = КОШ S=Z 
x if x is even 

d. x# = ах; S = К 

Which of the following candidates are binary or unary operations on the given sets? For those that are not, 

where do they fail? 

x exec 

b. 


o 


$= {1,2,3} 


on ~ 

о N aj- 
A о юр 
a A UJO 


с. хоу = that fraction, x or y, with the smaller denominator; 5 = set of all fractions. 


d. x° у = that person, x or y, whose name appears first in an alphabetical sort; S = set of 10 people with 
different names. 


Which of the following candidates are binary or unary operations on the given sets? For those that are not, 
where do they fail? 
1/х if x is positive 
а. х°у = ore : 
1/(—x) ifx is negative 


b. x° y = xy (concatenation); S = set of all finite-length strings of symbols from the set (p, q, r} 
с. x# = |x| where |x| denotes the greatest integer less than or equal tox; $ = К 
а. хоу = min(x, у); S=N 


Which of the following candidates are binary or unary operations on the given sets? For those that are not, 
where do they fail? 


а. хоу = greatest common multiple of x and y; 5 = № 
b. хоу =x + y; S = the set of Fibonacci numbers 


43. 


44. 


45. 


46. 


4T. 


48. 
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c. х# = the string that is the reverse of x; 5 = set of all finite-length strings of symbols from the set (p, q, r} 
d.xoy=xt+y;S=R-Q 

How many different unary operations can be defined on a set with л elements? (Hint: Think about filling 
in a table.) 


How many different binary operations can be defined on a set with n elements? (Hint: Think about filling 
in a table.) 


We have written binary operations in infix notation, where the operation symbol appears between the two 
operands, as in A + B. Evaluation of a complicated arithmetic expression is more efficient when the op- 
erations are written in postfix notation, where the operation symbol appears after the two operands, as in 
AB+. Many compilers change expressions in a computer program from infix to postfix form. One way to 
produce an equivalent postfix expression from an infix expression is to write the infix expression with a 
full set of parentheses, move each operator to replace its corresponding right parenthesis, and then elimi- 
nate all left parentheses. (Parentheses are not required in postfix notation.) Thus, 


A*B+C 
becomes, when fully parenthesized, 
((A * В) + C) 


and the postfix notation is AB * С+. Rewrite each of the following expressions in postfix notation: 
a. (A + B) *(C — D) 

b. A ** B — C * D (**denotes exponentiation) 

c. A* C + ВС + D * В) 

Evaluate the following postfix expressions (see Exercise 45): 

a 24*5 + Б. 51+ 2/1 – с. 34+ 51 - * 

Let 


A= 41 Py G73} 
B= §r,t, v} 
C= {p,s,t, и} 


be subsets of S = {p, q,r, 5, t, u, v, w}. Find 
a BAC b. AUC c. C' d. АВГ С 
Let 


A= 1р0) 
B= §r,t, v} 
C= {p,s,t, и} 


be subsets of S = {p, q,r, 5, t, u, v, w}. Find 
a В- С b. (AU B) c. AX B d. (AU BIO C’ 


49. 


50. 


91. 


32. 


Let 
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A = {2,4, 5,6,8} 
B= {1,4,5, 9} 
C= {x|xeZand2 =x < 5} 


be subsets of S = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}. Find 


a. AUB 
b. ANB 
с. AMC 
d. BUC 


Let 


e.A—B 
f. A’ 
g. ANA’ 


A= 12,4, 5,6, 8} 
B= {1,4,5, 9} 
C= {x|xeZand2 =x < 5} 


be subsets of S = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}. Find 


а. (AN BY e. (C'U BY 
b. С-В fBxC 
с. (СПАВ) ЈА’ о. (В – АОС 


d. (B — Ay N (4 — B) 


Let 


A= (a, {a}, {taps} 
В = {©, {a}, ta, {a}}} 
C= {a} 


be subsets of S = (2, a, {a}, {{a}}, fa, {a}}}. Find 


а. AMC 
b BNC 
c. AUB 


Let 


Find 
a. ANB 


d. On Е A (IB 
е. (BUC)NA g. {O} NB 


А = {x|x is the name of a former president of the United States} 
B= {Adams, Hamilton, Jefferson, Grant} 
C = {х |х is the name of a state} 


b. ANC с. ВАС 


53. 


54. 


59: 


56. 
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Let S = A X B where A = {2,3,4} and B = {3, 5}. Which of the following statements are true? 
a ACS d. (5,4)eS 

b.3esS е CS 

с. 3,3)ES f (2,5) CS 

Let 


A = {x|x is a word that appears before dog in an English language dictionary} 
B = {x|x is a word that appears after canary in an English language dictionary} 
C = {х |х is a word of more than four letters} 


Which of the following statements are true? 


a BCC 
b. AU B = {х |х is a word in an English language dictionary} 
c. cat e B(1C' 


а. bamboo € A — B 


Consider the following subsets of Z: 


A = {x|(Ay)\(y € Z and y = 4 and x = 3y)} 


B= (ix| Gy)» € Zand x = 2y)} 
C = {х|хє Zand |x| x 10} 


Using set operations, describe each of the following sets in terms of A, В, and C. 

a. set of all odd integers 

b. (—10, —8, —6, —4, —2, 0, 2, 4, 6, 8, 10} 

c. {x|Gy)(y € Z and y = 2 and x = 6y)} 

d. 1—9, —7, —5, —3, —1,1,3,5, 7,9] 

е. {х| (Ө) € Z and y = 5 and x = 2y + 1)} U {x| Ey) € Z and y = —5 and x = 2y — 1)} 


Let 


А = ix|xeRand1 €x x 3j 
В = {x|xeRand2 =x = 5} 


Using set operations, describe each of the sets shown in terms of А апа В. 


c 

A 
Ne 
We 
Y 
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57. Consider the following subsets of the set of all students: 


А = set of all computer science majors 
B — set of all physics majors 

C — set of all science majors 

D = set of all female students 


Using set operations, describe each of the following sets in terms of А, B, C, and D: 
a. set of all students not majoring in science 
b. set of all female physics majors 
c. set of all students majoring in both computer science and physics 
d. set of all male computer science majors 
58. Consider the following subsets of the set of all students: 


A = set of all computer science majors 
B — set of all physics majors 

C — set of all science majors 

D = set of all female students 


Using set operations, describe each of the following sets in terms of А, B, C, and D: 

a. set of all male students who are not physics majors 

b. set of all science majors who are not computer science majors 

c. set of all students who are female or science majors 

d. set of all students who are science majors but are neither computer science nor physics majors 


59. Write the set expression for the desired results of the Web search query for sites pertaining to dogs that are 
not retrievers (Exercise 31, Section 1.1). Assume that D — set of dog pages, R — set of retriever pages. 


60. Write the set expression for the desired results of the Web search query for sites pertaining to oil paintings 
by Van Gogh or Rembrandt but not Vermeer (Exercise 32, Section 1.1). Assume that O — set of oil paint- 
ing pages, G = set of Van Gogh pages, Ё = set of Rembrandt pages, V = set of Vermeer pages. 

61. Write the set expression for the desired results of the Web search query for sites pertaining to novels or 
plays about AIDS (Exercise 33, Section 1.1). Assume that N — set of novel pages, P — set of play pages, 
A — set of AIDS pages. 

62. Wrte the set expression for the desired results of the Web search query for sites pertaining to coastal 
wetlands in Louisiana but not in Alabama (Exercise 34, Section 1.1). Assume that C — set of coastal 
wetlands pages, L = set of Louisiana pages, А = set of Alabama pages. 


63. Which of the following statements are true for all sets А, B, and C? 


а. AUA- A d. (4)! = A 
b. BnB-B e A-B-(B-— Ay 
c. (ANB) = A' Г\В' f (4—В)П(В—-А)=@ 


64. Which of the following statements are true for all sets А, В, апа С? 
aIfAMN B = Ø, then AC B. d. Ont) =O 
b BXA-AXB e (4— B)U(B-C)=A-C 
c. OX A — ØD f£ (4—-C)Qn(4—-B)-A-(BUC) 


65. 


66. 


67. 


68. 


69. 
70. 
71. 
72. 
73. 
74. 
T3: 
76. 
TI: 
78. 
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Which of the following statements are true for all sets 4, B, and C? 

a. AU (B X C) = (AUB) X (AU C) 

b. AX(BAC)=(4XB)N(4XC) 

с AX Ø= ØD 

Which of the following statements are true for all sets 4, B, and C? 

a. p(4) X e(4) = e(4^) 

b AX(BXC)-(AXB)XC 

c. (A X B) = ф(А) X p(B) 

For each of the following statements, find general conditions on sets А and B to make the statement true: 
а. АОВ = А c. AUO = Ø е. AUBCANB 

b. ANB=A d B-4=Ø f AXB=BXA 

For any finite set S, | S| denotes the number of elements in S. If| A | = 3 and |B | = 4, find 
а. [АХ В| 

b. | А? | 

c. [В| 

d. the maximum possible value for | A N B | 

e. the minimum possible value for | 4 U B | 

Prove that (4 ( B) C A where A and B are arbitrary sets. 

Prove that А C (А U B) where A and В are arbitrary sets. 

Prove that (А) N ф(В) = ф(А N B) where A and В are arbitrary sets. 

Prove that (А) U ф(В) C e(A U B) where A and В are arbitrary sets. 

Prove that if 4 U B = A — B, then B = ©. (Hint: Do a proof by contradiction.) 

Prove that if (4 — B) U (B — А) = A U B, then A N B = ©. (Hint: Do a proof by contradiction.) 
Prove that if C C B — A, then A N C = Ø. 

Prove that if (4 — B) UB = A, then B C A. 

Prove that 4 C B if and only if A( B' = ©. 

Prove that (4 N B) UC = AN (ВОС) ifand only if C C A. 


Exercises 79 and 80 refer to a binary operation on sets called the symmetric difference, which is defined by 
АФВ = (A — В) О(В — А). 


79. а. Draw a Venn diagram to illustrate А Ф B. 


b. For A = (3, 5, 7, 9) and B = (2, 3, 4, 5, 6), what is А Ф B? 
c. Prove that А © B = (4 UB) — (AM B) for arbitrary sets А and В. 


80. a. For an arbitrary set A, what is А ® A? What is © @ A? 


81. 


82. 


b. Prove that 4 © B = B Ф A for arbitrary sets A and B. 
c. For any sets А, В, and C, prove that (А © B) © C = A @© (B © C). 


Verify the basic set identities on page 234 by showing set inclusion in each direction. (We have already 
done 3a and 4a.) 


A and В are subsets of a set S. Prove the following set identities by showing set inclusion in each direction. 
а. (АОВ) SA'N B’ 


| De Morgan’s laws 
b. (AB) =A'UB' 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 
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c. АО(ВПА) = А 

а. (AN B) UB=A'UB 

е. (AN B)JU(ANB')=A 

f£ [AN(BUC)’ =A UBNC) 

A, B, and C are subsets of a set S. Prove the following set identities using the basic set identities listed in 
this section. Give a reason for each step. State the dual of each of these identities. 

а. (AUB)N(AUB')=A4 

b. (AN CINBJUL[ANCINB])UANC)=S 

с. (AUC)N[ANBUC'NB)]=ANB 

A is a subset of a set S. Prove the following set identities: 

a АЈА = A а. 405 = 5 

bANA=A e.(4)! =A 

c AND = D 

A, B, and C are subsets of a set S. Prove the following set identities by using previously proved identities, 
including those in Exercises 82—84. Give a reason for each step. 

а. АП(ВОЈА) = ВПА 

b. (4UB)—-C-(A-C)U(B- C) 

с. (4—B)—-C-(A-C)-B 

A, B, and C are subsets of a set S. Prove the following set identities by using previously proved identities, 
including those in Exercises 82—84. Give a reason for each step. 

а. [W U B)14']' =A 

b (4- В - С= (4А - С) – (В – С) 

с. 4 (А — В) = АП В 

а. (AUB) — (4N B) = (4 – B)U (B — А) 

The operation of set union can be defined as ап n-ary operation for any integer n > 2. 


a. Give a definition similar to that for the union of two sets for 4, ОА, U ++- U A, 
b. Give a recursive definition for 4, U A, U ++- U Ap 


Using the recursive definition of set union from Exercise 87(b), prove the generalized associative property 
of set union, which is that for any n with n = 3 and any p with 1 - p - n — 1, 


(А, UA, U s "U Ap) U (Ap U A,,5U ns -UA,) =A,UA,U er "UA, 


The operation of set intersection can be defined as an n-ary operation for any integer n = 2. 
a. Give a definition similar to that for the intersection of two sets for А, (1 45 П: - - NA, 
b. Give a recursive definition for 4; (1 45 ON ++: N A, 


Using the recursive definition of set intersection from Exercise 89(b), prove the generalized associative 
property of set intersection, which is that for any n with n = 3 and any p with 1 =р —- n — І, 


(A4 MALIN? CO AC GA aa CASS TTA) = А ПА, П: 0A 


п 
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91. Prove that for subsets А, 45, ... , 4, and B ofa set S, the following generalized distributive properties hold, 
where n > 2. (See Exercises 87 and 89.) 
a. ВО(А ПА П: 04,) = (BU AJOY(BU A59) 0 ---(Y(BU A) 
b. ВГ\(А, СО A,U---U А) = (BNA) U (BNA) U --- U(BTYA,) 

92. Prove that for subsets А, А, ... , A, of a set S, the following generalized De Morgan's laws hold, where 
n > 2. (See Exercises 82, 87, and 89.) 
a. (AAU A;U---UA)' = АППА П: OA, 
b. (4,0450 04,9 = AUASU- UA 

93. The operations of set union and set intersection can be extended to apply to an infinite family of sets. We 


may describe the family as the collection of all sets А,, where i takes on any of the values of a fixed set Z. 
Here, Z is called the index set for the family. The union of the family, U ic, Ah is defined by 


2,4 = {х |х is a member of some А;} 


The intersection of ће family, NQ ur 18 defined by 


Г\ 4: = {х |х is a member of each Aj]. 
a. Let = (1,2, 3, ...}, and for each i є J, let A; be the set of real numbers in the interval (— 1/7, 1/i). 
What is UJ, 4? What is (1,4? 
b. Let = 11, 2, 3 ...$, and for each i € I, let A; be the set of real numbers in the interval [—1/i, l/i ]. 
What is | Ј _ 4,? What is ( ^], 4? 
iei іЄІ 


94. According to our use of the word “set,” if C is a subset of the universal set S, then every element of S either 
does or does not belong to C. In other words, the probability of a member x of S being a member of C is 
either 1 (x is a member of C) or 0 (x is not a member of C). C is a fuzzy set if every x € S has a probability 
р, 0 = р = 1, of being a member of C. The probability p associated with x is an estimate of the likelihood 
that x may belong to C when the actual composition of C is unknown. Set operations can be done on fuzzy 
sets as follows: If element x has probability p, of membership in C and probability p, of membership in D, 
then the probability of x being a member of CUD, C N D, and С” is, respectively, max(p,, p2), min(pi, р»), 
and | — p,. (If we consider the statements x є C and x є D as propositional wffs A and B, respectively, with 
certain probabilistic truth values, then the probability of x € CU D is the probability that А V B is true. The 
rules for fuzzy set operations then parallel the rules for fuzzy logic, discussed in Exercise 54, Section 1.1.) 

Let Sbe a set of possible disease-causing agents, S = {genetics, virus, nutrition, bacteria, environment}. 
The fuzzy sets AIDS and ALZHEIMERS are defined as AIDS = {genetics, 0.2; virus, 0.8; nutrition, 0.1; 
bacteria, 0.4; environment, 0.3} and ALZHEIMERS = {genetics, 0.7; virus, 0.4; nutrition, 0.3; bacteria, 
0.3; environment, 0.4}. 


a. Find the fuzzy set AIDS U ALZHEIMERS. 
b. Find the fuzzy set AIDS N ALZHEIMERS. 
c. Find the fuzzy set (AIDS)’. 


Exercises 95 and 96 complete the proof, begun in Section 2.2, that the second principle of induction, the first 
principle of induction, and the principle of well-ordering are all equivalent. 


95. The principle of well-ordering says that every nonempty set of positive integers has a smallest member. 
Prove that the first principle of mathematical induction, that is, 


1. P(1) is true 


2. (УЮ[Р(Ю) true > P(k + 1) true] | — P(n) true for all positive integers n 


96. 


97. 
98. 
99. 
100. 
101. 
102. 


103. 


104. 


105. 
106. 
107. 
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implies the principle of well-ordering. (Hint: Assume that the first principle of mathematical induction 
is valid, and use proof by contradiction to show that the principle of well-ordering is valid. Let T be a 
nonempty subset of the positive integers that has no smallest member. Let P(n) be the property that every 
member of 7'is greater than л.) 


Prove that the principle of well-ordering (see Exercise 95) implies the second principle of mathematical 
induction. Hint: Assume that the principle of well-ordering is valid, and let P be a property for which 


1.' P(1) is true 
2.' (VK)[P(r) true for all r, 1 = r  k — Р(К + 1) true] 


Let T be the subset of the positive integers defined by 
T = {t| P(t) is not true} 


Show that T is the empty set. 

Prove that the set of odd positive integers is denumerable. 

Prove that the set Z of all integers is denumerable. 

Prove that the set of all finite-length strings of the letter a is denumerable. 
Prove that the set of all finite-length binary strings is denumerable. 

Prove that the set Z X Z is denumerable. 


In Example 23, the claim was made that 0.249999999 ... is the same number as 0.250000000 ... . The first 
representation is a nonterminating decimal, and a calculus-type argument can be made that “in the limit" 
these are the same values. Here is a slightly different argument: 


a. Let n = 0.249999 .... 
Compute 100л by multiplying both sides of this equation by 100. 
Subtract n from 1007 to give a value for 99и. 
Solve the resulting equation for n. 

b. Let m — 0.250000 .... 
Compute 100m by multiplying both sides of this equation by 100. 
Subtract m from 100m to give a value for 99m. 
Solve the resulting equation for m. 

c. Compare the values of n and m. 


Use Cantor's diagonalization method to show that the set of all infinite sequences of positive integers is 
not countable. 


Use Cantor’s diagonalization method to show that the set of all infinite strings of the letters (a, b} is not 
countable. 


Explain why the union of any two denumerable sets is denumerable. 
Explain why any subset of a countable set is countable. 
Sets can have sets as elements (see Exercise 13, for example). Let B be the set defined as follows: 


В = (S|Sisasetand S ¢ Sj 


Argue that both B є B and B є B are true. This contradiction is called Russell 5 paradox, after the famous 
philosopher and mathematician Bertrand Russell, who stated it in 1901. (A carefully constructed axiomat- 
ization of set theory puts some restrictions on what can be called a set. All ordinary sets are still sets, but 
peculiar sets that get us into trouble, like B in this exercise, seem to be avoided.) 
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SECTION 4.2 | COUNTING 


Combinatorics is the branch of mathematics that deals with counting. Count- 
ing questions are important whenever we have finite resources (How much stor- 
age does a particular database consume? How many users can a given computer 
configuration support?) or whenever we are interested in efficiency (How many 
computations does a particular algorithm involve?). 

Counting questions often boils down to finding how many members there are 
in some finite set, that 15, what is the cardinality of the set. This seemingly trivial 
question can be difficult to answer. We have already answered some “how many” 
questions—How many rows are there in a truth table with n statement letters, and 
how many subsets are there in a set with л elements? (Actually, as we've noted, 
these questions can be thought of as the same question.) 


Multiplication Principle 


We solved the truth table question by drawing a tree of possibilities. This tree sug- 
gests a general principle that can be used to solve many counting problems. Before 
we state the general principle, we look at another tree example. 


A child is allowed to choose one jellybean out of two jellybeans, one red and one 
black, and one gummy bear out of three gummy bears, yellow, green, and white. 
How many different sets of candy can the child have? 

We can solve this problem by breaking the task of choosing candy into two 
sequential tasks of choosing the jellybean and then choosing the gummy bear. 
The tree of Figure 4.3 shows that there are 2 X 3 = 6 possible outcomes: {R, Y}, 
{R, G}, {R, №), (B, Y), (B, С}, and (B, №). 


Choose jellybean. 


Choose gummy bear. 


{R, YY {R,G} (RW) (B, Y) {B,G} {B, W} 
Figure 4.3 
In this problem the sequence of events could be reversed; the child could 
choose the gummy bear first and the jellybean second, resulting in the tree of 
Figure 4.4, but the number of outcomes is the same (3 X 2 = 6). Thinking of a se- 
quence of successive events helps us solve the problem, but the sequencing is not 
a part of the problem since the set {R, Y} is the same as the set {Y, R}. 


Choose gummy bear. 


Choose jellybean. 


IY, К) ГОВ {G,R} {G,B} М, R} М, В} 
Figure 4.4 


Both these trees are “balanced” in the sense that the second level has a fixed num- 
ber of outcomes regardless of the outcome at the previous level. ө 
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Example 24 illustrates that the total number of outcomes for a sequence 
of events can be obtained by multiplying the number of outcomes for the first 
event by the number of outcomes for the second. This idea is summarized in the 
multiplication principle. 


Ф PRINCIPLE MULTIPLICATION PRINCIPLE 
If there are л, possible outcomes for a first event and n, possible outcomes for 
a second event, there are n, X п» possible outcomes for the sequence of the two 
events. 


The multiplication principle can be extended by induction to apply to a 
sequence of any finite number of events. (See Exercise 73 at the end of this sec- 
tion.) The multiplication principle is useful whenever we want to count the total 
number of possible outcomes for a task that can be broken down into a sequence 
of successive subtasks. 


REMINDER 


Use the multiplication 
principle when there is a 
sequence of events. 


The last part of your telephone number contains four digits. How many such 
four-digit numbers are there? 

We can construct four-digit numbers by performing a sequence of subtasks: 
choose the first digit, then the second, the third, and finally the fourth. The first 
digit can be any one of the 10 digits from 0 to 9, so there are 10 possible outcomes 
for the first subtask. Likewise, there are 10 different possibilities each for the sec- 
ond digit, the third, and the fourth. Using the multiplication principle, we multiply 
the number of outcomes for each subtask in the sequence. Therefore there are 
10 - 10 - 10 - 10 = 10,000 different numbers. ө 


If an element cannot be used again—that is, if repetitions are not allowed— 
the number of possible outcomes for successive events will be affected. 


| EXAMPLE 26 | Referring to Example 25, how many four-digit numbers are there if the same digit 


cannot be used twice? 

Again we have the sequence of subtasks of selecting the four digits, but no 
repetitions are allowed. There are 10 choices for the first digit, but only 9 choices 
for the second because we can’t use what we used for the first digit, and so on. 
There are 10-9 - 8-7 = 5040 different numbers. ө 


| EXAMPLE 27 | a. How many ways are there to choose three officers from a club of 


25 people? 

b. How many ways are there to choose three officers from a club of 25 people 

if someone can hold more than one office? 

In (a), there are three successive subtasks with no repetitions. The first subtask, 
choosing the first officer, has 25 possible outcomes. The second subtask has 24 out- 
comes, the third 23 outcomes. The total number of outcomes is 25 · 24 · 23 = 13,800. 
In (b), the same three subtasks are done in succession, but repetitions are allowed. 
The total number of outcomes is 25 - 25 - 25 = 15,625. e 
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ЭДШ ЯШ When you order pizza at your favorite pizza place, you have the choice of thin 
crust, regular crust, or deep-dish; small, medium, or large; and pepperoni, sausage, 
barbeque, extra cheese, or vegetable. How many different pizzas can be ordered? 

Again, there is a sequence of tasks: choose the crust, choose the size, and 
choose the topping. The total number of outcomes is 3 · 3 · 5 = 45. e 


PRACTICE 22 | Ifa man has four suits, eight shirts, and five ties, how many outfits can he put together? | 


| EXAMPLE 29 | For any finite set S, | S| denotes the number of elements in S. If 4 and В are finite 


sets, then 


|A X В| = |А [18| 


A X B consists of all ordered pairs with first component from А and second com- 
ponent from В. Forming such ordered pairs can be thought of as the sequence of 
tasks of choosing the first component, for which there are | А | outcomes, and then 
choosing the second component, for which there are | B | outcomes. The result fol- 
lows from the multiplication principle. ө 


Addition Principle 


Suppose we want to select a dessert from three pies and four cakes. In how many 
ways can this be done? There are two events, one with three outcomes (choosing 
a pie) and one with four outcomes (choosing a cake). However, we are not do- 
ing a sequence of two events here, since we are getting only one dessert, which 
must be chosen from the two disjoint sets of possibilities. The number of different 
outcomes is the total number of choices we have, 3 + 4 = 7. This illustrates the 
addition principle. 


Ф PRINCIPLE ADDITION PRINCIPLE 
If A and B are disjoint events with n, and n, possible outcomes, respectively, then 
the total number of possible outcomes for event “А or В” is n, + m. 


The addition principle can be extended by induction to the case of any finite 
number of disjoint events. (See Exercise 74 at the end of this section.) The addi- 
tion principle is useful whenever we want to count the total number of possible 
outcomes for a task that can be broken down into disjoint cases. 


A customer wants to purchase a vehicle from a dealer. The dealer has 23 autos and 

14 trucks in stock. How many selections does the customer have? 
The customer wants to choose a car or truck. These are disjoint events; choosing 
REMINDER an auto has 23 outcomes and choosing a truck has 14. By the addition principle, 
U ЧЕЗ choosing a vehicle has 23 + 14 = 37 outcomes. Notice the requirement that the 

se the addition principle ЕЯ P 

only when the events are outcomes for events А and В be disjoint sets. Thus, if a customer wanted to pur- 
disjoint—have no com- chase a vehicle from a dealer who had 23 autos, 14 trucks, and 17 red vehicles in 
mon outcomes. stock, we could not conclude that the customer had 23 + 14 + 17 choices! ө 
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Let A and B be disjoint finite sets. Then| 4 U B| = |A| * | B| 

Finding | А U B | сап be done by the disjoint cases of counting the number of 
elements in 4, | A |, and the number of elements in B, | B |. By the addition principle, 
we sum these two numbers. e 


| EXAMPLE? | IfA and B are finite sets, then 


|4 — BI И ЕЕ (СВ 
апа 
|4 —B|=|A|—|BlifBCA 
To prove the first equality, note that 


(A — B)U(ANB) = (ANB) U(ANB) 


= АГ\(В' UB) 
= А05 
= А 
so that А = (4 — B) U (4 N B). Also, А — В and A N В are disjoint sets; therefore, 
by Example 31, 
|A|=|(4 - B)U(ANB)|=|4 - BI+|ANB| 
or 


|4 - B|-|4|- |4 B| 


The second equation follows from the first, because if B C Athen A(1B = В. ө 


Using the Principles Together 


Frequently the addition principle is used in conjunction with the multiplication 
principle. 


Referring to Example 24, suppose we want to find how many different ways the 
child can choose the candy, rather than the number of sets of candy the child can 
have. Then choosing a red jellybean followed by a yellow gummy bear is not the 
same as choosing a yellow gummy bear followed by a red jellybean. We can con- 
sider two disjoint cases—choosing jellybeans first or choosing gummy bears first. 
Each of these cases (by the multiplication principle) has six outcomes, so (by the 
addition principle) there are 6 + 6 = 12 possible ways to choose the candy. ө 
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How many four-digit numbers begin with a 4 or a 5? 

We can consider the two disjoint cases—numbers that begin with 4 and num- 
bers that begin with 5. Counting the numbers that begin with 4, there is 1 out- 
come for the subtask of choosing the first digit, then 10 possible outcomes for the 
subtasks of choosing each of the other three digits. Hence, by the multiplication 
principle there are 1 · 10 - 10 · 10 = 1000 ways to get a four-digit number beginning 
with 4. The same reasoning shows that there are 1000 ways to get a four-digit num- 
ber beginning with 5. By the addition principle, there are 1000 + 1000 = 2000 total 
possible outcomes. ө 


PRACTICE 23 | Ifa woman has 7 blouses, 5 skirts, апа 9 dresses, how many different outfits does she 
have? 


Often a counting problem can be solved in more than one way. Although the 
possibility of a second solution might seem confusing, it provides an excellent 
way to check our work; if two different ways of looking at the problem produce 
the same answer, it increases our confidence that we have analyzed the problem 
correctly. 


Consider the problem of Example 34 again. We can avoid using the addition princi- 
ple by thinking of the problem as four successive subtasks, where the first subtask, 
choosing the first digit, has two possible outcomes—choosing a 4 or choosing a 5. 
Then there are 2 · 10 - 10 · 10 = 2000 possible outcomes. e 


How many three-digit integers (numbers between 100 and 999 inclusive) are even? 

One solution notes that an even number ends in 0, 2, 4, 6, or 8. Taking these 
as separate cases, the number of three-digit integers ending in 0 can be found by 
choosing the three digits in turn. There are 9 choices, 1 through 9, for the first digit; 
10 choices, 0 through 9, for the second digit; and 1 choice for the third digit, 0. 
By the multiplication principle, there are 90 numbers ending in 0. Similarly, there 
are 90 numbers ending in 2, 4, 6, and 8, so by the addition principle, there are 
90 + 90 + 90 + 90 + 90 = 450 numbers. 

Another solution takes advantage of the fact that there are only 5 choices for 
the third digit. By the multiplication principle, there are 9 · 10 · 5 = 450 numbers. 

For this problem, there is a third solution of the “serendipity” type we dis- 
cussed in Section 2.1. There are 999 — 100 + 1 = 900 three-digit integers in the 
range specified. Half are even and half are odd, so 450 of them must be even. Ф 


Suppose the last four digits of a telephone number must include at least one re- 
peated digit. How many such numbers are there? 

Although it is possible to do this problem by using the addition principle direct- 
ly, it is difficult because there are so many disjoint cases to consider. For example, 
if the first two digits are alike but the third and fourth are different, there are 
10- 1-9-8 ways this can happen. If the first and third digit are alike but the second 


EXAMPLE 38 
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and fourth are different, there are 10 - 9 · 1 · 8 ways this can happen. If the first two 
digits are alike and the last two are also alike but different from the first two, there are 
10 - 1-9-1 such numbers. Obviously, there are many other possibilities. 

Instead, we solve the problem by noting that numbers with repetitions and 
numbers with no repetitions are disjoint sets whose union equals all four-digit 
numbers. By Example 31 we can find the number with repetitions by subtracting 
the number with no repetitions (5040, according to Example 26) from the total 
number (10,000, according to Example 25). Therefore, there are 4960 numbers 
with repetitions. ө 


A computer (or tablet, or camera, or cell phone) that wishes to connect to the In- 
ternet must have an IP (Internet Protocol) address assigned to it. This allows the 
device to be “found” over the Internet, much as a postal address allows a building 
to be “found” via regular mail. The version of IP known as IPv4 uses an address 
that is a 32-bit, or 4-byte, number (1 byte equals 8 bits). The first part of the ad- 
dress, called the netid, identifies the network that the machine is part of and the 
rest, called the hostid, identifies the machine itself. Note that this is a hierarchical 
addressing scheme. A router trying to decide where to send a data packet looks at 
the netid to determine the network. The hostid bytes need never be consulted until 
the data packet has reached the correct network. U.S. postal addresses are hierar- 
chical in the opposite order, with the most specific information first. 

How many different IPv4 addresses are there? Each of the 32 bits can be set to 
0 or 1, so by the multiplication principle, there are2 -2 -2+ -++ - 2 = 2? different 
bit patterns. Looking at a more abstract view, assume that a particular IP address 
uses 16 bits for the netid and 16 bits for the hostid. Again using the multiplication 
principle, this would give 2'° - 2'° = (again) 2? unique IP addresses. This number 
is roughly 4.3 billion, which seems large enough to satisfy the world's needs. But 
no — the pool of IPv4 addresses allotted to some regions of the world began to run 
out in 2011 and more would do so in another year or two. Hence the switch to IPv6. 

An IPv6 address is 128 bits, divided into 64 bits for the network prefix that 
identifies a particular network and the last 64 bits for the interface ID that identifies 
the unique node on the network. While the gross structure of an IPv6 address there- 
fore sounds just like an IPv4 address only bigger, there are details that make the 
IPv6 scheme more efficient. And exactly how big is the pool of IPv6 addresses? 
Using the same reasoning as before, there are 2'** unique addresses. This number 
is roughly 3.4 X 10, or 340 trillion trillion trillion, enough, it is said, for every 
star in the known universe to have the equivalent of its very own IPv4 internet. 

The World IPv6 Launch occurred on June 6, 2012, but it was not like turning 
on a switch. Many major companies already supported IPv6 and IPv4 will con- 
tinue to be supported over a few years of transition. e 


Decision Trees 


Trees such as those in Figures 4.3 and 4.4 illustrate the number of outcomes of an 
event based on a series of possible choices. Such trees are called decision trees. 
We will see in Chapter 6 how decision trees are used in analyzing algorithms, 
but for now we use them to solve additional counting problems. The trees of 
Figures 4.3 and 4.4 led to the multiplication principle because the number of 
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outcomes at any one level of the tree is the same throughout that level. In 
Figure 4.4, for example, level 2 of the tree shows two outcomes for each of the 
3 branches formed at level 1. Less regular decision trees can still be used to solve 
counting problems where the multiplication principle does not apply. 


H H 
T T 
H H 
T T 
H T 


Figure 4.5 


а= чч 
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PERDE: 


Tony is pitching pennies. Each toss results in heads (H) or tails (T). How many 
ways can he toss the coin 5 times without having 2 heads in a row? 


Toss 1 


Toss 2 


Toss 3 


Toss 4 


Toss 5 


ArAra 
maar 
ач ч 
mata 
a 
a 
=ч чч 
PERGE 


Figure 4.5 shows the decision tree for this problem. Each coin toss has 2 outcomes; 
the left branch is labeled H for heads, the right branch is labeled T for tails. When- 
ever an H appears on a branch, the next level can only contain a right (T) branch. 


There are 13 possible outcomes, shown at the bottom of the tree. 


PRACTICE 24 | Explain why the multiplication principle does not apply to Example 39. > 


PRACTICE 25 


not have а Z following а Y. 


SECTION 4.2 REVIEW 


TECHNIQUE 


QJ Use the multiplication principle, the addition prin- 
ciple, and decision trees for counting the number of 
objects in a finite set. 


MAIN IDEAS 


* The multiplication principle is used to count 
the number of possible outcomes for a sequence of 
events, each of which has a fixed number of outcomes. 


Draw the decision tree for the number of strings of X’s, Y's, апа Z’s with length 3 that do 


The addition principle is used to count the number 
of possible outcomes for disjoint events. 

The multiplication and addition principles are often 
used together. 

Decision trees can be used to count the number 
of outcomes for a sequence of events where the 
number of outcomes for a given event is not con- 
stant but depends on the outcome of the preced- 
ing event. 
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EXERCISES 4.2 


1. 


20. 


A frozen yogurt shop allows you to choose one flavor (vanilla, strawberry, lemon, cherry, or peach), 
one topping (chocolate shavings, crushed toffee, or crushed peanut brittle), and one condiment (whipped 
cream or shredded coconut). How many different desserts are possible? 


. In Exercise 1, how many dessert choices do you have if you are allergic to strawberries and chocolate? 
. A video game is begun by making selections from each of 3 menus. The first menu (number of players) 


has 4 selections, the second menu (level of play) has 8, and the third menu (speed) has 6. In how many 
configurations can the game be played? 


. A multiple choice exam has 20 questions each with 4 possible answers and 10 additional questions each 


with 5 possible answers. How many different answer sheets are possible? 


. A user's password to access a computer system consists of 3 letters followed by 2 digits. How many dif- 


ferent passwords are possible? 


. On the computer system of Exercise 5, how many passwords are possible if uppercase and lowercase let- 


ters can be distinguished? 


. A telephone conference call is being placed from Central City to Booneville by way of Cloverdale. There 


are 45 trunk lines from Central City to Cloverdale and 13 from Cloverdale to Booneville. How many dif- 
ferent ways can the call be placed? 


. A, B, C, and D are nodes on a computer network. There are 2 paths between A and C, 2 between B and 


D, 3 between A and B, and 4 between C and D. Along how many routes can a message from A to D 
be sent? 


. How many nine-digit Social Security numbers are possible? 
. An apartment building purchases a new lock system for its 175 units. A lock is opened by punching in a 


two-digit code. Has the apartment management made a wise purchase? 


. A palindrome is a string of characters that reads the same forward and backward. How many five-letter 


English language palindromes are possible? 


. How many three-digit numbers less than 600 can be made using the digits 8, 6, 4, and 2? 
. A binary logical connective can be defined by giving its truth table. How many different binary logical 


connectives are there? 


. Three seats on the county council are to be filled, each with someone from a different party. There are 4 


candidates running from the Concerned Environmentalist party, 3 from the Limited Development party, 
and 2 from the Friends of the Spotted Newt party. In how many ways can the seats be filled? 


. In the original BASIC programming language, an identifier must be either a single letter or a letter 


followed by a single digit. How many identifiers are possible? 


. A president and vice-president must be chosen for the executive committee of an organization. There are 


17 volunteers from the Eastern Division and 24 volunteers from the Western Division. If both officers 
must come from the same division, in how many ways can the officers be selected? 


. A dinner special allows you to select from 5 appetizers, 3 salads, 4 entrees, and 3 beverages. How many 


different dinners are there? 


. In Exercise 17, how many different dinners are there if you may have an appetizer or a salad but not both? 
. A new car can be ordered with a choice of 10 exterior colors; 7 interior colors; automatic or three-speed 


or five-speed transmission; with or without air conditioning; with or without power steering; and with or 
without the option package that contains the power door lock and the rear-window defroster. How many 
different cars can be ordered? 


In Exercise 19, how many different cars can be ordered if the option package comes only on a car with an 
automatic transmission? 


21 


22; 


23. 


24. 


25. 


26. 
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. In one state, automobile license plates must have two digits (no leading zeros) followed by one letter 


followed by a string of two to four digits (leading zeros are allowed). How many different plates are 
possible? 


A Hawaiian favorite fast food is the “loco moco,” invented at a Hilo restaurant. It consists of a bed of rice 
under a meat patty with egg on top, the whole thing smothered in brown gravy. The rice can be white or 
brown, the egg can be fried, scrambled, or poached, and the meat can be hamburger, Spam, Portuguese sau- 
sage, bacon, turkey, hot dog, salmon, or mahi. How many different loco mocos can be ordered? 


A customer at a fast-food restaurant can order a hamburger with or without mustard, ketchup, pickle, 
or onion; a fish sandwich with or without lettuce, tomato, or tartar sauce; and a choice of 3 kinds of soft 
drinks or 2 kinds of milk shakes. How many different orders are possible if a customer can order at most 
1 hamburger, | fish sandwich, and 1 beverage but can order less? 


How many unique ways are there to stack two 2 X 4 Lego bricks of the same color? Two stacks that look 
the same if you merely rotate them are considered to be the same arrangement. Here is one such stack, the 
original on top and the same stack after rotating 180°. 


еее ө 
e € e e e ө e 
e ө e © 


өөө ө 
өө өөө ө ө 
өөө ө 


What is the value of Count after the following pseudocode has been executed? 


Count = 0 
for i = 1 to 5 do 
for Letter = ‘A’ to ‘C’ do 
Count = Count + 1 
end for 
end for 


What is the value of Result after the following pseudocode has been executed? 


Result = 0 
for Index = 20 down to 10 do 
for Inner = 5 to 10 do 
Result = Result + 2 
end for 
end for 


Exercises 27—32 concern the set of three-digit integers (numbers between 100 and 999 inclusive). 


21; 
28. 
29. 
30. 
31. 
32. 


How many are divisible by 5? 

How many are divisible by 4? 

How many are not divisible by 5? 

How many are divisible by 4 or 5? 

How many are divisible by 4 and 5? 

How many are divisible by neither 4 nor 5? 
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Exercises 33—42 concern the set of binary strings of length 8 (each character is either the digit 0 or the digit 1). 


33. 
34. 
35. 
36. 


24. 
38. 


39. 
40. 
41. 
42. 


How many such strings are there? 

How many begin and end with 0? 

How many begin or end with 0? 

How many have | as the second digit? 

How many begin with 111? 

How many contain exactly one 0? 

How many begin with 10 or have a 0 as the third digit? 
How many are palindromes? (See Exercise 11.) 

How many contain exactly seven 15? 


How many contain two or more 05? 


In Exercises 43—48, 2 dice are rolled, 1 black and 1 white. 


43. 


44. 
45. 
46. 
4T. 
48. 


How many different rolls are possible? (Note that a 4-black, 1-white result and a 1-black, 4-white result 
are two different outcomes.) 


How many rolls result in “snake eyes" (both dice showing 1)? 

How many rolls result in doubles (both dice showing the same value)? 

How many rolls result in a total of 7 or 11? 

How many rolls occur in which neither die shows the value 4? 

How many rolls occur in which the value on the white die is greater than the value on the black die? 


In Exercises 49—54, a customer is ordering a new desktop computer system. The choices are 21-inch, 23-inch, 
or 24-inch monitor (optional); 1 TB or 2 TB hard drive; 6 GB or 8 GB of RAM; 16X DVD or Blu-ray disk 
optical drive; Intel, AMD, or NVIDIA video card; inkjet, laser, or laser color printer; 1-, 2-, or 3-year warranty. 


49 
50. 
51: 
52. 
53. 


54. 


How many different machine configurations are possible? 

How many different machines can be ordered with a 2 TB hard drive? 

How many different machines can be ordered with a 21-inch monitor and an inkjet printer? 
How many different machines can be ordered if the customer decides not to get a new monitor? 


How many different machines can be ordered with a 21-inch monitor, a 1 TB hard drive, and an inkjet 
printer? 


How many different machines can be ordered if the customer does not want a 3-year warranty? 


Exercises 55—58 refer to the Konami code, a sequence of 10 keystrokes using 6 different characters: 
tt) 1 — < — B A. This code was inadvertently left by the developer in the first release of a video game; 
when entered on the video game console as the title screen is open, it surreptitiously adds assets to the player’s 
on-screen avatar. This code or versions of it have been retained in many video games. 


25. 
56. 
97. 


58. 


How many 10-character codes сап be created using these 6 characters? 
How many 10-character codes can be created if the BA sequence must be the last 2 characters? 


How many 10-character codes can be created if only 2 down-arrow characters can be used and they must 
be paired together wherever they appear? 


How many 10-character codes can be created if only the arrow characters are used? 
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In Exercises 59—68, a hand consists of 1 card drawn from a standard 52-card deck with flowers on the back and 
1 card drawn from a standard 52-card deck with birds on the back. A standard deck has 13 cards from each of 
4 suits (clubs, diamonds, hearts, spades). The 13 cards have face value 2 through 10, jack, queen, king, or ace. 
Each face value is a “kind” of card. The jack, queen, and king are “face cards.” 


59 


60. 
61. 
62. 
63. 
64. 
65. 
66. 
67. 
68. 
69. 


70. 


ТЕ 


72. 


73. 


74. 
Тә. 


'This 


. How many different hands are possible? (Note that a flower-ace-of-spades, bird-queen-of-hearts and a 
flower-queen-of-hearts, bird-ace-of-spades are two different outcomes.) 


How many hands consist of a pair of aces? 

How many hands contain all face cards? 

How many hands contain exactly | king? 

How many hands consist of two of a kind (2 aces, 2 jacks, and so on)? 

How many hands have a face value of 5 (aces count as 1, face cards count as 10)? 

How many hands have a face value of less than 5 (aces count as 1, face cards count as 10)? 
How many hands do not contain any face cards? 

How many hands contain at least 1 face card? 

How many hands contain at least 1 king? 


Draw a decision tree to find the number of binary strings of length 4 that do not have consecutive Os. 
(Compare your answer with the one for Exercise 41 of Section 3.2.) 


Draw a decision tree (use teams A and B) to find the number of ways the NBA playoffs can happen, where 
the winner is the first team to win 4 out of 7. 


Voting on a certain issue is conducted by having everyone put a red, blue, or green slip of paper into a hat. 
Then the slips are pulled out one at a time. The first color to receive two votes wins. Draw a decision tree 
to find the number of ways in which the balloting can occur. 


In Example 39, prove the following facts, where C(n) = the total number of nodes in the decision tree at 
level n, H(n) = the total number of nodes at level n resulting from an Н toss, T(n) = the total number of 
nodes at level n resulting from a T toss.! 


. C(n) = H(n) + T(n) 
. H(n) = T(n — 1) 
. T(n) = H(n — 1) + T(n — 1) 
. H(n) = H(n — 2) * T(n — 2) 
. C(n) = C(n — 2) + C(n — D) юги = 3 
C(n) = F(n + 1) where F(n) is the nth Fibonacci number 


Use mathematical induction to extend the multiplication principle to a sequence of m events for any inte- 
ger m, m = 2. 


o c в 


a 


rh oO 


Use mathematical induction to extend the addition principle to m disjoint events for any integer m, m = 2. 
Consider the product of n factors, x, · x5: · · х„. Such an expression can be fully parenthesized to indicate 
the order of multiplication in a number of ways. For example, ifn = 4, there are five ways to parenthesize: 

хут (Xp 7 (Ху * x4)) 

хү (Оо 7 x3) ` x4) 

(xı ` x2) + (6 * x4) 

(х * (о * x3)) ` Xa 


(Œ * x2) * x3) * xa 


problem was suggested by Mr. Tracy Castile, a former University of Hawaii at Hilo student. 
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a. Write a recurrence relation for P(n), the number of ways to parenthesize a product of n factors, n = 1. 
Assume that P(1) = 1. (Hint: Note that for n > 2, the last multiplication to be performed can occur in 
any ofn — 1 positions.) 

b. Prove that 


P(n) = C(n — 1) 


where C(0), C(1), ... 1s the sequence of Catalan numbers (see Exercise 38 of Section 3.1). 

76. A simple closed convex polygon consists of n points in the plane joined in pairs by n line segments; each 
point is the endpoint of exactly 2 line segments, and any line connecting 2 nonadjacent points lies wholly 
within the polygon. 

a. Show that an (n + 2)-sided simple closed convex polygon can be triangulated (divided into triangular 
regions) using n — 1 lines. (The figures show two different triangulations of a 6-sided polygon, 


where n — 4). 


(Hint: Use a pair of straight lines to shave off 2 corners; consider the cases where n is even and the cases 
where л is odd.) 

b. Write a recurrence relation for T(n), the number of different triangulations of an (n + 2)-sided polygon. 
Assume that 7(0) — 1. (Hint: Fix one edge of the polygon as the base of a triangle whose tip rotates 
around the polygon, as shown. Use the sides ofthe triangle to divide the polygon into 2 polygonal sec- 
tions with (k + 1) sides and (n — К + 2) sides.) 


Trivial Trivial 


2-sided k=1 k=2 k=3 k=4 2-sided 
polygon polygon 


c. Prove that T(n) = C(n), where C(0), C(1), ... is the sequence of Catalan numbers (see Exercise 38 of 
Section 3.1). 


SECTION 4.3 | PRINCIPLE OF INCLUSION AND EXCLUSION; 
PIGEONHOLE PRINCIPLE 


In this section we discuss two more counting principles that can be used to solve 
combinatorics problems. 
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Principle of Inclusion and Exclusion 


To develop the principle of inclusion and exclusion, we first note that if A and B 
are any subsets of a universal set S, then 4 — B, B — A, and А N B are mutually 
disjoint sets (see Figure 4.6). For example, if x є A — B, then x € B, therefore 
x€B-—Aandx€ANB. 


S 


A-B АПВ В-А 


Figure 4.6 


Also, something can be said about the union of these three sets. 


PRACTICE 26 | What is another name for the set (4 — В) U (B — A) U (A N В)? р 


From Example 31 (extended to three disjoint finite sets), 


|(А— В) О(В – А) О(АПВ) | = |4 — В| +|B-A|+|ANB| (1) 
From Example 32, 
|4 — В| = |41 - АП | 
апа 
|B - A|=|B|-|4nB| 
Using these expressions in Equation (1), along with the result of Practice 26, we get 
|AUB|=|A|-|ANB|+|B)-|ANB|+|ANB| 
or 
|AUB|=|4|+|B|-|ANB| Q) 
Equation (2) is the two-set version of the principle of inclusion and exclusion. 
The name derives from the fact that when counting the number of elements in 
the union of А and В, we must “include” (count) the number of elements in А and 


the number of elements in B, but we must “exclude” (subtract) those elements in 
A f B to avoid counting them twice. 
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PRACTICE 27 | How does Equation (2) relate to Example 31 of Section 4.2? 


СҮТТЕ A pollster queries 35 voters, all of whom support referendum 1, referendum 2, or 
both, and finds that 14 voters support referendum 1 and 26 support referendum 2. 
How many voters support both? 
If we let А be the set of voters supporting referendum 1 and В be the set of 
voters supporting referendum 2, then we know that 


|АОВ|=35_ |A|=14 |В|=26 


From Equation (2), 
|AUB|=|A|+|B|-|ANB| 
35 = 14 + 26—-|ANMB| 
АП В| = 14 + 26 – 35 = 5 
so 5 voters support both. e 


Equation (2) can easily be extended to three sets, as follows: 


|AU BUC|- |AU (BU C)| 
=|A|+|BUC|-|AN(BUC)| 
-|4|*1B|*|C]-|Bn. C] -J4n U 4 O) 
=|A|+|B|+|C|—|BNC|—-(4ANB|+|ANC|-|ANBNC) 
=|A|+|B|+|C|-|ANB|-|ANC|—-|BNC|+|ANBNC| 


Therefore the three-set version of the principle of inclusion and exclusion is 


|AUBUC|=|4]+|B]+|C|-|AMB|-|ANC|-|BNC|+|ANBNC| 6) 


PRACTICE 28 | Justify each of the equalities used in deriving Equation (3). 


In addition to the formal derivation of Equation (3) that we just did, a sort of 
geometric argument for | А U B U C |is suggested by Figure 4.7 on the next page. 
When we add | A| + |В|-+ |C], we are counting each of |AMB|,|AM C|, and| B 
П C|twice, so we must throw each of them away once. When we add | 4 | + | | 
+ [С |, we are counting | A N B N C| three times, but in subtracting | А N B |, 
|A N C|, |B N C| we have thrown it away three times, so we must add it back once. 
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Figure 4.7 


A group of students plans to order pizza. If 13 will eat sausage topping, 10 will eat 
pepperoni, 12 will eat extra cheese, 4 will eat both sausage and pepperoni, 5 will 
eat both pepperoni and extra cheese, 7 will eat both sausage and extra cheese, and 
3 will eat all three toppings, how many students are in the group? 
Let 
A = {students who will eat sausage} 


B = {students who will eat pepperoni} 
C = {students who will eat extra cheese} 


Then|A| = 13,|B| = 10,|C| = 12,,/AN B|=4,|BNC|=5,|AN C| = 7, and 
A 1 BM C| = 3. From Equation (3), 


AUBUC|=134 104 12—=4—=5-7 +3 =) 


We can also solve this problem by filling in all the pieces in a Venn diagram. 
Working from the middle outward, we know there аге 3 people in| AM BM C| 
(Figure 4.8a). We also know the number in each оғ |А (1 B|,| B(1C , and|AN C| 
so with a little subtraction we can fill in more pieces (Figure 4.8b). We also know 
the size of A, B, and C, allowing us to complete the picture (Figure 4.8c). Now the 
total number of students, 22, is obtained by adding up all the numbers. 


A B 


@ 


(а) (b) (с) 
Figure 4.8 
Although we are about to generalize Equation (3) to an arbitrary number of sets, 
the Venn diagram approach gets too complicated to draw with more than three 
sets. ө 


© PRINCIPLE 
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A produce stand sells only broccoli, carrots, and okra. One day the stand served 
207 people. If 114 people purchased broccoli, 152 purchased carrots, 25 purchased 
okra, 64 purchased broccoli and carrots, 12 purchased carrots and okra, and 9 pur- 
chased all three, how many people purchased broccoli and okra? 

Let 


A = {people who purchased broccoli} 
B = {people who purchased carrots} 
C = {people who purchased okra} 


Then|A UBUC]= 207,|4|= 114,|B| = 152,|C|= 25,|4 MB] = 64,|BN C|= 12, 
and| А BM C|= 9. From Equation (3), 


|AUBUC|=|4]+|B]+|C|-|ANB|-|ANC|-|BNC|+|ANBNC| 
207 = ПА 152 + 25 = 64 =|ANC|= 12 +9 
|AMC|= 114 + 152 + 25 = 64 – 12 + 9 – 207 = 17 e 


In Equation (2), we add the number of elements in the single sets and subtract 
the number of elements in the intersection of two sets. In Equation (3), we add the 
number of elements in the single sets, subtract the number of elements in the in- 
tersection of two sets, and add the number of elements in the intersection of three 
sets. This seems to suggest a pattern: If we have n sets, we should add the number 
of elements in the single sets, subtract the number of elements in the intersection 
of two sets, add the number of elements in the intersection of three sets, subtract 
the number of elements in the intersection of four sets, and so on. This leads us to 
the general form of the principle of inclusion and exclusion: 


OF INCLUSION AND EXCLUSION 
Given the finite sets А), +++, 4, n = 2, then 


[d Hee Dass > = > |4; N Aj| 


l=isn 1<=1<]<п 


+ 2. А N A;N А, | 


]Ixi«cj«kzn 


= чеке le o (4) 


In Equation (4) the notation 


У [41 4] 


1=1<ј=лп 


for example, says to add together the number of elements in all the intersections of 
the form 4; 4; where i and j can take on any values between 1 and n as long as i < j. 
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For n = 3, this gives| 4, (145| G = 1,7 = 2),| А, 1 A3|@ = Lj = 3), and | 45 N А; | 
(i = 2,7 = 3). This agrees with Equation (3), where A, = A, A, = B, and А; = С. 

To prove the general form of the principle of inclusion and exclusion, we use 
mathematical induction. Although the idea of the proof is straightforward, the 
notation is rather messy. The base case, n = 2, is just Equation (2). We assume 
that Equation (4) is true for п = k and show that it is true for n = k + 1. We write 


|40: U Ags | 
=|(4;U UU AU. 
=|M U sss U Ay uar 
=|(A,U ++ UA) тА 1] (by Equation (2)) 
© 
= > | A;| — + |4, N 4| + >, |4; N 4N Anl 


1<isk 1<i<jsk 1<i<j<msk 


Ф 
т (АП DA Asal 
—|(А,П 4+0) U ra" U (ALN Ags 1) 


(by the inductive hypothesis and the distributive property) 


@ Ө) 
= > jal- È edt У [4,0 Aj Anl 


1Sisk+1 1Si<jsk 1si<j<msk 


© 
— (DEAD: 04; 


Q Ө) 
-( x |4, ПА i| = > |4; O AN Aga al t + 
1si=k 1si<jsk 
К @ 
T4 > (4; Y 4400 G5 44 00 ПА, As) 


1Si<j<--+<msk 
К — 1 terms 


F (== 14 NSN Aaa) 


(by combining terms @ from above and using the inductive hypothesis on the Ж sets 
A, ПА, + » Ay ПА, 4 ls ees „AN Ak D 


= > | A;| — >, |4; П A;| + > A4 NAA Anl 


1<=К+1 1si<j<k +1 1<i<j<m<k +1 
mb (D ans dude 
(by combining like-numbered terms from above) 
= У |4|- У |4, N Aj| + ` |4: N AN Anl 
1<=К+1 1si<j<k +1 1<i<j<m<k +1 


sese s qe (DEFAN NA Aral 


This completes the proof of Equation (4). A different proof of the principle of in- 
clusion and exclusion can be found in Exercise 23 of Section 4.5. 
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Pigeonhole Principle 


The pigeonhole principle acquired its quaint name from the following idea: If 
more than k pigeons fly into k pigeonholes, then at least 1 hole will end up with 
more than 1 pigeon. Although this seems immediately obvious, we can belabor the 
point. Suppose each of the k pigeonholes contains at most | pigeon. Then there are 
at most k pigeons, not the more-than-& pigeons that supposedly flew in. 

Now we'll state the pigeonhole principle in a less picturesque way. 


€ PRINCIPLE PIGEONHOLE PRINCIPLE 
If more ап & items are placed into k bins, then at least 1 bin contains more than 
1 item. 


By cleverly choosing items and bins, a number of interesting counting prob- 
lems can be solved (see Example 7 of Chapter 2). 


| EXAMPLE% | How many people must be in a room to guarantee that 2 people have last names 
h 


that begin with the same initial? 

There are 26 letters of the alphabet (bins). If there are 27 people, then there are 
27 initials (items) to put into the 26 bins, so at least 1 bin will contain more than 
1 last initial. ө 


PRACTICE 29 | How many times must a single die be rolled in order to guarantee getting the same value 


twice? 


| EXAMPLE44 | ИН Prove that if 51 positive integers between 1 and 100 are chosen, then one of them 


must divide another. 

Let the integers be 7, ..., п. Each integer n; = 2 can be written as a product 
of prime numbers (fundamental theorem of arithmetic), every prime number except 
2 is odd, and the product of odd numbers 15 odd. Therefore for each i, n; = 2b, 
where k; = 0 and b, is an odd number. Furthermore, 1 = b; = 99, and there are 50 
odd integers between 1 and 99 inclusive, but there are 51 5 values. By the pigeon- 
hole principle, b; = b; for some i and j, so n; = 2*b, and nc 25b, ТЁК, =k » then n; 
divides л; otherwise, n; divides п. e 


SECTION 4.3 REVIEW 


TECHNIQUES MAIN IDEA 

QJ Use the principle of inclusion and exclusion to find ° The principle of inclusion and exclusion and the 
the number of elements in the union of sets. piegeonhole principle are additional counting 

* Use the pigeonhole principle to find the minimum mechanisms for sets. 


number of elements to guarantee two with a 
duplicate property. 
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EXERCISES 4.3 


1. 


11. 


All the guests at a dinner party drink coffee or tea; 13 guests drink coffee, 10 drink tea, and 4 drink both 
coffee and tea. How many people are guests at the dinner party? 


. Ina group of 42 tourists, everyone speaks English or French; there are 35 English speakers and 18 French 


speakers. How many speak both English and French? 


. After serving 137 customers, a cafeteria notes at the end of the day that 56 orders of green beans were 


sold, 38 orders of beets were sold, and 17 customers purchased both green beans and beets. How many 
customers bought neither beans nor beets? 


. Abike show will be held for mountain bikes and road bikes. Of the 24 people who register for the show, 17 


will bring road bikes and 5 will bring both road bikes and mountain bikes. How many will bring mountain 
bikes? 


. Quality control in a factory pulls 40 parts with paint, packaging, or electronics defects from an assembly 


line. Of these, 28 had a paint defect, 17 had a packaging defect, 13 had an electronics defect, 6 had both 
paint and packaging defects, 7 had both packaging and electronics defects, and 10 had both paint and 
electronics defects. Did any part have all three types of defect? 


. Ina group of 24 people who like rock, country, and classical music, 14 like rock, 17 like classical, 11 like 


both rock and country, 9 like rock and classical, 13 like country and classical, and 8 like rock, country, and 
classical. How many like country? 


. Nineteen different mouthwash products make the following claims: 12 claim to freshen breath, 10 claim 


to prevent gingivitis, 11 claim to reduce plaque, 6 claim to both freshen breath and reduce plaque, 5 claim 
to both prevent gingivitis and freshen breath, and 5 claim to both prevent gingivitis and reduce plaque. 


a. How many products make all three claims? 
b. How many products claim to freshen breath but do not claim to reduce plaque? 


. From the 83 students who want to enroll in CS 320, 32 have completed CS 120, 27 have completed 


CS 180, and 35 have completed CS 215. Of these, 7 have completed both CS 120 and CS 180, 16 have 
completed CS 180 and CS 215, and 3 have completed CS 120 and CS 215. Two students have completed 
all three courses. The prerequisite for CS 320 is completion of one of CS 120, CS 180, or CS 215. How 
many students are not eligible to enroll? 


. Asurvey of 150 college students reveals that 83 own automobiles, 97 own bikes, 28 own motorcycles, 53 


own a car and a bike, 14 own a car and a motorcycle, 7 own a bike and a motorcycle, and 2 own all three. 
a. How many students own a bike and nothing else? 
b. How many students do not own any of the three? 


. Among a bank's 214 customers with checking or savings accounts, 189 have checking accounts, 73 have 


regular savings accounts, 114 have money market savings accounts, and 69 have both checking and regu- 
lar savings accounts. No customer is allowed to have both regular savings and money market savings 
accounts. 


a. How many customers have both checking and money market savings accounts? 
b. How many customers have a checking account but no savings account? 


At the beginning of this chapter you surveyed the 87 computer users who subscribe to your electronic 
newsletter in preparation for the release of your new software product. 


The results of your survey reveal that of the 87 subscribers, 68 have a Windows-based system available 
to them, 34 have a Linux system available, and 30 have access to a Mac. In addition, 19 have access 
to both Windows and Linux systems, 11 have access to both Linux systems and Macs, and 23 can use 
both Macs and Windows. 


26. 


27. 
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Use the principle of inclusion and exclusion to determine how many subscribers have access to all three 
types of systems. 


. You are developing a new bath soap, and you hire a public opinion survey group to do some market re- 


search for you. The group claims that in its survey of 450 consumers, the following criteria were named 
as important factors in purchasing bath soap: 


Odor 425 
Lathering ease 397 
Natural ingredients 340 
Odor and lathering ease 284 
Odor and natural ingredients 315 
Lathering ease and natural ingredients 219 
All three factors 147 


Should you have confidence in these results? Why or why not? 


‚а. How many integers n, 1 = n = 100, are multiples of either 2 or 5? 


b. How many integers п, 1 = п = 100, are not multiples of either 2 or 5? 


. How many integers и, 1 = n = 1000, are not multiples of either 3 or 7? 
. a. Write the expression for|4 U BU CU D | from Equation (4). 


b. Write an expression for the number of terms in the expansion of| А, U - - - U A, given by Equation (4). 


. Patrons of a local bookstore can sign up for advance notification of new book arrivals in genres of interest. 


In the first month of this service, 32 sign up for mysteries, 34 for spy novels, 18 for westerns, and 41 for 
science fiction. Of these, 17 sign for both mysteries and spy novels, 8 for both mysteries and westerns, 
19 for mysteries and science fiction, 5 for spy novels and westerns, 20 for spy novels and science fiction, 
and 12 for westerns and science fiction. In addition, 2 sign up for mysteries, spy novels and westerns, 11 
for mysteries, spy novels and science fiction, 6 for mysteries, westerns, and science fiction, and 5 for spy 
novels, westerns, and science fiction. Finally, 2 people sign up for all four categories. How many people 
signed up for service in the first month? 


. How many cards must be drawn from a standard 52-card deck to guarantee 2 cards of the same suit? 

. How many cards must be drawn from a standard 52-card deck to guarantee a black card? 

. If 12 cards are drawn from a standard deck, must at least 2 of them be of the same denomination (type)? 
. How many cards must be drawn from a standard 52-card deck to guarantee 2 queens? 

. A computerized dating service has a list of 50 men and 50 women. Names are selected at random; how 


many names must be chosen to guarantee one name of each gender? 


. А computerized housing service has a list of 50 men and 50 women. Names are selected at random; how 


many names must be chosen to guarantee two names of the same gender? 


. How many people must be in a group to guarantee that 2 people in the group have the same birthday (don't 


forget leap year)? 


. In a group of 25 people, must there be at least 3 who were born in the same month? 
. Prove that if four numbers are chosen from the set (1, 2, 3, 4, 5, 6}, at least one pair must add up to 7. 


(Hint: Find all the pairs of numbers from the set that add to 7.) 


How many numbers must be selected from the set (2, 4, 6, 8, 10, 12, 14, 16, 18, 20} to guarantee that at 
least one pair adds up to 22? (See the hint for Exercise 25). 

Let n be a positive number. Show that in any set of n + 1 numbers, there are at least two with the same 
remainder when divided by n. 
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SECTION 4.4 | PERMUTATIONS AND COMBINATIONS 


Permutations 


Example 26 in Section 4.2 discussed the problem of counting all possibilities for 
the last four digits of a telephone number with no repeated digits. In this problem, 
the number 1259 is not the same as the number 2951 because the order of the 
four digits is important. An ordered arrangement of objects is called a permuta- 
tion. Each of these numbers is a permutation of 4 distinct objects chosen from 
a set of 10 distinct objects (the digits). How many such permutations are there? 
The answer, found by using the multiplication principle, is 10 - 9 - 8 - 7—there are 
10 choices for the first digit, then 9 for the next digit because repetitions are not al- 
lowed, 8 for the next digit, and 7 for the fourth digit. The number of permutations 
ofr distinct objects chosen from n distinct objects is denoted by P(n, r). Therefore 
the solution to the problem of the four-digit number without repeated digits can be 
expressed as P(10, 4). 

A formula for Р(п, ғ) can be written using the factorial function. For a positive 
integer n, п factorial is defined as n(n — 1)(n — 2)--- 1 and denoted by n!; also, 0! 
is defined to have the value 1. From the definition of n!, we see that 


n! — n(n — 1)! 
and that for r < n, 
п! ии Лр (и А Р) 
(0 1 (n = p) 
= n(n— D: (n—r4 1) 


Using the factorial function, 


P(10,4) = 109-8. 7 
10:9:8-7-6:5-4:3-2-1 10! 10! 
4-352«] 6! (10 — 4)! 


In general, P(n, ғ) is given by the formula 


P(n, r) — forO =r=n 
(n — т)! 
| EXAMPLE | The value of P(7, 3) is 
! ! gaS k заро || 
| к es = 7:6:5 = 210 e 
(7—3)! 4! 4-3-2-1 


ЭТТ 8 Three somewhat special cases that can arise when computing P(n, r) are the two 
“boundary conditions” P(n, 0) and P(n, n), and also P(n, 1). According to the 
formula, 
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ез сс ш 


This formula сап be interpreted as saying that there is only one ordered arrange- 
ment of zero objects—the empty set. 


лс рс 


This formula reflects the fact that there are n ordered arrangements of 1 object. 
(Each arrangement consists of the 1 object, so this merely counts how many ways 
to get the 1 object.) 


n! n! 
P(n, n) — quem ПЁ n! 


This formula states that there are n! ordered arrangements of n distinct objects, 
which merely reflects the multiplication principle—7 choices for the first 
object, n — 1 choices for the second object, and so on, with 1 choice for the nth 
object. e 


| EXAMPLE% | The number of permutations of 3 objects, say a, b, and c, is given by 


P(3,3) = 3! = 3-2-1 = 6. The 6 permutations of a, b, and c are 


abc, acb, bac, bca, cab, cba e 


SOVA How many three-letter words (not necessarily meaningful) can be formed from the 
word “compiler” if no letters can be repeated? Here the arrangement of letters mat- 
ters, and we want to know the number of permutations of 3 distinct objects taken 
from 8 objects. The answer is P(8, 3) — 8!/5! — 336. ө 


Note that we could have solved Example 48 just by using the multiplication 
principle—there are 8 choices for the first letter, 7 for the second, and 6 for the 
third, so the answer is 8 - 7 - 6 = 336. P(n, r) simply gives us a new way to think 
about the problem, as well as a compact notation. 


ҮТЕ Ten athletes compete in an Olympic event. Gold, silver, and bronze medals are 
awarded; in how many ways can the awards be made? 

This problem is essentially the same as the one in Example 48. Order mat- 
ters; given 3 winners A, B, and C, the arrangement A — gold, B— silver, C— bronze 
is different than the arrangement С —gold, A—silver, B —bronze. So we want the 
number of ordered arrangements of 3 objects from a pool of 10, or P(10, 3). Using 
the formula for P(n, ғ), Р(10, 3) = 10/7! = 10-9- 8 = 720. ө 
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PRACTICE 30 | In how many ways can a president and vice-president be selected from a group of 
20 people? i 
PRACTICE 31 | In how many ways can 6 people be seated in a row of 6 chairs? = 


Counting problems can have other counting problems as subtasks. 


A library has 4 books on operating systems, 7 on programming, and 3 on data 
structures. Let’s see how many ways these books can be arranged on a shelf, given 
that all books on the same subject must be together. 

We can think of this problem as a sequence of subtasks. First we consider 
the subtask of arranging the 3 subjects. There are 3! outcomes to this subtask, 
that is, 3! different orderings of subject matter. The next subtasks are arranging 
the books on operating systems (4! outcomes), then arranging the books on 
programming (7! outcomes), and finally arranging the books on data structures 
(3! outcomes). Thus, by the multiplication principle, the final number of arrange- 
ments of all the books 15 (3!)(4!)(7!)(3!) = 4,354,560. ө 


Combinations 


Sometimes we want to select r objects from a set of n objects, but we don't care 
how they are arranged. Then we are counting the number of combinations of r 
distinct objects chosen from n distinct objects, denoted by C(n, r). For each such 
combination, there are r! ways to permute the r chosen objects. By the multipli- 
cation principle, the number of permutations of r distinct objects chosen from 
n objects is the product of the number of ways to choose the objects, C(n, r), 
multiplied by the number of ways to arrange the objects chosen, r! Thus, 


C(n, r): r! = P(n,r) 
or 


P(nr) | n! 


r! r!(n =r)! 


C(n, r) = 


foO0-zrzn 


Other notations for C(n, r) are 


n 
„С„ Cr, ( ) 
r 
| EXAMPLESI | The value of C(7, 3) is 


1 _ P65 32 

3107753) 34! 321432 
7:65 
“рр e 


From Example 45, the value of P(7, 3) is 210, and C(7, 3) - (3!) = 35(6) = 210 
IA, 3). e 
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| EXAMPLES2 | The special cases for C(n, r) are C(n, 0), C(n, 1), and C(n, n). The formula for C(n, 0), 


n! 
SOS TESTE 


reflects the fact that there is only one way to choose zero objects from л objects: 
Choose the empty set. 


n! 


Cn lye-— — — 
i aay 
Here the formula indicates that there are n ways to select 1 object from n objects. 
n! 
C(n, n) = ———— = 1 
Uu oec 


Here we see that there is only one way to select л objects from n objects, and that 
is to choose all of the objects. ө 


In the formula for C(n, r), suppose n is held fixed and r is increased. Then r! 
increases, which tends to make C(n, r) smaller, but (n — r)! decreases, which tends 
to make C(n, r) larger. For small values of r, the increase in r! is not as great as 
the decrease in (n — r)!, and so C(n, r) increases from 1 to л to larger values. At 
some point, however, the increase in r! overcomes the decrease in (n — r)!, and 
the values of C(n, r) decrease back down to | by the time r = n, as we calculated 
in Example 52. Figure 4.9a illustrates the rise and fall of the values of C(n, r) fora 
fixed n. For P(n, r), as nis held fixed and r is increased, n — r and therefore (n — r)! 
decreases, so P(n, r) increases. Values of P(n, r) for 0 = r <n thus increase from 
1 to n to n!, as we calculated in Example 46. See Figure 4.9b; note the difference 
in the vertical scale of Figures 4.9a and 4.9b. 


Figure 4.9 (a) (b) 


How many 5-card poker hands are possible with a 52-card deck? Here order does 
not matter because we simply want to know which cards end up in the hand. We 
want the number of ways to choose 5 objects from a pool of 52, which is a combi- 
nations problem. The answer is C(52, 5) = 52!/(5!47!) = 2,598,960. e 
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Unlike earlier problems, the answer to Example 53 cannot easily be obtained 
by applying the multiplication principle. Thus, С(и, r) gives us a way to solve new 
problems. 


| EXAMPLES | Ten athletes compete in an Olympic event; 3 will be declared winners. How many 


sets of winners are possible? 

Here, as opposed to Example 49, there is no order to the 3 winners, so we 
are simply choosing 3 objects out of 10. This is a combinations problem, not a 
permutations problem. The result is C(10, 3) = 10!/(3!7!) = 120. Notice that 
there are fewer ways to choose 3 winners (a combinations problem) than to 
award gold, silver, and bronze medals to 3 winners (a permutations problem— 
Example 49). e. 


PRACTICE 32 | How many committees of 3 are possible from a group of 12 people? 


REMINDER Remember that the distinction between permutations and combinations lies 
in whether the objects are to be merely selected or both selected and ordered. 
If ordering is important, the problem involves permutations; if ordering is not 
important, the problem involves combinations. For example, Practice 30 is a per- 


In a counting problem, 
first ask yourself if order 
matters. If it does, it's a 


permutations problem. If mutations problem—2 people are to be selected and ordered, the first as president, 
not, it’s a combinations the second as vice-president—whereas Practice 32 is a combinations problem— 
problem. 3 people are selected but not ordered. 


In solving counting problems, С(и, r) can be used in conjunction with the 
multiplication principle or the addition principle. 


ИШЕТ ГА А committee of 8 students is to be formed from a class consisting of 19 freshmen 


and 34 sophomores. 


a. How many committees of 3 freshmen and 5 sophomores are possible? 
b. How many committees with exactly 1 freshman are possible? 
c. How many committees with at most 1 freshman are possible? 
d. How many committees with at least 1 freshman are possible? 


Because the ordering of the individuals chosen is not important, these are com- 
binations problems. 

For part (a), we have a sequence of two subtasks, selecting freshmen and 
selecting sophomores. The multiplication principle should be used. (Thinking of a 
sequence of subtasks may seem to imply ordering, but it just sets up the levels of 
the decision tree, the basis for the multiplication principle. There is no ordering of 
the students.) Because there are C(19, 3) ways to choose the freshmen and C(34, 5) 
ways to choose the sophomores, the answer is 


19! 34! 
C(19, 3) - C(34, 5) = 


3116! 5029! (969)(278,256) 


REMINDER 


“At least” counting prob- 
lems are often best solved 
by subtraction. 
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For part (b), we again have a sequence of subtasks: selecting the single fresh- 
man and then selecting the rest of the committee from among the sophomores. 
There are C(19, 1) ways to select the single freshman and C(34, 7) ways to select 
the remaining 7 members from the sophomores. By the multiplication principle, 
the answer is 


19 — 34 
MO = D 2184 575! 


C(19, 1) - C(34, 7) = = 19(5,379,616) 


For part (c), we get at most 1 freshman by having exactly 1 freshman or by 
having 0 freshmen. Because these are disjoint events, we use the addition prin- 
ciple. The number of ways to select exactly 1 freshman is the answer to part (b). 
The number of ways to select 0 freshmen is the same as the number of ways to 
select the entire 8-member committee from among the 34 sophomores, C(34, 8). 
Thus the answer is 


C(19, 1) - C(34, 7) + C(34, 8) = some big number 


We can attack part (d) in several ways. One way 15 to use the addition princi- 
ple, thinking ofthe disjoint possibilities as exactly 1 freshman, exactly 2 freshmen, 
and so on, up to exactly 8 freshmen. We could compute each of these numbers and 
then add them. However, it is easier to do the problem by counting all the ways the 
committee of 8 can be selected from the total pool of 53 people and then eliminat- 
ing (subtracting) the number of committees with 0 freshmen (all sophomores). 
Thus the answer is 


C(53, 8) — C(34, 8) " 


The factorial function grows large quickly. 4 number like 100! cannot be 
computed on most calculators (or on most computers unless double— precision 
arithmetic is used), but expressions like 


100! 
25175! 


can nevertheless be computed by first canceling common factors. 


Eliminating Duplicates 


We mentioned earlier that counting problems can often be solved in different ways. 
Unfortunately, it is also easy to find so-called solutions that sound eminently rea- 
sonable but are incorrect. Usually they are wrong because they count something 
more than once (or sometimes they overlook counting something entirely). 


Consider again part (d) of Example 55, the number of committees with at least 1 
freshman. A bogus solution to this problem goes as follows: Think of a sequence 
of two subtasks, choosing a freshman and then choosing the rest of the committee. 
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There are C(19, 1) ways to choose | freshman. Once a freshman has been selected, 
that guarantees that at least 1 freshman will be on the committee, so we are free to 
choose the remaining 7 members of the committee from the remaining 52 people 
without any restrictions, giving us C(52, 7) choices. By the multiplication prin- 
ciple, this gives C(19, 1) - C(52, 7). However, this is a bigger number than the 
correct answer. 

The problem is this: Suppose Derek and Felicia are both freshmen. In one of 
the choices we have counted, Derek is the one guaranteed freshman, and we pick 
the rest of the committee in such a way that Felicia is on it along with 6 others. But 
we have also counted the option of making Felicia the guaranteed freshman and 
having Derek and the same 6 others be the rest of the committee. This is the same 
committee as before, and we have counted it twice. e 


PRACTICE 33 | A committee of 2 to be chosen from 4 math majors and 3 physics majors must include at 


least 1 math major. Compute the following 2 values. 


a. C(7, 2) — C(3, 2) (correct solution: all committees minus those with no math majors) 
b. C(4, 1) - C(6, 1) (bogus solution: choose 1 math major and then choose the rest of the committee) 


The expression C(4, 1) - C(6, 1) — C(4, 2) also gives the correct answer because C(4, 2) is the number 
of committees with 2 math majors, and these are the committees counted twice in C(4, 1) · C(6, 1). и 


| EXAMPLES | a. How many distinct permutations can be made from the characters in the 


word FLORIDA? 
b. How many distinct permutations can be made from the characters in the 
word MISSISSIPPI? 


Part (a) is a simple problem of the number of ordered arrangements of seven 
distinct objects, which is 7!. However, the answer to part (b) is not 11! because the 
11 characters in MISSISSIPPI are not all distinct. This means that 11! counts some 
of the same arrangements more than once (the same arrangement meaning that we 
cannot tell the difference between MIS,S,ISSIPPI and MIS,S,ISSIPPI.) 

Consider any one arrangement of the characters. The four S’s occupy certain 
positions in the string. Rearranging the S’s within those positions would result in 
no distinguishable change, so our one arrangement has 4! look-alikes. In order to 
avoid overcounting, we must divide 11! by 4! to take care of all the ways of mov- 
ing the S’s around. Similarly, we must divide by 4! to take care of the four Г and 
by 2! to take care of the two P’s. The number of distinct permutations is thus 


11! 
41412! ө 


In general, suppose there are n objects of which a set of n, are indistinguish- 
able from each other, another set of n, are indistinguishable from each other, and 
so on, down to n, objects that are indistinguishable from each other. The number 
of distinct permutations of the n objects is 


n! 


(99) : * (12) 
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PRACTICE 34 | How many distinct permutations are there of the characters in the word MONGOOSES? | 


EXAMPLE 58 


Permutations and Combinations with Repetitions 


Our formulas for P(n, r) and C(n, r) assume that we arrange or select r objects out 
of the n available using each object only once. Therefore r = n. Suppose, however, 
that the n objects are available for reuse as many times as desired. For example, we 
construct words using the 26 letters of the alphabet; the words may be as long as 
desired with letters used repeatedly. Or we may draw cards from a deck, replacing 
a card after each draw; we may draw as many cards as we like with cards used 
repeatedly. We can still talk about permutations or combinations of r objects out 
of n, but with repetitions allowed, r might be greater than n. 

Counting the number of permutations of r objects out of n distinct objects 
with repetition is easy. We have n choices for the first object and, because we can 
repeat that object, n choices for the second object, n choices for the third, and so 
on. Hence, the number of permutations of r objects out of n distinct objects with 
repetition allowed is л”. 

To determine the number of combinations of r objects out of n distinct objects 
with repetition allowed, we use a rather clever idea. 


A jeweler designing a pin has decided to use five stones chosen from a supply of 
diamonds, rubies, and emeralds. How many sets of stones are possible? 

Because we are not interested in any ordered arrangement of the stones, this is 
a combinations problem rather than a permutations problem. We want the number 
of combinations of five objects out of three objects with repetition allowed. The 
pin might consist of 1 diamond, 3 rubies, and 1 emerald, for instance, or 5 dia- 
monds. We can represent these possibilities by representing the stones chosen by 
5 asterisks and placing markers between the asterisks to represent the distribution 
among the three types of gem, diamonds, rubies, and emeralds. For example, we 
could represent the choice of 1 diamond, 3 rubies, and 1 emerald by 


while the choice of 5 diamonds, 0 rubies, and 0 emeralds would be represented by 


Although we wrote the asterisks and markers in a row, there is no ordering implied. 
We are just looking at seven slots holding the five gems and the two markers, and 
the different choices are represented by which of the seven slots are occupied by 
asterisks. We therefore count the number of ways to choose five items out of seven, 
which is C(7, 5) or 
7! 
50! 
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In general, if we use the same scheme to represent a combination of r objects out 
of n distinct objects with repetition allowed, there must be n — 1 markers to indi- 
cate the number of copies of each of the n objects. This gives r + (n — 1) slots to 
fill, and we want to know the number of ways to select r of these. Therefore we 
want 


(r * n — 1)! 
n(r-*n-l-r 


(r t n = 1)! 
r!(n — 1)! 


C(r*n-lr)- 


This agrees with the result in Example 58, where r = 5, n = 3. 


PRACTICE 35 | Six children get one lollipop each from among a selection of red, yellow, and green lol- 
lipops. How many sets of lollipops are possible? (We do not care which child gets which.) | 


We have discussed a number of counting techniques in this chapter. Table 4.2 
summarizes the techniques you can apply in various circumstances, although there 
may be several legitimate ways to solve any one counting problem. 


TABLE 4.2 


You Want to Count the Number of ... 


Technique to Try 


Subsets of an n-element set 


Use formula 2”. 


Outcomes of successive events 


Multiply the number of outcomes for 
each event. 


Outcomes of disjoint events 


Add the number of outcomes for each 
event. 


Outcomes given specific choices at 
each step 


Draw a decision tree and count the 
number of paths. 


Elements in overlapping sections of 
related sets 


Use principle of inclusion and exclusion 
formula. 


Ordered arrangements of r out of n 
distinct objects 


Use P(n, г) formula. 


Ways to select r out of n distinct 
objects 


Use C(n, г) formula. 


Ways to select r out of n distinct 
objects with repetition allowed 


Use C(r + n — 1,r) formula. 


Generating Permutations and Combinations 


In a certain county, lottery ticket numbers consist of a sequence (a permutation) 
of the 9 digits 1, 2, ..., 9. The ticket printing company may or may not know that 
9! — 362,880 distinct ticket numbers are possible, but it certainly needs a way to 
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generate all possible ticket numbers. Or the county council (a group of 12 mem- 
bers) wants to form a subcommittee of 4 members but wants to pick the combina- 
tion of council members it feels can best work together. The council could ask 
someone to generate all C(12, 4) = 495 potential subcommittees and examine 
the membership of each one. We see that in some situations, simply counting the 
number of permutations or combinations is not enough; it is useful to be able to 
list all the permutations or combinations. 


Example 47 asked for the number of permutations of the three objects a, b, and c. 
The answer is given by the formula P(3,3) = 3! = 6. However Example 47 went 
on to list the six permutations: 


abc, ach, bac, bca, cab, cba 


This list presents the six permutations using lexicographical ordering, that is, the 
order in which they would be found in a dictionary if they were legitimate words. 
Thus abc precedes acb because although both words begin with the same first 
character, for the second character, b precedes c. If we had three integers, say 4, 6, 
and 7, instead of three alphabetical characters, the lexicographical ordering of all 
six permutations would present values in increasing numerical order: 


467, 476, 647, 674, 746, 764 e 


PRACTICE 36 | Arrange the following list of permutations in lexicographical order: 


EXAMPLE 60 


Scary, yarsc, scyra, cysar, scrya, yarcs п 


Words that are close in lexicographical order have the maximum number of 
matching leftmost characters or, equivalently, differ in the fewest rightmost char- 
acters. We use this characteristic to develop a process to generate all permutations 
of the integers (1, ..., n} in lexicographical order. 


Consider the set (1, 2, 3, 4,5}. The smallest numerical value (the first permutation) 
is given by the increasing order of all the integers, namely, 


12345 


To generate the next number in lexicographical order, we want to retain as many 
of the leftmost digits as possible. Clearly we can't keep the leftmost four digits be- 
cause this also determines the fifth digit. To keep the leftmost three digits, 123 — —, 
we must be able to rearrange the remaining two digits to represent a larger value 
than they do now. Reading 12345 from right to left, we find in the last two digits 
that 4 « 5, which means we can reverse the 4 and the 5 to get 


12354 
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which is the next permutation in the list. That’s all that can be done with the last 
two digits; in particular, since 54 is a decreasing sequence, we can’t use these two 
values to generate anything larger. 

For the next number, we keep 12— — — and consider how to arrange the last 
three digits. Reading 12354 from right to left, we find in the last three digits that 
3 « 5, but we know that everything from 5 to the right is a decreasing sequence. 
The next permutation should replace 3 with the next largest value to its right. 
Reading from right to left in the number 12354, the first value larger than 3, in 
this case 4, 15 the least value larger than 3. Swapping 3 and 4 gives 12453, which 
puts 4 in the correct order; the digits to the right are now in descending order, so 
reversing them gives 


12435 


which is the next permutation. e 


| EXAMPLE®I | To continue Example 60, let's jump ahead. Suppose we have just generated 


permutation 
25431 


and we want the next permutation. Reading from right to left, everything increases 
until we get to 2, where we have 2 « 5. Starting again from right to left, we stop 
at the first (and smallest) value greater than 2, which is 3. Swapping 2 and 3 gives 
35421, giving the correct first digit. The digits after 3 are in descending order, so 
reversing them involves swapping 5 and 1, and also swapping 4 and 2, giving the 
next permutation 


31245 ө 


From ће preceding examples, we can construct an algorithm to generate all 
permutations of the integers from 1 to n in lexicographical order. 


ALGORITHM PERMUTATION GENERATOR 


PermGenerator(integer n = 2) 


//generates in lexicographical order all permutations 

//of the integers in the set (1, ..., п} 

Local variables: 

integers i, j /hndices of permutation elements 
integer k //for loop counter 

integers d, d», ..., d, //left to right elements of a permutation 
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//create and write out smallest permutation 
for k = 1 ton do 
d, =k 
end for 
write d,d...d, 


//create and write out remaining permutations 
for k = 2 ton! do 
/ЛооК right to left for first break in increasing sequence 
i=n-1 
j = 
while d; > d; do //still increasing right to left 
i-i-l 
== 
end while 
//now d; < d, need to replace d; with next largest integer 


//look right to left for smallest value greater than d; 
Jeu 
while d; > d; do 
j=j-1 
end while 
//now d; is smallest value > d; 


swap d; and d; 


//reverse the digits to the right of index i 
i=it+l 
у = 
while i < j do 
swap d; and d; 
p= + 1 
== 
end while 


write djd,...d, 
end for 
end function PermGenerator 


PRACTICE 37 | Walk through the steps in the algorithm that generate the next permutation following 
51432. 


Another algorithm for generating (not in lexicographical order) all permuta- 
tions of the integers (1, ... , n} is suggested in Exercise 7 of On the Computer at 
the end of this chapter. Both of these algorithms can also be used to generate all 
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permutations of any п distinct elements; simply assign each of the л elements a 
unique integer from 1 to n, generate the permutations of the integers, and then 
reverse the assignment. 

Our second problem is to generate the C(n, r) combinations of r distinct in- 
tegers chosen from (1, ... , n}. Such a combination does not involve order, it is 
merely a subset of r elements. Nonetheless we will represent the subset (3, 5, 7} 
as the sequence 357, and generate the subsets in lexicographical order. Once we 
generate 357, we can't also generate 375 or 753 or any ofthe other permutations of 
the elements in this set. Each legitimate representation is an increasing sequence. 


| EXAMPLE? | Consider the lexicographical ordering of the combinations of 4 integers from 


(1, ..., 7}. If the combination 
2346 
has just been generated, then the next combination would be 
2347 
obtained by incrementing the last digit of the sequence. However, in 2347, the last 
digit is already at its maximum allowable value. Moving to the left, the 4 can be 


bumped up to 5, but then the last digit has to be reduced to its minimum value, 
which is 6 (one more than 5). Therefore, 


2356 
is the next combination. The next two values are 
DESI? S GI 


at which point both 7 and 6 are at their maximum values. The 3 can be bumped up, 
but the two digits to its right have to be reset to their lowest possible values. The 
next few values are 


2456, 2457, 2467, 2567 ... e 


Based on the ideas of Example 62, given a combination sequence the algo- 
rithm should bump up the rightmost digit that is not at its maximum allowable 
value. The sequence of digits to the right of the newly incremented digit v should 
have the values v + 1, v + 2, and so on. The initial (smallest) combination is 
РЕ А 
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ALGORITHM COMBINATION GENERATOR 


CombGenerator(integer n = 2, integer r = 1) 
//generates in lexicographical order all combinations 
//of г integers from the set (1, ..., n} 

Local variables: 


integers i, j /hndices of combination elements 
integer k //for loop counter 

integer max //maximum allowable value for a digit 
integers d, d», ..., d, //left to right elements of a combination 


//стеаїе and write out smallest combination 
fork = 1 tor do 
d, = k 
end for 
write did, ... d, 


//create and write out remaining combinations 
for k = 2 to C(n, r) do 
/llook right to left for first non-max value 


max =n 
і = ү 
while d; = max do //look left 
f= = Il 
max = max — 1 
end while 


/Inow d; < max, need to increment d; 
dl = dis Il 


//reset values right of d; 
forj = і + 1tordo 
dj = dj) + | 
end for 
write d)d)...d, 
end for 
end function CombGenerator 


PRACTICE 38 | Using this algorithm, find the next combination of five items from 11, ... , 9} after 24589. 


SPECIAL INTEREST PAGE 


Archimedes and the Stomachion 


Archimedes was one of the greatest mathematicians 
of the ancient world. He lived about 287—212 B.C.E. in 
Syracuse (a Greek city-state in current-day Sicily). It 
is thought that he studied for a time in Alexandria 
(Egypt) with students of Euclid. At any rate, he was 
greatly interested in geometry, and he thought his 
most important work was the discovery of a formula 
for the volume of a sphere. He also approximated the 
value of тт and grasped the ideas of integral calculus 
by approximating the area under a curve by breaking 
it up into a series of rectangles. His contributions to 
mechanics, engineering, physics, and astronomy are 
also amazing. Some of his inventions helped defend 
the city of Syracuse from Roman invaders. 

Archimedes wrote a treatise about the Stoma- 
chion, which is a puzzle apparently known even 
before Archimedes’ time. The puzzle consists of 14 
polygons that fill a 12 X 12 square. It is constructed 
by marking off unit intervals along the edges of the 
square and connecting them to create a grid. Each in- 
tersection of two grid lines, or of a grid line with the 
square's border, is called a lattice point. The polygons 
are formed by lines that connect certain lattice points. 
The 14 pieces and their areas (each an integer) are 
shown here, and these areas add up to 144, the area 
of the square. (А modern result, Pick's theorem, says 
that the area of such a polygon is given by the formula 
A = I + B/2 — 1, where J = the number of lattice 
points enclosed within the polygon and В = the num- 
ber of lattice points on the border of the polygon. You 
can test this theorem on the diagram here, which is 
from the Stomachion construction Web page.) 


aa = 


ө Chapter 


That Archimedes’ work on the Stomachion is 
available today, at least in part, is quite a story. No 
original writings of Archimedes exist today, but copies 
were made throughout the ages. One such copy is an 
ancient parchment (goatskin) manuscript that was 
written sometime in the 900s. It survived for 300 cen- 
turies but in 1229 it was torn apart, washed, folded, and 
written over as a prayer book (early recycling!). Then 
it disappeared for hundreds of years, to be discovered 
in the library of a Greek Orthodox monastery in Con- 
stantinople in 1906 by a Danish classics scholar who 
noted the written-over characters and deciphered what 
he could, enough to recognize that it was a work by 
Archimedes. It contained information on the Stoma- 
chion. The object of the puzzle, it was thought, was to 
use the pieces to construct interesting shapes (a bird, 
an elephant, etc.), much like the better-known tangram 
puzzle. Pretty trivial stuff to occupy the mind of Ar- 
chimedes, it seemed. 

The parchment prayer book again disappeared 
after World War I until it was put up for auction by a 
French family in 1998. Purchased by an anonymous 
American buyer for $2 million, it was handed over to 
scientists for further study. By this time, the parch- 
ment was in terrible shape, with missing pages, holes 
and tears, and covered with mold, to say nothing of 
the overwriting. Modern technology has revealed 
more of the original text, which brought about a new 
interpretation of the Stomachion puzzle. 

Dr. Reviel Netz, a mathematics historian at Stan- 
ford, has concluded that Archimedes was trying to 
count how many ways the pieces could be combined 
to form the original square. This is essentially a 
combinatorics problem. Is it a simple problem? No, 
as it turns out. Dr. Netz posed this problem to two 
husband-and-wife teams of combinatorics experts, 
who worked six weeks to prove that the number is 
17,152. If you ignore issues of symmetry, such as ro- 
tations and reflections, the answer is 536, which was 
confirmed via a computer program written by the 
computer scientist Dr. William Cutler. 
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Combinatorics had been considered a modern 
branch of mathematics, of much interest in com- 
puter science but supposedly unknown to the ancient 
Greeks. Archimedes’ Palimpsest (a “palimpsest” is a 
written-over manuscript) gives combinatorics a much 
earlier foundation. 


“Archimedes,” Encyclopedia Britannica, 
http://www. britannica.com/EBchecked/topic/32808/ 
Archimedes 

“In Archimedes’ Puzzle, a New Eureka Moment," The New 
York Times, December 14, 2003, 


http://www.nytimes.com/2003/12/14/us/in- 
archimedes-puzzle-a-new-eureka-moment. 
html?pagewanted-all&src-pm 

Reading Between the Lines, Smithsonian.com, 

http://www.smithsonianmag.com/science-nature/ 
archimedes.html?c=y&page=1 

Stomachion construction, 

http://www.math.nyu.edu/~crorres/Archimedes/ 
Stomachion/construction.html 

A Tour of Archimedes' Stomachion, Chung, F., and 
Graham, R., 

http://www.math.ucsd.edu/~fan/stomach/ 
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SECTION 4.4 REVIEW 


TECHNIQUES e Generate all permutations of the integers (1, ..., n} 


* Find the number of permutations of r distinct 
objects chosen from 7 distinct objects. 

* Find the number of combinations of r distinct 
objects chosen from 7 distinct objects. 

(D Use permutations and combinations in conjunction 


in lexicographical order. 
* Generate all combinations of r integers from the set 


(1, ..., n). 


MAIN IDEAS 


with the multiplication principle and the addition ° There are formulas for counting various permuta- 


principle. tions and combinations of objects. 
* Find the number of distinct permutations of n ° Care must be taken when analyzing a counting 
objects that are not all distinct. problem to avoid counting the same thing more 
* Find the number of permutations of r objects than once or not counting some things at all. 
out of n distinct objects when objects may be ° Algorithms exist to generate all permutations of 
repeated. n objects and all combinations of r out of n objects. 


* Find the number of combinations of r objects out 
of n distinct objects when objects may be repeated. 


EXERCISES 4.4 


l. 


ч ON tn A UC 


Compute the value of the following expressions. 
a. P(7,2) b. P(8,5) 


. Compute the value of the following expressions. 


a. P(6, 4) b. P(n,n — 1) 


. How many batting orders are possible for a 9-man baseball team? 

. The 14 teams in the local Little League are listed in the newspaper. How many listings are possible? 

. How many different ways can 10 flavors of ice cream be arranged in an ice cream store display case? 

. How many different ways are there to arrange 6 candidate names on a ballot? 

. How many permutations of the characters in COMPUTER are there? How many of the permutations end 


in a vowel? 


. In how many ways can 6 people be seated in a circle of 6 chairs? Only relative positions in the circle can 


be distinguished. 


. In how many ways can first, second, and third prize in a pie-baking contest be given to 15 contestants? 
. &. Stock designations on an exchange are limited to 3 letters. How many different designations are there? 


b. How many different designations are there if letters cannot be repeated? 


. In how many different ways can 19 people be seated in a row? 
. In how many different ways can 11 men and 8 women be seated in a row? 


. In how many different ways can 11 men and 8 women be seated in a row if the men all sit together and the 


women all sit together? 


. In how many different ways can 11 men and 8 women be seated in a row if no 2 women аге to sit together? 


. In how many different ways can 11 men and 8 women be seated around a circular table? (Only relative 


positions in the circle can be distinguished.) 


. In how many different ways can 11 men and 8 women be seated around a circular table if no 2 women are 


to sit together? (Only relative positions in the circle can be distinguished.) 


20. 


21. 
22. 
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. Compute the value of the following expressions. 


a. C(10, 7) b. С(9, 2) с. С(8, 6) 


. Compute C(n, n — 1). Explain why С(п, n — 1) = C(n, 1). 
. Quality control wants to test 25 microprocessor chips from the 300 manufactured each day. How many 


different batches of test chips are possible? 


A soccer team carries 18 players on the roster; 11 players make a team. How many different teams are 
possible? 


How many juries of 5 men and 7 women can be formed from a panel of 17 men and 23 women? 


How many different sets of 4 novels and 3 plays can be created from a collection of 21 novels and 
11 plays? 


Exercises 23—26 deal with the following situation: Of a company’s personnel, 7 people work in design, 14 in 
manufacturing, 4 in testing, 5 in sales, 2 in accounting, and 3 in marketing. A committee of 6 people is to be 
formed to meet with upper management. 


23. 
24. 
25. 


26. 


How many committees with 1 member from each department are possible? 
How many committees with exactly 2 members from manufacturing are possible? 


How many committees with no representative from accounting and exactly 1 representative from market- 
ing are possible? 


How many committees with at least 2 representatives from manufacturing are possible? 


Exercises 27—32 concern a 5-card hand from a standard 52-card deck. A standard deck has 13 cards from each 
of 4 suits (clubs, diamonds, hearts, spades). The 13 cards have face value 2 through10, jack, queen, king, or ace. 
Each face value is a “kind” of card. The jack, queen, and king are “face cards." 


21. 
28. 
29. 
30. 
31. 
32. 


How many hands contain 4 queens? 

How many hands contain all diamonds? 

How many hands contain 3 spades and 2 hearts? 

How many hands contain cards from all 4 suits? 

How many hands consist of all face cards? 

How many hands contain exactly 2 spades and exactly 2 hearts? 


Exercises 33—42 concern 5-card poker hands from a standard 52-card deck. 


33. 
34. 


35. 
36. 
37. 


38. 


39. 


40. 
41. 


How many hands contain a royal straight flush (that is, the 10, jack, queen, king, ace of one suit)? 


How many hands contain a straight flush (that is, 5 consecutive cards of the same suit, where aces can be 
low or high) that is not a royal straight flush (see Exercise 33)? 


How many hands contain four of a kind (such as 4 jacks plus a fifth card)? 
How many hands contain a full house (that is, three of a kind plus a pair of another kind)? 


How many hands contain a flush (that is, 5 cards of the same suit) that is not a straight flush or a royal 
straight flush (see Exercises 33 and 34)? 


How many hands contain a straight (that 15, 5 consecutive cards, where aces can be low or high) that is not 
a straight flush or a royal straight flush (see Exercises 33 and 34)? 


How many hands contain three of a kind (that is, exactly 3 cards of the same kind plus 2 other cards that 
are not a pair)? 


How many hands contain 2 pairs (that is, 2 pairs of 2 different kinds plus a fifth card of some third kind)? 
How many hands contain 1 pair (that is, exactly 2 cards of the same kind)? 
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42. If the joker is added to the deck and functions as a fifth ace (of any suit), how many hands contain a royal 
straight flush (that is, the 10, jack, queen, king, ace of one suit)? 


For Exercises 43—48, 14 copies of a code module are to be executed in parallel on identical processors or- 
ganized into two communicating clusters, A and B. Cluster A contains 16 processors and cluster B contains 
32 processors. 

43. How many different sets of processors can be used? 

44. How many different sets of processors can be used if all modules must execute on cluster B? 


45. How many different sets of processors can be used if 8 modules are to be processed on cluster А and 6 on 
cluster B? 


46. How many different sets of processors can be used if cluster A has 3 failed processors and cluster B has 
2 failed processors? 


47. How many different sets of processors can be used if exactly 2 modules are to execute on cluster B? 


48. How many different sets of processors can be used if all modules are to be executed either on cluster A or 
on cluster B? 


For Exercises 49—52, a set of 4 coins is selected from a box containing 5 dimes and 7 quarters. 


49. Find the number of sets of 4 coins. 

50. Find the number of sets in which 2 coins are dimes and 2 are quarters. 
51. Find the number of sets composed of all dimes or all quarters. 

52. Find the number of sets with 3 or more quarters. 


Exercises 53—56 concern a computer network with 60 switching nodes. 
53. The network is designed to withstand the failure of any 2 nodes. In how many ways can such a failure 
occur? 
54. In how many ways can | or 2 nodes fail? 
55. If 1 node has failed, in how many ways can 7 nodes be selected without encountering the failed node? 
56. If 2 nodes have failed, in how many ways can 7 nodes be selected to include exactly 1 failed node? 


In Exercises 57—60, a congressional committee of 3 is to be chosen from a set of 5 Democrats, 3 Republicans, 
and 4 independents. 

57. How many committees are possible? 

58. How many committees with at least 1 independent are possible? 

59. How many committees that do not include both Democrats and Republicans are possible? 

60. How many committees with at least 1 Democrat and at least 1 Republican are possible? 


In Exercises 61—66, the states of California, Arizona, New Mexico, Utah, and Nevada each send a team of 
6 delegates to the Southwestern States annual conference. A subcommittee of 9 is to be formed to discuss water 
rights. 

61. How many committees are possible? 

62. How many committees with no delegates from New Mexico are possible? 

63. How many committees with exactly 1 delegate from New Mexico are possible? 

64. How many committees with at most 1 delegate from New Mexico are possible? 


65. 
66. 
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How many committees with at least 2 delegates from Nevada are possible? 
How many committees with at most 4 delegates total from Arizona and California are possible? 


In Exercises 67—70, a hostess wishes to invite 6 dinner guests from a list of 14 friends. 


67. 
68. 


69. 


70. 


71. 


72. 


73. 
74. 
75. 


76. 
TT. 


78. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


How many sets of guests are possible? 


How many sets of guests are possible if 6 friends are boring and 6 friends are interesting, and the hostess 
wants to have at least 1 of each? 


How many sets of guests are possible if 2 of the friends dislike each other and neither will come if the 
other is present? 


How many sets of guests are possible if 2 of the friends are very fond of each other and one won't come 
without the other? 


Twenty-five people, including Simon and Yuan, are candidates to serve on a committee of 5. How many 
committees that include Simon or Yuan are possible? 


A student must select 5 classes for the next semester from among 12, but one of the classes must be either 
American history or English literature. How many sets of classes are possible? 


How many 5-card hands from a standard 52-card deck contain exactly 4 aces and exactly 1 club? 
How many 5-card hands from a standard 52-card deck contain exactly 3 jacks and exactly 2 hearts? 


How many distinct permutations of the characters in ERROR are there? (Remember that the various R's 
cannot be distinguished from one another.) 


How many distinct permutations of the characters in LOLLAPALOOZA are there? 

a. How many distinct permutations of the characters in the word HAWAIIAN are there? 

b. How many of the permutations begin with H? 

a. How many distinct permutations of the characters in the word APALACHICOLA are there? 
b. How many of the permutations have both L’s together? 


A bookstore displays a shelf of 5, 3, and 4 copies, respectively, of the top 3 bestsellers. How many distin- 
guishable arrangements of these books are there 1f books with the same title are not distinguishable? 


The United Group for Divisive Action uses secret code words that are permutations of 5 characters. You 
learn that there are only 10 code words. What can you say about repeated characters in the code words? 


At a dinner party for 5, a tray of 5 servings of appetizers is prepared. An appetizer could be escargots, egg 
rolls, or nachos. How many different trays could the kitchen produce? 


A florist has a large number of roses, carnations, lilies, and snapdragons in stock. How many different 
bouquets of one dozen flowers can be made? 


A cheese shop carries a large stock of 34 kinds of cheese. By the end of the day, 48 cheese sales have been 
made, and the items sold must be restocked. How many different restocking orders are possible? 


One “game package" consists of 12 bingo cards. How many different game packages are there if there are 
15 kinds of cards and repetitions are allowed? 


A hardware shipping order contains 6 items, where each item is either a gallon of paint, a hammer, or a 
drill. 


a. How many different shipping orders are possible? 
b. How many different shipping orders are possible if no paint is shipped? 


c. How many different shipping orders are possible if each order must contain at least 1 gallon of paint, 
] hammer, and 1 drill? 


86. 


87. 


88. 


89. 


90. 


91. 


92. 
93. 
94. 
95. 
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At a birthday party, a mother prepares a plate of cookies for 8 children. There are plenty of chocolate chip, 
peanut butter, and oatmeal cookies, but each child gets only 1 cookie. 


a. How many different plates can be prepared? 

b. How many different plates can be prepared if at least 1 of each kind of cookie is given out? 
c. How many different plates can be prepared if no one likes oatmeal cookies? 

d. How many different plates can be prepared if 2 children insist on getting peanut butter? 

e. 


How many different plates can be prepared if the dog got into the kitchen and ate all the chocolate chip 
cookies except 2? 


On Halloween, 10 identical apples are distributed to 7 children. 


a. How many distributions are possible? (Hint: One possible distribution is that child 1 gets 3 apples, 
child 2 gets 0 apples, child 3 gets 2 apples, child 4 gets 0 apples, child 5 and child 6 get 1 apple 
each, and child 7 gets 3 apples. Although the problem says apples are distributed to children, think of 
assigning a child's name to each apple; a child's name can go to more than 1 apple.) 


b. How many distributions are possible if each child is to receive at least 1 apple? 

Eight identical antique pie safes are sold at a furniture auction to 3 bidders. 

a. In how many ways can the pie safes be distributed among the bidders? (See the hint for Exercise 87.) 
b. In how many ways can the pie safes be distributed if bidder A gets only 1 pie safe? 

How many distinct nonnegative integer solutions are there to the equation 


Xj +X, + х3 x4 = 10 


where the solution 


and the solution 


x, =4,x, = 2,0 = 3,x,= 1 


are distinct? (Hint: Think of this problem as distributing 10 pennies to 4 children; then see the hint in 
Exercise 87.) 


How many distinct nonnegative integer solutions are there to the equation 
ху +X +x3=7 


in which x, = 3? (See the hint for Exercise 89.) 


Prove that for n = 1, P(n, n) = P(n, n — 1). (The proof does not require induction, even though it sounds 
like a very likely candidate for induction.) 


Prove that for n = 2, P(n, 1) + P(n, 2) = n°. 

Prove that for any n andr with 0 = r = n, C(n, г) = C(n, n — r). Explain why this is intuitively true. 
Prove that for any n andr with 0 =r = n, C(n, 2) = C(r, 2) + Cu — r, 2) + r(n — r). 

Prove the identity 


C(n, r)C(r, К) = C(r, ЮС(п — К, r - k) forr = nand k = ғ 


Give a combinatorial argument. 
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96. Prove Vandermonde’s identity: 
C(n + m,r) = У C(n, k)C(m, r — К) 
k=0 


(Hint: use a combinatorial argument, picking r elements from the union of two disjoint sets of size n and 
m, respectively.) 


97. The recurrence relation for the sequence of Catalan numbers is 


С(0) = 1 
Са) = 1 


C(n) = Уак —)Cn-k n=2 
k=1 


Although we will not prove it, a closed-form solution to this recurrence relation is 


1 
C(n) = n+l С(2п, п) 


(Note that C(n) denotes a value in the Catalan sequence and C(2n, n) denotes the number of combinations 
of n objects from 2n objects.) Compute C(2), C(3), and C(4) using this formula and compare the results 
with the recurrence relation results (see Exercise 38 of Section 3.1). 


98. а. A turtle begins at the upper left corner of an n X n grid and makes his way to the lower right corner. 
Along the way, he can move only right or down. The accompanying figure shows two possible paths in 
a 4 X 4 grid. How many possible paths can the turtle take? 


(Hint: Each path can be described by a sequence of A's (right moves) and D's (down moves). Find the 
number of ways to distribute the A's in such a sequence.) 


b. Relate the answer to part (a) to the sequence of Catalan numbers (see Exercise 97). 


99. Arrange the following permutations of the numbers (1, ... , 6} in lexicographical order: 
163542, 345621, 643125, 634521, 163452, 356421 
100. Arrange the following permutations of the numbers 11, ... , 5) in reverse lexicographical order: 


32541, 35142, 53124, 42531, 32154, 42315 
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In Exercises 101—104, use algorithm permutation generator to generate the next permutation after the given 
permutation in the set of all permutations of the numbers {1, ... , 7}. 


101. 
102. 
103. 
104. 
105. 


106. 


107. 


108. 


7431652 
4365127 
3675421 
2756431 


In generating all combinations of five items from the set (1, ... , 9}, find the next five values in the list 
after 24579. 


In generating all combinations of four items from the set (1, ... , 6}, find the next five values in the list 
after 1234. 


Describe an algorithm to generate all permutations of the integers (1, ... , n} in reverse lexicographical 
order. 


Describe an algorithm to generate all permutations of r elements from the set (1, ... , n}. 


SECTION 4.5 | BINOMIAL THEOREM 


The expression for squaring a binomial is a familiar one: 
(a+ Б)? = а? + 2ab + P? 


This is a particular case of raising а binomial to а nonnegative integer power л. 
The formula for (a + b)" involves combinations of n objects. Before we prove this 
formula, we'll look at a historically interesting array of numbers that suggests a 
fact we will need in the proof. 


Pascal's Triangle 


Pascal’s triangle is named for the seventeenth-century French mathematician 
Blaise Pascal (for whom the programming language Pascal was also named), al- 
though it was apparently known several centuries earlier. Row n of the triangle 
(n = 0) consists of all the values C(n, r) for 0 = r = n. Thus the triangle looks like 


this: 
Row 
C(0, 0) 0 
C(1,0) C(l,1) 1 
С(2,0) СО,1) CQ, 2) 2 
C(3,0 C(3,1) C(3,2) С(3, 3) 3 
C(4,0 C(4,1) C(4,2) C(4,3) C(4,4) 4 
5 


C(5,0) C(5,1) C(5,2) C(5,3) C(5,4) C(5,5) 


C(n,0) Cn, 1) y Со 0). Coin 
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If we compute the numerical values of the expressions, we see that Pascal’s tri- 
angle has the form 


I: 3-9 +l 
14 6 4 1 
15 10 10 5 1 


Observing this figure, it is clear that the outer edges are all 15. But it also seems 
that any element not on the outer edge can be obtained by adding together the two 
elements directly above it in the preceding row (for example, the first 10 in row 
five is below the first 4 and the 6 of row four). If this relationship is indeed always 
true, it means that 


C(n,k) = Cn - 1,k — 1) + Сп — L,) forl Sk En-1 (1) 


Equation (1) is known as Pascal’s formula. 
To prove Pascal's formula, we begin with the right side: 


Ca- ik=- ) + Си 1,0 = шш. X ш. 
uds eT) ceret ped Nota 
(n — 1)! (n — 1)! 


(= 00и = 01 Kn-—1- E 
_ka- D! , (- Dt - 5 
О kn—Kkt kn- 0)! 


(multiplying the first term by k/k and the second term by (n — A)/(n — K)) 


Кп = D! + (n — D! — K 
К\(п — k) 


(adding fractions) 


(n = Dk + (n = k)] 


k(n — k)! 
(factoring the numerator) 
(n = Dto) 
С Kn — 0! 
7 n! 
К\(п — k)! 


= С(п, К) 
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Another, less algebraic way to prove Pascal’s formula involves a counting ar- 
gument; hence it is called a combinatorial proof. We want to compute C(n, K), 
the number of ways to choose k objects from n objects. There are two disjoint 
categories of such choices—item 1 is one of the k objects or it is not. If item 1 is 
one of the k objects, then the remaining k — 1 objects must come from the remain- 
ing n — 1 objects exclusive of item 1, and there are C(n — 1, k — 1) ways for this 
to happen. If item 1 is not one of the k objects, then all k objects must come from 
the remaining л — 1 objects, and there are C(n — 1, К) ways for this to happen. 
The total number of outcomes is the sum of the number of outcomes from these 
two disjoint cases. 

Once we have Pascal's formula for our use, we can develop the formula for 
(a + b)", known as the binomial theorem. 


Binomial Theorem and Its Proof 


In the expansion of (a + b, à? + 2ab + b’, the coefficients are 1, 2, and 1, which 
is row 2 in Pascal’s triangle. 


PRACTICE 39 | Compute the expansion for (a + by? and (a + b)* and compare the coefficients with rows 3 


and 4 of Pascal's triangle. 


© THEOREM 


REMINDER 


The expansion of (a + b)" 
starts with a^b?. From 
there the power of a goes 
down and the power of b 
goes up, but for each term 
the powers of a and b add 
up to n. The coefficients 
are all of the form 

C(n, the power of b). 


Looking at the coefficients in the expansion of (a + Б), (a + b)’, and (a + D)* 
suggests a general result, which is that the coefficients in the expansion of (a + b)” 
look like row n in Pascal’s triangle. This is indeed the binomial theorem. 


BINOMIAL THEOREM 
For every nonnegative integer п, 


(a + Б)" = C(n, O)a"b? + C(n, Da" !b! + C(n, 2)а" 7b? 
toc C(n, ka" bI + +++ + С(п, п = Dalb"^! + Cn, n)a°b" 


У С(п, Юа" p 
k=0 


Because the binomial theorem is stated “for every nonnegative integer п,” a proof 
by induction seems appropriate. For the basis step, n = 0, the theorem states 


(a + b = C(O, 0)a°b® 


which is 


Since this is certainly true, the basis step is satisfied. 
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As the inductive hypothesis, we assume that 
(a + b} = C(k, 0)ab? + C(k, 1)а 1b! + --- + C(k k — Dalb*^!  C(k, Rab" 
Now consider 
(a + byt! = (a + b(a + b) = (a + bY'a + (a + Б) 
= [C(k, 0)a*b® + C(k, а‘ + +++ + C(k, k — Dalp^! 


+ C(k, Hab‘ а + [С(®, 0)a^b? + C(K, Паб !b! 
tee + Ck k = Dalb ^! + Clk, k)a?b*]b 


(by the inductive hypothesis) 


= C(k, O)a** 5? + C(k, аъ! + +++ + C(k, k — Dab! 

+ C(k, )alb* + C(k, 0)a b! + C(k, Da ^ !P? 

C(k, k — 1)а!Ь* + C(k, )a9p**! 

= C(k, 0)a** !5? + [C(k, 0) + Clk, 1) Jab! + [C(k, 1) + C(k, 2)]a* P? 
[C(k, k — 1) + Clk, £)]a!'b* + C(k, а“! 


(collecting like terms) 


= Ck, 0)a^* 'b* + Clk + 1, Паф! + C(k + 1, 2)a* 162 
++ C(k + 1, kja'bt + CUR Jap" 


(using Pascal’s formula) 


= C(k + 1, O)a**!b° + C(k + 1, Lab! + C(k + 1, 2)а“ 12 
+ +--+ Chk + 1, Юа + C(k + 1, k + 1)a?b*t! 


(because C(k, 0) = 1 = C(k + 1,0) and C(k, k) = 1 = C(k + 1,k + D) 


This completes the inductive proof of the binomial theorem. 

The binomial theorem also has a combinatorial proof. Writing (a + b)" 
as (a + bY(a + b)--+ (a + Б) (n factors), we know that the answer (using the distribu- 
tive law of numbers) is the sum of all values obtained by multiplying each term in 
a factor by a term from every other factor. For example, using b as the term from k 
factors and a as the term from the remaining n — k factors produces the expression 
a" *b*. Using b from a different set of k factors and a from the n — k remaining fac- 
tors also produces a" *b*, How many such terms are there? There are C(n, k) different 
ways to select К factors from which to use 5; hence there are C(n, k) such terms. After 
adding these terms together, the coefficient of a" *b* is C(n, k). As k ranges from 0 
to л, the result of summing the terms is the binomial theorem. 

Because of its use in the binomial theorem, the expression C(n, r) is also 
known as a binomial coefficient. 
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Applying the Binomial Theorem 


Using the binomial theorem, we can write the expansion of (x — 3)*. To match 
the form of the binomial theorem, think of this expression as (x + (—3))* so that b 
equals —3. Remember that a negative number raised to a power is positive for an 
even power, negative for an odd power. Thus 


(x — 3)* = C(4, 0x(—3)? + C(4, 1j? (-3)! + C(4, 25 (-3Y 
+ C(4, 3)x!(—3) + C(4, 4)x°(-3)4 
= xf + 40(—3) + 6x°(9) + 4x(—27) + 81 
= xt — 12? + 54x? — 108x + 81 e 


PRACTICE 40 | Expand (x + 1)? using the binomial theorem. 


The binomial theorem tells us that term k + 1 in the expansion of (a + D)" 
is C(n, Ka" *b*. This allows us to find individual terms in the expansion without 
computing the entire expression. 


PRACTICE 41 | What is the fifth term in the expansion of (x + y)? 


By using various values for a and 5 in the binomial theorem, certain identities 
can be obtained. 


| EXAMPLE 64 | С Let a = b = 1 inthe binomial theorem. Then 


Gl se Пу) = (Qoo. 0) ЗР (Ca, 1) чо 9 se Cos) ae v? se Cla, m) 
or 
27? = Cho O) ae Ce, I) ae 2° 9 ae (0, 9) aR * 2° se (Шш) (2) 


This result says that the sum of all the entries in row п of Pascal’s triangle equals 
2”. Actually, Equation (2) can be proved on its own using a combinatorial proof. 
The number C(n, k), the number of ways to select k items from a set of л items, can 
be thought of as the number of k-element subsets of an n-element set. The right 
side of Equation (2) therefore represents the total number of all the subsets (of all 
sizes) of an n-element set. But we already know that the number of such subsets 
is 2". ө 
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SECTION 4.5 REVIEW 


TECHNIQUE 


Ù Use the binomial theorem to expand a binomial. 
QJ Use the binomial theorem to find a particular term 
in the expansion of a binomial. 


EXERCISES 4.5 


1. Expand the expression using the binomial theorem. 


a. (a + by 
b. (x + у) 
c. (а + 2) 
d. (a — 4)4 


2. Expand the expression using the binomial theorem. 


a. (2x + 3y)? 
b. (Зх — 1) 

с. (2р — 3q) 
d. (3x + %)5 


MAIN IDEAS 


* The binomial theorem provides a formula for ex- 
panding a binomial without multiplying it out. 

* The coefficients of a binomial raised to a nonnega- 
tive integer power are combinations of п items as 
laid out in row n of Pascal’s triangle. 


In Exercises 3—10, find the indicated term in the expansion. 


3. The fourth term in (a + Б)! 
4. The seventh term in (x — y)? 
5. The sixth term in (2x — 3? 
6. The fifth term in (За + 2b)’ 
7. The last term in (x — 3y)* 

8. The last term in (ab + 3х)° 
9. The third term in (4x — 2y)° 
10. The fourth term in (3x — 1⁄2) 


11. Use the binomial theorem (more than once) to expand (a + b + c). 


12. Expand (1 + 0.1)? in order to compute (1.1)°. 


13. What is the coefficient of x*y* in the expansion of (2x — y + 5)? 
14. What is the coefficient of x°y7z? in the expansion of (x + y + 2z)?? 


15. Prove that 


C(n + 2,r) = C(n,r) + 2C(n,r - 1) + С(п, г -2for2-rzn 


(Hint: Use Pascal's formula.) 
16. Prove that 


C(k, k) + C(k + LK) - + С(п, F) = Cn + Lk D) for0 kn 


(Hint: Use induction on л for a fixed, arbitrary Ё, as well as Pascal's formula.) 


20. 


21 


22. 


23. 
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. Use the binomial theorem to prove that 


C(n, 0) — C(n, 1) + C(n, 2) — +++ + (= 1)"С(л,п) = 0 


. Use the binomial theorem to prove that 


C(n, 0) + C(n, 1)2 + С(п, 2)22 + +++ + Сп, п)2" = 3" 


. Use the binomial theorem to prove that 


C(n, п) + C(n, n — 1)2 + C(n,n = 2)22 + +++ + C(n, 1277! + C(n, 0)2" = 3" 


Prove the result of Exercise 19 directly from Exercise 18. 


. (Requires calculus) 


a. Expand (1 + x)". 
b. Differentiate both sides of the equation from part (a) with respect to x to obtain 


п(1 + x^! = C(n, 1) + 2C(n, 2)x + 3C(n, 3)x? + `+ nC(n, nx"! 
c. Prove that 
C(n, 1) + 2C(n, 2) + 3C(n, 3) + °° +  nC(n, n) = n2"! 
d. Prove that 
C(n, 1) — 2C(n, 2) + 3C(n, 3) — 4C(n, 4) + ++ + (- 1)" „Сп, n) = 0 


(Requires calculus) 
a. Prove that 


2"+1 _ 1 
nt+ 1 


1 1 
= Cn, 0) + ^ C(n, De Cea eh C(n, n) 


n+ 1 


b. Prove that 


1 1 1 
= 2285 + — +... (—])? —— 
Lp 00-201) + 00,2) (71 — Ca, n) 


(Hint: Integrate both sides of the equation from part (a) of Exercise 21.) 


The general form of the principle of inclusion and exclusion is 


l4 U*UA,- > lal 2 I4 4j 
1=i=n ]xi«cjzn 
F > |4: N A; Al 
Ixicj«kzn 


esp y** la fes D А 


| 


(1) 
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This exercise provides an alternate to the inductive proof given in Section 4.3 of the principle of inclusion 
and exclusion. Equation (1) is correct if for any x in (4; U* * © UA,}, x is counted exactly once by the right 
side of the equation. 


a. Suppose x is an element of k ofthe n sets {A,, ..., 4,3. Let B equal the set of 4;s of which x is a member. 
Then x is counted once in the right side of (1) for each of the intersections that include only sets from 
B. Show that in the intersections of m sets from (4,, ..., 4,3, 1 =m = k, there are C(k, m) that include 
only sets from B. 

b. Using the result of part (a), write a sum of terms that represents the number of times x is counted in the 
right side of (1). 

c. Use Exercise 17 to show that this sum of terms equals 1. 

24. Pascal's triangle has many interesting properties. If you follow diagonal paths through the triangle and 
sum the values on each path, the result is the Fibonacci sequence (see (a) below). The values along the 
diagonals are easier to understand if the rows of the triangle are written one row per line with each row 
beginning one position to the right ofthe previous row (see (b) below). Then the diagonals are the table 


columns. 
011213 14 [516.1 7 |8 
0 |1 
1 111 
2 1|2 {|1 
3 113 1 
| 4 1|4 |6 |4 |1 
Pe 5 5 | 10 | 10 
222 5 6 1. |6 |15 
x 7 1 7 
3 ae 5 1 8 І 
: 1]1]2|3]|51|389]13]21|34 
(а) (b) 


a. Prove that the values in column л, п = 2, read from bottom to top, are given by the expression 
n/2 (n—1)/2 
У С(п — k, k) ifnisevenandby У C(n — k, k) if n is odd. 
k=0 k=0 


b. Prove that the sum of the values in column n, п = 0, equals F(n + 1). 


SECTION 4.6 | PROBABILITY 


Introduction to Finite Probability 


Probability is an extension of the combinatorics (counting) ideas we have already 
been using. If some action can produce Y different outcomes and X of those Y 
outcomes are of special interest, we may want to know how likely it is that one of 
the X outcomes will occur. Probability had its beginnings in gaming or gambling, 
and we have pretty good intuition for simple cases. 
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| EXAMPLEGS | What is the probability of 


a. getting "heads" when a coin is tossed? 

b. getting a 3 with a roll of a die? 

c. drawing either the ace of clubs or the queen of diamonds from a standard 
deck of cards? 


For (a), tossing a coin results in 2 different possible results, heads or tails, but only 
the heads result is of interest. The probability of getting heads is *one out of two" 
or 1/2. For (b), the roll of a standard 6-sided die has 6 possible results—any of the 
numbers 1, 2, 3, 4, 5, or 6 might come up. The 3 is exactly one of these possibili- 
ties, so the probability of rolling a 3 is 1/6. For (c), because a standard card deck 
contains 52 cards, the action of drawing one card has 52 possible results. Two of 
these are successful results (ace of clubs or queen of diamonds), so the probability 
of succeeding in this task is 2/52 or 1/26. e 


On closer examination, our intuitive answers rely on certain assumptions. We 
assume that the coin is a “balanced” coin, equally likely to come up heads or tails. 
We assume that the die is not “loaded” and that the deck of cards is not “stacked.” 
In other words, we assume that each of the possible outcomes is equally likely; 
otherwise our probability of success would be different. (Loaded dice and stacked 
decks are the motivation for many Western movie gunfights.) 

The set of all possible outcomes of an action is called the sample space S of 
the action. Any subset of the sample space is called an event. If S is a finite set 
of equally likely outcomes, then the probability P(E) of event Е is defined to be 


- El 
P(E) — isl 


(Remember that | A | denotes the size of a finite set A.) 


ЭҮП Two coins, A and B, are tossed at the same time; each coin is a fair coin, equal- 
ly likely to come up heads (H) or tails (T). The sample space of this action is 
S = (HH, HT, TH, TT}. Here HT denotes that А comes up heads and В comes up 
tails, while TH denotes that 4 comes up tails and В comes up heads; these are two 
different outcomes. Let the event E be the set (HHj. The probability of E—that is, 
the probability of having both coins come up heads^—is 


PE) = |El _ КНН} US 
S e e ОТ Лл ә 


PRACTICE 42 | Find the probability of drawing ап ace from a standard deck of cards. = 


An event, while technically a set, is also used to describe the action whose outcomes are the members of 
this set. Here the event ЕЁ is the set {HH}, but we also speak of the event as the action of tossing two coins 
and having both come up heads. 
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Because events are sets, they can be combined using set operations. Suppose 
that E, and E, are two events from the same sample space 5. If we are interested 
in the outcomes in either ЕЁ, or E; or both, this will be the event E, U E;. If we are 
interested in the outcomes in both E, and ЕЁ», this will be the event E, N Z}. And if 
we are interested in all the outcomes that are not in £}, this will be Ej". 


Employees from testing, development, and marketing participate in a drawing in 
which one employee name is chosen. There are 5 employees in testing (2 men and 
3 women), 23 in development (16 men and 7 women), and 14 in marketing (6 men 
and 8 women). 

The sample space has 42 names, that is, | S| = 42. Let W be the event that a 
name drawn belongs to a woman. Then | W| = 3 + 7 + 8 = 18. Therefore, the 
probability P(W) that the name drawn belongs to a woman is | W|/| $| = 18/42 = 
3/7. Let M be the event that a name drawn belongs to someone from marketing. Then 
| М| = 14. Thus, the probability P(M) that the name drawn belongs to someone from 
marketing is | M|/|S| = 14/42 = 1/3. The event that the name drawn belongs to a 
woman in marketing is W N M. Because there are 8 women in marketing, | VM M | 
= 8, and the probability P(W N M) that the name drawn belongs to а woman from 
marketing is 8/42 — 4/21. Finally, the event that a name drawn belongs to either a 
woman ог to someone from marketing is WU M, and| WU M|= 3 +7 + 14 = 24. 
Hence P(W U M) = 24/42 = 4/7. e 


PRACTICE 43 | In Example 67, what is the probability of drawing the name of a male from development? 


Of drawing a name from testing or development? 


EXAMPLE 68 


Probability involves finding the size of sets, either of the sample space or of 
the event of interest. Therefore many of our previous counting techniques come 
into play. We may need to use the addition or multiplication principles, the prin- 
ciple of inclusion and exclusion, or the formula for the number of combinations of 
r things from n objects. (In Example 67, we could have found the size of the union 
of the women and the marketing people by using the principle of inclusion and 
exclusion: [WU M| = W| - |M| -|W'YM| = 18 + 14 - 8 = 24) 


Ata party, each card in a standard deck is torn in half and both halves are placed in 
a box. Two guests each draw a half-card from the box. What is the probability that 
they draw two halves of the same card? 

There are 52 - 2 — 104 half-cards in the box. The size of the sample space is 
the number of ways to pick two objects from 104, that is, | S| = C(104,2). Let H be 
the event that the halves match. There are 52 ways that the halves can match, so 
| H| = 52. The probability is therefore 


PH) |H| 52 52 52 52 
IS| | C(1042) 104! 104-103 52-103 103 
21102! 2 ө 


= 0.0097 


P(E, U E) = 
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Using our definition 


_ |El 
PE) = s 


we can make some observations about probability for any events E, and Е, from a 
sample space S of equally likely outcomes (Table 4.3). These observations are also 
called probability axioms. 


TABLE 4.3 
Observation Justification 
1. | Ox P(E) = 1 E, C S 500 x |Ej and Е; = |S] 


2. | The probability of an impossibility is O E, = Ø so|E,| = 0 


3. | The probability of a “sure thing” is 1 E, = S ѕо|Е,| = |S] 


4. | P(E) = 1 - P(E) IE] = IS] — [Ei] 


5. | P(E, UE,) = P(E,) + P(E.) - P(E, N E, | See following discussion 


If E, and E, are disjoint events, then 


Follows from observation 5 
P(E, U E3) = P(E1) + P(E;) 


Observation 5 requires a bit of explanation. From the principle of inclusion 
and exclusion, 


|Z, ОР, = [Е | + Е, – (Е, ПР, 


= P(E,) + P(E) — P(E, E; 


If an action produces outcomes that are not all equally likely, one way to handle 
the situation is by introducing an appropriate number of repetitions of some of the 


So 
JEU E| Eb 154 2 |Е| 161 |B, EI 
| 5| 15| IS] Isl 15| 
Probability Distributions 
outcomes. 
EXAMPLE 69 


Suppose a fair die is rolled. There are 6 possible outcomes, so |S| = 6. Let T be the 
event of rolling a 3; there is only one successful outcome, so |7 | = 1. Therefore the 
probability of rolling a 3, just as in Example 65b, is 


os e aig Im 
OT m EUH 


The probability of rolling a 4 is the same. 
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Now suppose the die is loaded so that a 4 comes up three times more often 
than a 1, 2, 3, 5, or 6. We can describe the set of outcomes for the loaded die by 


le 2 5 4, 4,, 45, 95 6} 
The size of the sample space is now |< | = 8 and the probability of rolling a 3 is now 


ЕЕ 
(D) = Top = g = 0125 


This is a lower probability than before because the loaded die is not as likely to 
come up with a 3. However, if F is the event of rolling a 4, then there are three 
successful outcomes from the sample space. Therefore, the probability of rolling 
a 4 is now 


T 
CONG UE 


which is higher than before because the loaded die is more likely to come up 
with a 4. ө 


Another way to look at problems where not all outcomes are equally likely is 
to assign a probability distribution to the sample space. Rather than artificially 
enlarging the sample space by creating duplicates of outcomes that occur more 
frequently, simply consider each distinct outcome in the original sample space as 
an event and assign it a probability. If there are k different outcomes in the sam- 
ple space and each outcome x; is assigned a probability p(x;), the following rules 


apply: 
1. 0=р(х) = 1 


k 
2. LP) = 1 


The first equation must hold because any probability value must fall within this 
range. The second equation must hold from observation 6 in Table 4.3; the union 
of all of these k disjoint outcomes is the sample space S, and the probability of S 
is 1. 

Now consider some event E C S. The probability of event E is then given by 


P(E) = >, р(х) (1) 


x,EE 


In other words, we can add up all the probabilities for the individual outcomes 
in Е. This also follows from observation 6 in Table 4.3; Е is the union of all its 
distinct outcomes. The definition of P(E) as |E|/|S| when the outcomes are equally 
likely is a special case of this definition where р(х) = 1/|S| for each x; in Е. 
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| EXAMPLE 70 | For the loaded die of Example 69, the appropriate probability distribution is 


СЕТЕ || SE 
р(х) | 1/8 | 1/8 | 1/8 | 3/8 | 18 | 18 


As in Example 69, the probability of rolling a 3 is 1/8 and the probability of rolling 
a 4 is 3/8. Let E be the event that a 2 or a 4 is rolled. These are disjoint outcomes, 
so by Equation (1), P(E) = p(2) + p(4) = 1/8 + 3/8 = 4/8 = 0.5. ө 


PRACTICE 44 | The sample space 5 = (a, b, с}. Assume p(a) = 0.2 and p(b) = 0.3. 


a. What is p(c)? 
b. What is the probability of getting an outcome of a or c? a 


Conditional Probability 


A fair coin is tossed twice. The sample space is 
{HH, HT, TH, TT} 


The probability of getting two tails is clearly 1/4, but let us belabor this conclusion. 
Let E, be the event that the first toss results in T, so E, = (TH, TT}; let Е, be the 
event that the second toss results in 7; so E; — (HT, TTj. Then getting two tails is 
the event E, ПЕ, = {TT}. The desired probability is 


IE QE 1 
P(two tails) — ds = 4 


Suppose, however, that we already know that the first toss resulted in 7. Does 
this fact change the probability of getting two tails? Surely so, because we already 
have half of what we want. The outcome of interest is still Ej OÑ Е, = {TT}, but 
the sample space is now limited to that meeting the condition that ЕЁ, has indeed 
occurred. That is, because we are assuming that event Е has occurred, our sample 
space now becomes ЕЁ itself, namely (TH, TT}. Let E; | Е, denote the event that E; 
occurs given that E, has already occurred. Then 


IEEE d 
P(EjE)) = EP = 
1 


In terms of probabilities, P(E, N Е) = 1/4, Р(Е,) = 2/4, and 


Р(Е N E) _ 1/4 
PŒ) 2/4 


== Р(Е, | E) 


This suggests the following definition. 
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DEFINITION CONDITIONAL PROBABILITY 
Given events E, and £;, the conditional probability of E; given Ej, P(E, | Ej), is 


PEN Ey) 


P(E, | Ej) = P(E) 


In a drug study of a group of patients, 17% responded positively to compound 4, 
34% responded positively to compound B, and 8% responded positively to both. 
The probability that a patient responded positively to compound B given that he or 
she responded positively to A is 


P(ANB 
ша т 
P(A) 0.17 e 


P(B|A) = 


PRACTICE 45 | In the problem of tossing a fair coin twice, what is the probability of getting two heads 
given that at least one of the tosses results in heads? (Hint: Let E, be the event of two heads and E, be 
the event of at least one head.) 


If P(E, |E) = P(E)), then E; is just as likely to happen whether Е, happens or 
not. In this case £, and Е, are said to be independent events and we have 


P(E, E) 
Р(Р,|Е)) = PE) - = Р(Е,) 
or 
P(E, N E) = P(E) : PŒ) (2) 


Equation (2) can be extended to any finite number of independent events and can 
also be used to test whether events are independent. 


| EXANMPLE?2 | The events of tossing a coin and coming up heads one time (E) and heads the next 


(E) are independent events because 


P(E, N E) = 1/4 
Р(Е)) = 1/2, P(E,) = 1/2 


so Equation (2) is satisfied. If we toss a fair coin repeatedly and the coin lands 4 or 
5 or 6 times in a row heads up, we may feel that tails is “due to come up,” in other 
words, that at the next toss the coin has a better than 50% probability of coming up 
tails, but in fact that’s not the case. It is true that the probability of getting longer 
and longer runs of heads decreases from 1/4 (two heads) to 1/8 (three heads) to 
1/16 (four heads), and so forth, yet on each successive toss, the probability of get- 
ting a head is still 1/2. ө 


REMINDER 


P(E,U E;) = P(E,)+P(E2) 
only when E, and E, are 
disjoint events. P(E, N E>) 
= P(E4) - P(E2) only when 
E, and E, are independent 
events. 
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At this point we have what might be called an addition rule and a multiplica- 
tion rule for probability, loosely related to the addition principle and the multi- 
plication principle for counting. If we want the probability of event E, or event 
E», that is, P(E, U E;), we can add the respective probabilities—but only if the 
events are disjoint. If we want the probability of event E, and event Е, that is, 
P(E, П Ej), we can multiply the respective probabilities—but only if the events 
are independent. 


Bayes' Theorem 


Bayes' theorem allows us to squeeze an additional probability out of a certain set 
of known probabilities. Before we state the theorem, let's look at an example. 


A grocery store receives an order from Supplier A that consists of 57% lettuce and 
43% spinach. It also receives an order from Supplier В that consists of 39% lettuce 
and 61% spinach. Before the orders are unloaded, a clerk randomly selects an order 
box and pulls out a package of produce to show to the produce manager. Later the 
grocer is notified that spinach from Supplier B is contaminated. Ifthe clerk pulled out 
a package of spinach, what 15 the probability that it came from Supplier В? 

Let Е, be the event that the package came from Supplier A and Е, be the event 
that the package came from Supplier B. Let F be the event that the package was 
spinach. The sample space looks something like 


ТАШКА? AL ..., AST; AS2,AS3, ..., BET, BE2;BES; ..., BST, BS2, BS3 ...} 


E, and Е, are disjoint events and E, UE, = S. 


We know that 
P(E,) = 1/2 (equally likely that either box was chosen) 
Р(Е,) = 1/2 


P(F | Ej) = 43/100 (percentage of spinach in Supplier A order) 
P(F | E;) = 61/100 (percentage of spinach in Supplier B order) 


and we want 


P(E, |F) (probability the package came from Supplier B 
given that it was spinach) 


Although we know that the probability of the clerk choosing the Supplier B order 
box is 0.5, we suspect that P(E, | ЕЁ) is greater than 0.5 because the item was spinach 
and Supplier B has a higher percentage of spinach than Supplier A. It turns out that 
we can compute this probability by sufficient fiddling with the probabilities we do 
have. 

From the definition of conditional probability, 


P(FM E) 


P(E,|F) = PE) 


or P(F N E>) = Р(Е,|Р)Р(Е) 
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Р(Е, ПЕ) 


АЕ) = P(E) 


or P(E, ПЕ) = Р(Е|Е,)Р(Е,) 


Because FM E, = Е, ПЕ, Р(ЕП Ej) = P(E, П Е) and therefore 
Р(ЕПЕ,) = Р(Е|Е,)Р(Е,) [and a similar equation for P(F N Е))] 
апа 


PENE)  P(F|E)P(E) 
Р(Р) Р(Р) 


Р(Е›|Р) = 


Е is another event in (that is, subset of) S, and 
F-—FüS-FO(EjUE,-(FOE)U( On E, 
so F is the union of disjoint events, and 


Р(Е) = P(FYE) + Р(ЕПЕ,) = FF | E)P(E)* PF | Ej)P(E;) 
43 1, 61 1 43-61 104 
1106 w2 у m 


Finally, 


P(F|E)P(E, _ (61/100)(1/2) _ 61/200 


P(E,|F) = 
Vn P(F) 104/200 104/200 


= 61/104 = 0.587 


As we suspected, the probability that the spinach came from the contaminated 
batch is greater than 0.5. ө 


The general statement of Bayes’ theorem (see Exercise 89) follows. 


€ THEOREM BAYES’ THEOREM 


Let E, ... , E, be disjoint events from a sample space S whose union equals S. If 
Е is another event from S, then the probability of event E; 1 = i =n, given event 
F,is 
P(F| E)IXE) 
P(E|F) = ———À——À— 


> Р(Е|ЕдР(Е;) 
с=1 


PRACTICE 46 | ш Example 73, what is the probability that the package came from Supplier A if it was 
lettuce? 
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Expected Value 


A student takes three tests; the set of grades received is S = (gi, 22, g3}. The 
student computes the average test grade A(g) by 


gi t+ Bo tg 


A(g) = 3 


This assumes that the three tests are equally weighted. If we write 


1 1 1 1 
Ag) = ,G + Bg + рз) = ТӨ | e(1) H s(1) 


we can also see that A(g) is the sum of the product of each test grade times the 
amount of its contribution to the total grade. If the last test counts twice as much 
as the other two, then the weighted average" grade would be 


1 1 1 2 
A(g) = 1G tg + 2g3) = (+) | e(1) | Ө 


If we consider S as the sample space and assign the probability distribution 


Xi 91 92 9з 
р(х) 1/4 1/4 2/4 


Шеп 


3 
A(g) = > 8p (g) (3) 


We want to take the weighted average idea of Equation (3) and make it a 
bit more general. For the test grades, the sample space S consisted of numerical 
values. If the values in the sample space are not numerical, we may find a func- 
tion X that associates a numerical value (a real number) with each element in the 
sample space. Such a function is called a random variable.? Given a sample space 
S = (xj X2, ..., Х„} to which a random variable X and a probability distribution 
p have been assigned, the expected value or weighted average of the random 
variable is 


E(X) = 2 Херб) 


>The term “random variable” is a misnomer because X is neither random nor a variable—it's a function that 
associates with each value x; in S a real number X(x;) 
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| EXANPLE74 | A fair coin is tossed three times. The sample space SS is 


S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 


Let the random variable X assign to each outcome in S the number of heads in that 
outcome, which will be an integer value between 0 and 3. Because this is a fair 
coin, each member of S occurs with equal probability, which determines the prob- 
ability distribution. Hence we can write 


x; | HHH | HHT | HTH | HTT | THH | THT | ттн | TIT 


Хх) 3 2 2 1 2 1 1 0 
px) | 1/8 | 1/8 | 1/8 | 1/8 | 1/8 | 1/8 | 1/8 | 1/8 


The expected value of X, that is, the expected number of heads in three tosses, is 


8 
EX) = Y хоро) 


= 3(1/8) + 2(1/8) + 2(1/8) + 1(1/8) + 2(1/8) + 1(1/8) + 1(1/8) + 0(1/8) 
= 12(1/8) = 32 = 1.5 


This seems intuitively correct; because the coin is fair, we would expect to get 
heads about half the time, or 1.5 times out of 3. (Of course, we really can’t get half 
a head, but if we get heads about half the time, then we expect to get 4 heads out of 
8 tosses, or 64 heads out of 128 tosses, and so forth.) Note how the expected value 
is a “predictor” of future outcomes. 

Now suppose the coin is weighted in such a way that it is three times more 
likely to come up heads. In other words, the probability of a head is 3/4 while the 
probability of a tail is 1/4. The elements in the sample space are no longer equally 
likely, but we can compute their probability distribution. We know that the succes- 
sive tosses are independent events, so the probability of each outcome in S can be 
obtained by multiplying the probability of each toss. The probability of HTT, for 


example, is 
СЮ 
4/\4/\4/ 6 


x; HHH | HHT | HTH | HTT | THH | THT | TTH | TIT 
X(x) 3 2 2 1 2 1 1 0 
p(x) | 27/64 | 9/64 | 9/64 | 3/64 | 9/64 | 3/64 | 3/64 | 1/64 


The new table looks like 


and the new expected value for X is 
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8 
E(X) = р) 


= 3(27/64) + 2(9/64) + 2(9/64) + 1(3/64) + 2(9/64) + 1(3/64) + 1(3/64) + 0(1/64) 
= 144/64 = 2.25 


The expected number of heads is now higher because the coin is much more likely 
than before to come up heads. ө 


PRACTICE 47 | Given the following table for a sample space, a random variable X, and a probability 
distribution p, find the expected value for X. 


рх) | 2/8 | 3/8 | 2/8 | 1/8 


Expected value has a property called linearity. If X, and X, are two random 
variables on the same sample space S and a and b are real numbers, then 


EX, + 0) = E(X)) + EW) (4) 
E(aX, + b) = aE(X)) + b (5) 


Keep in mind that for any x; in S, Х(х,) and X;(xj) are both numerical values that 
can be added, so the random variable Х + X; just means that (X, + X5)(xj)) = 
Хү(х) + Xx). Similarly, if a and b are real numbers, then aX, + b just means 
that (aX, + b)(xj) = аХ\|(х) + b. Then Equation (4) is true because 


n n 


EX + X) = У(Х + Хдр) = 2,066) + Х(х))р(х) 


ї=1 i-l 


= 2, (дрн) + >, Bapa) = EX) + EQG) 


і= 1 


Equation (4) extends to any finite sum of random variables. Equation (5) is true 


because (note that > р(х) = 1) 
і= 1 


j= 


E(aX, + b) 


2, (aX, + byx)pG;) = 2, (aX) + Б)р(х) 


= a> X(x)p(x) + b» р(х) = aEQG) + БО) 
j=] i=l 


j= 


Use of linearity can sometimes simplify the calculation of an expected value (see 
Exercise 94). 
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Binomial Distributions 


Consider an event that has only two possible outcomes, success or failure. The 
probability of success is p and the probability of failure д = 1 — p. Our well- 
known coin toss would fit this description because there are only two possible 
outcomes. Such an event is called a Bernoulli trial or Bernoulli experiment 
after the eighteenth-century Swiss mathematician Jacob (or James) Bernoulli. 
However, a single Bernoulli trial is not of much interest; instead we want to talk 
about a finite series of Bernoulli trials, each of which has the same probability p of 
success (hence the same probability g = 1 — p of failure). Because the probability 
of success does not vary, the various trials are mutually independent events. 

Without specifically describing the sample space S, we can define a random 
variable X as the number of successful outcomes that occur in the n trials. X can 
range from 0 (no successful outcomes) to и (all successful outcomes). We can de- 
termine the probability of k successful outcomes out of л trials as follows: 

Assume the k successes (and (n — К) failures) occur in some specific pattern. 
For example if k = 3 and n = 4, one pattern would look like S-F-S-S. The prob- 
ability of a success 1s p, the probability of a failure 15 q. Because the л trials are 
mutually independent, we can multiply their probabilities, giving pq" *. But this 
is only one pattern of k successes; how many patterns are there? Exactly the num- 
ber of ways we can select k out of n items, C(n, k). So the probability of k success- 
ful outcomes out of л trials is 


Р(®) = C(n, Юр'д" * 


We have therefore determined a probability distribution for the various values of 
X, as shown in this table: 


Pk) C(n, 0) р" | C(n,1)pq"* |C(n,2)p?q^? C(n,k\p*q” * C(n,n)p^q? 


Look at the values in this probability distribution. They are the terms in the expan- 
sion of (¢ + p)". Hence this is called a binomial distribution. 

Notice that because g + p = 1, the sum of the probability distribution terms 
equals 1" = 1. This makes sense because the various values of X are all disjoint so 
the probability of their union is the sum of their individual probabilities. But the 
union covers all possible outcomes, so its probability is 1. 


A fair coin is tossed three times, with heads being considered a success, tails being 
considered a failure. Here n = 3 and p = q = 1/2. The binomial distribution is 


k 0 1 2 3 


P(k) | (1/2)? = 1/8 | 3(1/2)(1/2)2 = 3/8 | 3(1/2)2(1/2) = 3/8 | (1/2? = 1/8 


Note that this table contains the same information as the table in Example 74, 
except that here we don’t care about the order in which the heads occur. 
To compute E(X), 


E(X) = 0(1/8) + 1(3/8) + 2(3/8) + 3(1/8) = 12/8 = 1.5 


again agreeing with the result in Example 74. ө 
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PRACTICE 48 | From the expected value in Example 75, we “expect” to get 100 heads if we toss a fair coin 
200 times. Find the actual probability of getting 100 heads out of 200 trials. 


Average Case Analysis of Algorithms 


Previously, we have done primarily worst-case analysis of algorithms (Section 3.3). 
Expected value may help give an average case analysis of an algorithm, i.e., tell 
us the expected "average" amount of work performed by an algorithm. As in any 
algorithm analysis, the first step is to identify a suitable “unit of work” based on 
the nature of the algorithm. Then let the sample space S be the set of all possible 
inputs to the algorithm. We'll assume that S is finite; while there may be an infinite 
number of distinct input values, we can group together those with the same work 
unit characteristics. Let the random variable X assign to each member of S the 
number of work units required to execute the algorithm on that input. And let p 
be a probability distribution on S (here is where we make assumptions about what 
constitutes "average" input). Then 


EX) = > Хо)р(х) 
i=l 
gives the expected number of work units. 


Consider the sequential search algorithm (Section 3.3). The work unit is the num- 
ber of comparisons of a target element x against the л elements in a list. Assume 
that the target element is in the list and is equally likely to be any of the п elements 
of the list (see Exercise 35 of Section 3.3). This assumption gives the table 


Xi L, Lo jm 
Х(х) 1 2 п 
р(х) 1/п 1/п pee 1/n 


Then 


BX) = Y Xp) 


а (All | еа Ж i : Поч 1) me il 
г 3 Яг шет 2 == 


The average number of comparisons to find a target in the list, with a uniform 
probability distribution, is a little more than half the length of the list. e 
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SECTION 4.6 REVIEW 


TECHNIQUES 


W) Compute the probability of an event when all out- 


comes оЁ ап action are equally likely. 

Compute the probability of an event when a prob- 
ability distribution has been assigned to the sample 
space. 


W Compute the conditional probability of an event 


given that another event has already occurred. 
Determine whether two events are independent. 
Given a sample space with a random variable x and 
a probability distribution, compute the expected 
value E(X). 


MAIN IDEAS 


An event is a subset of the set of all possible out- 
comes of some action. 

For equally likely outcomes, the probability of an 
event is the ratio of the number of outcomes in the 
event to the number of all possible outcomes. 


EXERCISES 4.6 


In the conditional probability of event Е, given that 
event E, has already taken place, the sample space 
is reduced to Е. 

Events Е; and Е, are independent if and only if the 
conditional probability of E; given Æ; is the same 
as the probability of £}. 

In the simplest form of Bayes' theorem, the prob- 
ability of event А given event В can be computed 
from the probability of A, the probability of B, and 
the probability of B given A. 

Given a sample space to which a random variable 
and a probability distribution have been assigned, 
the expected value of the random variable is a pre- 
dictor of its future value. 

An average case analysis of an algorithm is the 
expected value of work units over the sample 
space of all inputs; the probability distribution re- 
flects the assumptions being made about "average" 
input. 


Exercises 1—6 concern three coins tossed at the same time, each equally likely to come up heads or tails. 


In 


. What is the size of the sample space? 


. What is the probability of getting all tails? 


xercises 7—14, a pair of fair dice is rolled. 


. What is the size of the sample space? 


CON t ANRWN н 


= = 
—- о 


. What is the probability of getting 1 head and 2 tails? 


. What is the probability that no coin comes up heads? 


. What is the probability of getting all tails or all heads? 
. What is the probability of getting all tails and all heads? 


. What is the probability of getting “snake eyes" (two 15)? 

. What is the probability of getting doubles (the same number on each die)? 
. What is the probability of getting a 1 on at least one die? 

. What is the probability of getting a total of 7 on the two dice? 


12. What is the probability of getting two consecutive values, such as 3—4, on the two dice? 


13. What is the probability of getting a total on the two dice greater than 10? 


14. What is the probability of getting a total on the two dice that is an odd number? 
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Exercises 15—18 concern 3 people participating in a race, with each participant equally likely to either finish 
the race or drop out of the race. 

15. What is the size of the sample space? 

16. What is the probability of exactly 1 participant finishing the race? 

17. What is the probability of no one finishing the race? 

18. What is the probability of at least 2 of the participants finishing the race? 
Exercises 19—24 concern a single card drawn from a standard 52-card deck. A standard deck has 13 cards from 
each of 4 suits (clubs, diamonds, hearts, spades). The 13 cards have face value 2 through 10, jack, queen, king, 
or ace. Each face value is a “kind” of card. The jack, queen, and king are “face cards.” 

19. What is the probability of drawing a diamond? 

20. What is the probability of drawing a queen? 

21. What is the probability of drawing the queen of diamonds? 

22. What is the probability of drawing a queen or a diamond? 

23. What is the probability of drawing a black card? 

24. What is the probability of drawing a card with a value less than 4 (aces count low)? 
Exercises 25—36 concern 2-card hands from a standard 52-card deck. A standard deck has 13 cards from each of 
4 suits (clubs, diamonds, hearts, spades). The 13 cards have face value 2 through 10, jack, queen, king, or ace. 
Each face value is a “kind” of card. The jack, queen, and king are “асе cards.” 

25. What is the size of the sample space? 

26. What is the probability that both cards are the same suit? 

27. What is the probability that neither card is a spade? 

28. What is the probability that both cards are spades? 

29. What is the probability that exactly 1 card is a spade? 

30. What is the probability that at least 1 card is a spade? 

31. What is the probability that both cards are face cards? 

32. What is the probability that exactly 1 card is a face card? 

33. What is the probability that both cards are spade face cards? 

34. What is the probability that both cards are either face cards or spades? 

35. How does the answer to Exercise 30 relate to the answers for Exercises 28 and 29? 

36. How does the answer to Exercise 30 relate to the answer for Exercise 27? 


Exercises 37—40 concern possible games in the Hoosier lottery, the Indiana state lottery. 


37. In the Daily 3 game, three numbers between 0 and 9 will be drawn in succession (repetitions allowed). 
The player marks three numbers on a game card and has a choice of how to play, straight (the player’s 
numbers will match the three numbers drawn in exact order) or box (the player’s numbers will match the 
three drawn in any order). 


a. What is the size of the sample space? 

b. What is the probability of a straight? 

c. What is the probability of a box if 3 distinct numbers are drawn? 

d. What is the probability of a box if 2 of the numbers drawn are the same? 


38. 


39. 


40. 
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In the Daily4 game, four numbers between 0 and 9 will be drawn in succession (repetitions allowed). The 
player marks four numbers on a game card and has a choice of how to play, straight (the player’s numbers 
will match the four numbers drawn in exact order) or box (the player’s numbers will match the four drawn 
in any order). 


. What is the size of the sample space? 

. What is the probability of a straight? 

. What is the probability of a box if four distinct numbers are drawn? 

. What is the probability of a box if two of the numbers drawn are the same? 
. What is the probability of a box if two distinct pairs of numbers are drawn? 


mwo aoao oro 


What is the probability of a box if three of the numbers drawn are the same? 

In the Cash5 game, five different numbers between 1 and 39 will be drawn in succession. The player 
marks five different numbers between 1 and 39 on a game card. 

a. What is the size of the sample space? 

b. What is the probability of matching all five numbers in any order? 

c. What is the probability of matching exactly four of the five numbers in any order? 

d. What is the probability of matching exactly three of the five numbers in any order? 

In the Powerball game, five different numbers between 1 and 59 will be drawn in succession, and then one 


number (the Powerball number) between 1 and 35 will be drawn. The player marks five different numbers 
between 1 and 59 and one number between 1 and 35 on a game card. 


a. What is the size of the sample space? 
b. What is the probability of matching all five numbers in any order plus matching the Powerball number? 
c. What is the probability of matching none of the five numbers but matching the Powerball number? 


Exercises 41—50 concern 5-card poker hands from a standard 52-card deck. A standard deck has 13 cards from 
each of 4 suits (clubs, diamonds, hearts, spades). The 13 cards have face value 2 through 10, jack, queen, king, 
or ace. Each face value is a “kind” of card. The jack, queen, and king are “face cards.” (See Exercises 33—41 
of Section 4.4. for the definitions of terms.) 


41. 
42. 
43. 
44. 
45. 
46. 
47. 
48. 
49. 
50. 


What is the probability of a royal straight flush? 

What is the probability of a straight flush that is not a royal straight flush? 

What is the probability of four of a kind? 

What is the probability of a full house? 

What is the probability of a flush that is not a straight flush or a royal straight flush? 
What is the probability of a straight that is not a straight flush or a royal straight flush? 
What is the probability of three of a kind? 

What is the probability of two pairs? 

What is the probability of a pair? 

Explain why, in poker, 

a. a straight flush beats a full house. 

b. four of a kind beats a straight. 
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Exercises 51—56 relate to the well-known “birthday problem.” Consider a room containing n persons, each of 
whom has a birthday equally likely to fall on day 1-365 of the year (ignoring leap years). 


51. 
52. 
53: 
54. 
DD: 
56. 


What is the size of the sample space of all possible assignments of birthdays to people? 

Let E be the event that no 2 persons in the room have the same birthday. Find an expression for P(£). 
Let B be the event that 2 ог more persons in the room share the same birthday. Find an expression for P(B). 
Let C be the event that exactly 2 persons in the room share the same birthday. Find an expression for P(C). 
Which has the larger probability, B or C, and why? 


Use a spreadsheet or calculator to determine that 23 is the minimum number of people in the room to give 
a probability of least 1/2 that 2 or more people share the same birthday. 


Exercises 57—62 refer to the gambling game of roulette. A roulette wheel contains 18 black slots and 18 red 
slots that are numbered (not sequentially) 1—36. There are two green slots numbered 0 and 00. The dealer spins 
the wheel and spins a ball into the wheel in the opposite direction. Bets are made on which slot the ball will fall 
into as the wheel slows to a stop. The ball is equally likely to fall into any slot. 


57. 
58. 
59. 
60. 
61. 
62. 
63. 


64. 


What is the size of the sample space? 

What is the probability that the ball lands in a red slot (a “red” bet)? 

What is the probability that the ball lands in a specific numbered slot (a “straight up” bet)? 
What is the probability that the ball lands on one of three specific numbers (a “street” bet)? 


What is the probability that the ball lands on one of four specific numbers (a “corner” bet)? 


What is the probability that the ball lands on an odd number three times in a row? 


A woman in Lake Havasu City, Arizona, gave birth to twin girls on September 22, 2006. This happened 
to be the same date as the woman's birthday—and the same date as her mother's birthday. What is the 
probability that three generations share the same birthday? 


The NCAA men's Division I college basketball championship (known as “March Madness") is a single- 
elimination tournament held in March of each year. The tournament begins with 68 teams. The structure 
of the tournament is as follows; the loser of each game is eliminated: 


Name of the Round Number of Teams Number of Games 
The First Four 8 4 
Round of 64 64 32 
Round of 32 32 16 
Sweet Sixteen 16 8 
Elite Eight 8 4 
Final Four 4 2 
Championship Game 2 1 


What is the probability of correctly picking the championship team? (Assume that all teams are equally 
likely to win their respective games.) 


65. 


66. 
67. 


68. 


69. 


70. 


Ti. 
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Bridge is a card game played by four players with a standard 52-card deck. All 52 cards are dealt to the 
four players at the beginning of each hand. In 1963, women playing bridge in Kankakee, Illinois, discov- 
ered that after the deal, each player held the 13 cards of a single suit. What is the probability of one player 
being dealt an entire suit? 


Referring to Exercise 65, what is the probability of each of the four players being dealt an entire suit? 


Е, and E, are events from the same sample space; P(E;) = 0.37, Р(Е,) = 0.45, and P(E, П E) = 0.14. 
a. Find the probability that E; does not occur. 

b. Find the probability that either E, or E; occurs. 

c. Find the probability that neither E, nor E, occurs. 


An 8-letter password is automatically generated from the 26 lowercase letters of the English alphabet. 
Each letter is equally likely to be used, and letters can be repeated. 


a. What is the size of the sample space? 


се 29 


. Find the probability that the word does not contain an “e. 


ПУЧ) 


. Find the probability that the word contains at least опе “е. 
. Find the probability that the word contains a single “e.” 


се, 55 


. Find the probability that the word contains both а single “h” and а single “x. 


m2 0o б © c 


се, 55 


Find the probability that the word contains either a single “h” or a single “х. 


A loaded die has the following probability distribution: 


" 1 2 3 4 5 6 
px) | 02 |0.05| 01 | 02 | 03 | 0.15 


When the die is rolled, let E; be the event that the number rolled is odd, let Е, be the event that the number 
rolled is 3 or 6, and let Е; be the event that the number rolled is 4 or more. 


a. Find Р(Е,). 

b. Find P(£;). 

c. Find Р(Е;). 

d. Find P(E, N £3). 

e. Find P(E, U £;). 

A congressional race has a Democratic, a Republican, and an Independent candidate in a district where 


past voting patterns indicate that a Democrat is twice as likely to be elected as a Republican, and that a 
Republican is four times as likely to be elected as an Independent. 


a. Find the appropriate probability distribution. 
b. What is the probability that a Democrat will be elected? 
c. What is the probability that a Republican will not be elected? 


At a certain school, 72% of the students play one or more sports. The percentage of students who play 
one or more sports and who graduate is 67%. Find the probability that a student graduates given that the 
student plays one or more sports. 
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72. On a particular manufacturing job, the probability that there will be a shortage of copper is 0.37. The prob- 
ability that there will be a shortage of both copper and aluminum is 0.28. Find the probability of a shortage 
of aluminum given a shortage of copper. 


73. In Example 71, 
а. What is the probability that a patient responds positively to compound А given that he or she responds 
positively to В? 
b. What is the probability that a patient responds positively to either compound А or compound 5? 
c. What is the probability that a patient does not respond positively to either compound? 
74. A food magazine surveys the preferences of its readers who like Asian food. The findings follow. 


47% like Thai food (E 
39% like Indian food (Ez) 
78% like Chinese food (Ез) 


23% like both Thai and Indian food 
31% like both Indian and Chinese food 
29% like both Thai and Chinese food 


a. Extend observation 5 in Table 4.3 to the case of three events, E,, E», and £3}. 
b. Find the probability that a reader likes all three types of food. 
c. Find the probability that a reader likes Chinese food given that the reader likes Indian food. 


For Exercises 75—80, a student takes a true—false quiz with four questions, each equally likely to be either T or F. 


75. What is the probability of getting exactly one question wrong? 
76. What is the probability of getting Question 1 correct? 

77. What is the probability of getting three or more questions correct? 
78. What is the probability of getting the first two questions correct? 


79. What is the probability of getting the first two questions correct given that the answer to Question 1 is 
correct? 


80. What is the probability of getting all four questions correct given that the answers to the first two questions 
are correct? 
For Exercises 81—88, a family has 3 children; boys and girls are equally likely offspring. 


81. What 15 the probability that the oldest child is a boy? 

82. What is the probability that the youngest two children are girls? 

83. What 15 the probability of 2 boys and 1 girl? 

84. What is the probability of no girls? 

85. What is the probability of at least 1 girl? 

86. What is the probability of 3 girls? 

87. What is the probability of 3 girls given that the first 2 are girls? 

88. What 15 the probability of at least 1 boy and at least 1 girl given that there is at least 1 boy? 


89. 


90. 


91. 


92. 


93. 
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Prove Bayes’ theorem. The proof parallels what was done in Example 73. Let £}, ..., E, be disjoint events 
from a sample space 5 whose union equals S. If F is another event from S, then Bayes’ theorem says that 
the probability of event E; 1 <i = п, given event F, is 


Р(Е|Е?Р(Е,) 


2, Р(Е|Е)Р(Е,) 
с=1 


P(E;|F) = 


a. Use the definition of P(E;| F) and P(F | Ej) to prove that 


F(F|E)P(E) 


P(E;|F) = Р(Е) 


b. Prove that 
P(F) = p P(F E) 
=1 
c. Use the definition of P(F | Ej) and the result from part (b) to prove that 
POF) = Y PIE) 


d. Using parts (a) and (c), prove Bayes' theorem. 


Toys for boys and for girls are donated to a benefit event by two groups. The Lakeville Do-Gooders 
donated 5 toys for boys and 7 for girls. The Southside Champions Club donated 6 toys for boys and 5 for 
girls. The master of ceremonies pulls the first toy out of a bin and it's a toy for a boy. Find the probability 
that it was donated by the Do-Gooders. 


An online pharmacy sells an over-the-counter drug, medication X, that is used for a variety of purposes. 
The pharmacy has data that say that 1896 of its customers are HIV positive, 996 of its HIV-positive cus- 
tomers buy medication X, and 3% of the customers who are not HIV positive buy medication X. Find the 
probability that a customer who buys medication X is HIV positive. The pharmacy can use this data to 
shape its marketing/advertising plan. 


Ofthe high blood pressure patients in a particular clinic, 6296 are treated with medication X, the remainder 
with medication Y. It is known that 1.496 of patients using medication X suffer from fainting spells, as do 
2.9% ofthe patients using medication Y. A patient known by the clinic to have high blood pressure suffers 
a fainting spell, but she does not remember which medication she is on. Which medication is she most 
likely to be taking? (Hint: Let E; and E;—treated with X and treated with Y, respectively—be events in 
the sample space of all patients with high blood pressure in the clinic, and let F be the event of fainting; 
find P(X | F) and P(Y|F).) 

a. А fair die is rolled once. Let the random variable X equal the value that comes up. Find the expected 

value of X, E(X). 


b. The die is now “loaded” so that a 2 comes up twice as often as any other number. Find the new expected 
value of X. 


c. Your answer to part (b) should be (greater than, less than) your answer to part (a). Explain why. 


94. 


95. 


96. 


97. 


98. 


99. 


100. 


101. 


102. 
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Two fair dice are rolled. The sample space S contains the 36 combinations of two numbers. For each mem- 
ber (r, s) of S, the random variable X(r, s) = r + s. 


a. Write a table showing the values for X and the probability of those values; instead of 36 columns each 
with probability 1/36, do a column for each distinct value of X and show the probability of that value. 


b. Find the expected value of the sum of the numbers that come up when two fair dice are rolled. 


c. Find the expected value of the sum of the numbers that come up when two fair dice are rolled. This 
time let the sample space S consist of the ordered pairs (r, s) that can appear on the two dice. Use two 
different random variables over this sample space, where Х = the value of the first component of the 
ordered pair and X, = the value of the second component of the ordered pair. Make use of the linearity 
property from Equation (4). 

At a gambling casino, a ball will be drawn from a bin containing 43 red balls, 27 green balls, and 8 blue 

balls. A player marks a game card with the color he or she believes will be picked. The prize money for 

guessing the correct color is 


Red $3.00 
Green $6.00 
Blue $10.00 


The price of the game card is $5.00. Find the expected value of the prize money. 


A directory on a computer's hard disk contains 12 files, 3 of which have viruses. If a file with a virus is 
selected, the virus is detected and another file is then selected. Find the expected number of files that must 
be selected in order to get a virus-free file. 


Bit strings are sent across a computer network in packets of length 10. The probability of a bit getting cor- 
rupted (that is, a 0 gets changed to a 1 or vice versa) is 0.01. These bit errors are independent. 


a. Find the probability that in a single packet there are no bit errors. (Hint: It's OK to call a bit error a 
"success." 


b. Find the probability that there are no more than two bit errors. 
c. Find the probability that there is at least one bit error. 


Of the items produced in a certain manufacturing facility, 5% are defective. If 8 items are chosen at ran- 
dom, find the probability that 


a. l is defective. 

b. 2 are defective. 

c. none is defective. 

d. at least 1 is defective. 
e. at most 1 is defective. 


A student has failed to study for a true—false test and guesses at every one of the 10 questions. If the pass- 
ing grade is 8 correct answers, what is the probability that the student will pass the quiz? 


A baseball player has a probability p — 0.04 of hitting a home run for each at bat. Find the minimum 
number of at bats he must take so that there is at least an 8096 probability of hitting a home run (that is, at 
least 1 home run). 


Find the average number of comparisons to search for a target element x using the sequential search al- 
gorithm under the assumption that x is equally likely to be at any of the n positions in the list or not in the 
list. 

Find the average number of comparisons to search for a target element x using the sequential search algo- 
rithm under the assumption that x is not in the list 80% of the time, but if x is in the list it is equally likely 
to be at any of the и positions. 
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TERMINOLOGY 


addition principle (p. 254) 
Bayes' theorem (p. 309) 
Bernoulli trial (p. 313) 
Bernoulli experiment (p. 313) 
binary operation (p. 229) 
binomial coefficient (p. 297) 
binomial distribution (p.313) 
binomial theorem (p. 296) 
Cantor's diagonalization method 
(p. 237) 
cardinality of a set (p. 236) 
Cartesian product (cross product) 
of sets (p. 233) 
closed set under an operation 
(p. 229) 
combination (p. 274) 
combinatorial proof (p. 296) 
combinatorics (p. 252) 
complement of a set (p. 232) 
conditional probability (p. 307) 
countable set (p. 236) 


SELF-TEST 


decision tree (p. 257) 

denumerable set (p. 236) 

disjoint sets (p. 232) 

dual of a set identity (p. 235) 

empty set (p. 224) 

equal sets (p. 223) 

event (p. 302) 

expected value (p. 310) 

finite set (p. 223) 

independent events (p. 307) 

infinite set (p. 223) 

intersection of sets (p. 231) 

lexicographical ordering (p. 281) 

linearity of expected value 
(p. 312) 

multiplication principle (p. 253) 

n factorial (p. 272) 

null set (p. 224) 

ordered pair (p. 228) 

Pascal's formula (p. 295) 

Pascal's triangle (p. 294) 


Answer the following true—false questions. 


Section 4.1 


permutation (p. 272) 

pigeonhole principle (p. 269) 

power set (p. 227) 

principle of inclusion and 
exclusion (p. 267) 

probability axioms (p. 304) 

probability distribution (p. 305) 

probability of an event (p. 302) 

probability of an event E (p. 305) 

proper subset (p. 225) 

random variable (p. 310) 

sample space (p. 302) 

set difference (p. 232) 

subset (p. 224) 

unary operation (p. 230) 

uncountable set (p. 236) 

union of sets (p. 231) 

universal set (p. 231) 

universe of discourse (p. 231) 

weighted average (p. 310) 

well-defined operation (p. 229) 


3. The addition principle requires the tasks at hand to 


1. 
2. 


The empty set is a proper subset of every set. 


If A and B are disjoint sets, then (4 — B) U (B — A) 
= АОВ. 


. If a set has и elements, then its power set has 2” 


elements. 


. If a binary operation ° on a set S is well-defined, 


then x ° y € S for all x and y in S. 


. Cantor's diagonalization method is a way to prove 


that certain sets are denumerable. 


Section 4.2 


1 


. According to the multiplication principle, the number 


of outcomes for a sequence of tasks is the product of 
the number of outcomes for each separate task. 


. The addition principle finds the total number of 


branches of a decision tree. 


have disjoint sets of outcomes. 


4. The multiplication principle says that the number of 


elements in A X B equals the number of elements in 
A times the number of elements in В. 


5. Any problem that requires a decision tree for its 


solution cannot be solved by the multiplication 
principle. 


Section 4.3 
1. The principle of inclusion and exclusion requires 


that А and В be disjoint sets in order to find the 
number of elements in A U B. 


. The principle of inclusion and exclusion applied 


to two sets says that the number of elements in 
the union minus the number of elements in the 
intersection is the sum of the number of elements 
in each set. 
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. The principle of inclusion and exclusion applies to 


the union of any number of sets as long as at least 
one of them is finite. 


. The pigeonhole principle is a way to count the 


number of elements in the union of disjoint sets, or 
“pigeonholes.” 


. The piegeonhole principle guarantees that if there 


are 8 people in a room, at least 2 must have been 
born on the same day of the week. 


Section 4.4 


1. 


2. 


A permutation is an ordered arrangement of 
objects. 

The number of combinations of r objects out of n, 
r > 1, is fewer than the number of permutations of 
r objects out of n. 


. To find the number of ways a subset of r objects can 


be selected from n objects, use the formula P(n, r). 


. The number of permutations of the letters in a word 


with three sets of repeated letters 15 71!/3. 


. The formula С( + n — 1, r) computes the number 


of combinations of r objects out of n objects where 
objects may be used repeatedly. 


Section 4.5 


1. 


Pascal’s triangle consists of rows that represent the 
number of ways to arrange r out of n objects for 
various r. 


ON THE COMPUTER 


For Exercises 1—10, write a computer program that 
produces the desired output from the given input. 


1. 


Input: Elements in a finite set 5 
Output: Elements in (S) 
Algorithm: Use recursion. 


. Input: Arithmetic expression in postfix notation 


(see Exercise 45 in Section 4.1) 
Output: Value of the expression 


. Input: Arithmetic expression in infix notation (see 


Exercise 45 in Section 4. 1) 
Output: Postfix form of the expression 
Do this problem in two ways: 


2. Pascal's formula says that an "interior" number 
in Pascal's triangle is the sum of the two numbers 
directly above it in the triangle. 


3. In the expansion of a binomial to the nth power, the 
kth term is found in row k of Pascal’s triangle. 


4. A combinatorial argument is one that is based on 
counting techniques. 


5. The coefficient of the seventh term in the expansion 
of (a + b)? is given by the expression C(12, 6). 


Section 4.6 


1. The probability of an event always falls in the range 
between 0 and 1. 


2. In a sample space with л equally likely outcomes, 
the probability distribution is 1/л for each 
outcome. 


3. To find the probability of several events occur- 
ring, multiply the probabilities of the individual 
events. 


4. A random variable is a variable whose value 
is randomly assigned using a random number 
generator. 


5. If E, and E, are disjoint events whose union 
equals the sample space, then Bayes' theorem 
allows you to derive the conditional probability 
P(E, | F) if you know P(F | Ej), PF | Ey), P(E) 
and P(E;). 


a. Assume that the input is fully parenthesized. 


Do not assume that the input is fully parenthe- 
sized, but apply the proper order of precedence 
of operators within the program (order of prece- 
dence of operators is parenthesized expressions 
first, then exponentiation, then multiplication and 
division, then addition and subtraction). 
4. Input: Values for n andr, 0 rn 
Output: Value of P(n, r) 


5. Input: Values for n and r, 0 = = и 
Output: Value of C(n, r) 


6. Input: Value for n 
Output: All values of C(n, г), 0 Er xn 
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7. Input: Value for n 
Output: All permutations of the integers 1, ... , 7 


Here is an outline for an alternative to the algorithm 
given in Section 4.4 to generate the n! permutations 
of the integers (1, ... , n}. Use a recursive algorithm. 
Once а permutation А of the integers 1, ..., k — 1 
exists, permutations of the integers 1, ... , k can be 
obtained by inserting integer k into every possible 
position in A. Every time k = n, any permutation so 
obtained can be written out. Initiate the process by 
sending 1 to an empty permutation list. For the case 
п = 3, for example, this algorithm successively 
traverses the branches of the tree and prints out the 
leaves. 


3,251 2,3,1 2, 1,3 3,1,2 135,2 12,3 


8. Input: Values for a, b, and n 
Output: Value of (a + D)" 


a. Use the binomial theorem to compute your 
result. 


b. Compute a + b and raise this value to the 
nth power; compare your answer with that of 
part (a). 


9. Input: Values fora, b, n, andr, 1 Ern 1 
Output: rth term in the expansion of (a + b)” 


10. Input: А random variable and a probability distri- 
bution for a finite sample space. 
Output: The expected value ofthe random variable. 


11. Write a program that allows the user to enter a 
value for n, 1 = n = 10, and then queries the user 
for the values needed on the right side of Equation 
(4) of Section 4.3 (the principle of inclusion and 
exclusion) and computes the value of 

|A; Uses U A, 

12. Write a program to generate a given number of 
rows of Pascal’s triangle. Do this problem in two 
ways. 

a. Use the definition of Pascal’s triangle (and 
perhaps use your answer to compute Exercise 
5 as a function). 


b. Use recursion and Pascal’s formula. 


13.Benford’s law, also called the first-digit law, 
states that in many (but not all) large numerical 
data sets, the first digit is not equally likely to be 
1 through 9. In fact, the probability that the first 
digit equals 1, p(1), is about 30%, and the prob- 
ability for each successive value of the first digit 
goes down until p(9) is about 4.6%. The formula 
for Benford’s law is 


1 
p(d) = logu( | + ;) 


Evidence based on Benford's law 1s admissible in 
court and has been used to help detect fraudulent 
data in accounting, economics, scientific research, 
and other areas. 


a. Use the given formula to compute the prob- 
ability of occurrence in the first digit of digits 
1-9. 

b. Write a program to generate the first 200 Fibo- 
nacci numbers and determine whether the first 
digits follow Benford’s law. 
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Relations, Functions, 
and Matrices 


CHAPTER OBJECTIVES 


Chapter 


After studying this chapter, you will be able to: 


Identify ordered pairs related by a binary relation. 

Test a binary relation for the reflexive, symmetric, transitive, and antisymmet- 
ric properties. 

Find the reflexive, symmetric, and transitive closures of a binary relation. 
Recognize partial orderings and construct Hasse diagrams for them. 
Recognize an equivalence relation on a set and describe how it partitions the 
set into equivalence classes. 

Draw a PERT chart from a task table. 

Find the minimum time-to-completion and a critical path in a PERT chart. 
Extend a partial ordering on a finite set to a total ordering by doing a topologi- 
cal sort. 

Understand the entity-relationship model and the relational model for an 
enterprise. 

Perform restrict, project, and join operations in a relational database. 

Create relational database queries in the languages of relational algebra, SQL, 
and relational calculus. 

Determine whether a binary relation is a function. 

Test a function for the onto and one-to-one properties. 

Create composite functions. 

Decide whether a function has an inverse function and what the inverse 
function is. 

Manipulate cycle notation for permutation functions. 

Compute the number of functions, onto functions, and one-to-one functions 
from one finite set to another. 

Understand order of magnitude as a relative measure of function rate of growth. 
Build a hash table using a modulo hash function. 

Encode and decode messages using RSA public key encryption. 

Use the mod function to compute check digits for various identification codes. 
Perform matrix arithmetic on matrices of appropriate dimensions. 

Solve systems of linear equations using Gaussian elimination. 

Perform Boolean arithmetic operations on Boolean matrices of appropriate 
dimensions. 


327 


Question: 


328 Relations, Functions, and Matrices 


Your company has developed a program for use on a small parallel processing 
machine. According to the technical documentation, the program executes processes 
P1, P2, and РЗ in parallel; these processes all need results from process P4, so they 
must wait for Process P4 to complete execution before they begin. Processes P7 and 
P10 execute in parallel but must wait until processes P1, P2, and P3 have finished. 
Process P4 requires results from P5 and P6 before it can begin execution. P5 and 
P6 execute in parallel. Processes P8 and P11 execute in parallel, but P8 must wait 
for Process P7 to complete and P11 must wait for process P10 to complete. Process 
P9 must wait for results from P8 and P11. You have been assigned to convert the 
software for use on a single processor machine. 


In what order should the processes be executed? 


Here various pairs of processes are related to one another by a "prerequisite" 
relation. This is a special case of a binary relation, a relationship between pairs of 
elements within a set. We will study the various properties of binary relations in 
Section 5.1. One type of binary relation is called a partial ordering; elements re- 
lated by a partial ordering can be represented graphically. Another type of binary 
relation is an equivalence relation; elements related by an equivalence relation can 
be grouped into classes. 

A topological sort extends a partial ordering to a total ordering. For a par- 
tial ordering of prerequisite tasks, a corresponding total ordering identifies the 
sequential order in which the tasks would have to be done, which is the solution 
to the parallel processing conversion problem. Topological sorting is presented in 
Section 5.2. 

A generalization of a binary relation forms the basis for relational databases, 
considered in Section 5.3. Using operations of restrict, project, and join on the 
relations in a database, we can make various queries of the database. 

A function is a special kind of binary relation. Functions, as well as relations, 
describe a number of real-world situations. Functions can also have special prop- 
erties, as discussed in Section 5.4. Order of magnitude, presented in Section 5.5, 
provides a way to compare the growth rate of two functions and is useful in the 
analysis of algorithms. A simple function called the modulo function has a sur- 
prising number of applications, ranging from encryption algorithms for computer 
security to the basis for artistic design patterns. Some of these applications are 
mentioned in Section 5.6. 

In Section 5.7, we consider matrices and develop an arithmetic for manipulat- 
ing them. Matrices provide a mechanism for solving systems of linear equations. 
We will later use matrices to represent relations and graphs. 


SECTION 5.1 


RELATIONS 


Binary Relations 


If we learn that two people, Henrietta and Horace, are related, we understand that 
there is some family connection between them—that (Henrietta, Horace) stands 
out from other ordered pairs of people because there 15 a relationship (cousins, sis- 
ter and brother, or whatever) that Henrietta and Horace satisfy. The mathematical 
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analogue is to distinguish certain ordered pairs of objects from other ordered pairs 
because the components of the distinguished pairs satisfy some relationship that 
the components of the other pairs do not. 


Remember (Section 4.1) that the Cartesian product of a set S with itself, S X S 
or S°, is the set of all ordered pairs of elements of S. Let S = (1,2, 3); then 


oos 2) З 2, 2), G ys 2} 09: Эў} 


If we were interested in the relationship of equality, then (1, 1), (2, 2), (3, 3) would 
be the distinguished elements of 5 X 5, that is, the only ordered pairs whose com- 
ponents are equal. If we were interested in the relationship of one number being 
less than another, we would choose (1, 2), (1, 3), and (2, 3) as the distinguished 
ordered pairs of S X S. ө 


In Example 1, we could pick out the distinguished ordered pairs (x, у) by say- 
ing that x = y or that x < y. Similarly, the notation x py indicates that the ordered 
pair (x, у) satisfies a relation р. The relation p may be defined in words or by an 
equation or simply by listing the ordered pairs that satisfy p. 


La s = 11,22) 0n e sa ЕИО) (4) О О) оа 
(4, 1), (4, 2), (4, 4)}, a relation p can be defined by x py if and only if x = y/2, 
abbreviated x py <> x = y/2 Thus (1, 2) and (2, 4) satisfy p. Alternatively, the 
same p could be defined by saying that {(1, 2), (2, 4)} is the set of ordered pairs 
satisfying p. e 


As in Example 2, one way to define the binary relation p is to specify a subset 
of S X S. Formally, this is the definition of a binary relation on a set. 


* DEFINITION BINARY RELATION ON A SET S 
A binary relation on a set S is a subset of S X S (a set of ordered pairs of 
elements of 5). 


Now that we know that a binary relation p is a subset, we see that 


xpyex(y)ep 


Generally, a binary relation is defined by describing the relation rather than by list- 
ing the ordered pairs. The description gives a characterizing property of elements 
of the relation; that is, it is a binary predicate satisfied by certain ordered pairs. 
A binary relation implies a yes/no result—an ordered pair either does or does not 
satisfy the binary predicate and either does or does not belong to the relation. 
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Let S = {1,2}. Then S X S = {(1, 1), (1, 2), (2, 1), (2, 2)}. Let p on 5 be given 
by the description xpy<>x + y is odd. Then (1, 2) € p and (2, 1) € p. The 
ordered pair (1, 1) є p because 1 + 1 is not odd. Similarly (2, 2) € p. e 


| EXAMPLE 4 | Let S = {1,2}. Then S x S = {(1, 1), (1, 2), (2, 1), (2, 2)}. If p is defined on S by 


p = (0, 1), (2, 1)}, then 1р1 and 2р1 hold, but not, for instance, 1р2. Here p 
seems to have no obvious verbal description. ө 


In this section we will be concerned almost exclusively with binary relations 
on a single set, but more generally, relations can be defined on multiple sets. 


* DEFINITION RELATIONS ON MULTIPLE SETS 
Given two sets S and 7, a binary relation from S to T (also called a binary 
relation on 5 X T)isa subset of 5 X T. Given n sets S, $5, ..., Sp, п > 2, an 
n-ary relation on S; X S, X ++- X S,isasubset of S, X 5, X --- X S, 


| EXAMPLES | Let S = {1,2,3} and T = (2,4, 7}. Then the set 
{(1, 2), (2, 4), (2, D} 


consists of elements from S X T. It is a binary relation from S to T. ө 


PRACTICE 1 | For each of the following binary relations р оп N, decide which of the given ordered 


pairs belong to p. 


xpy<ex = у + 1; (2, 2), (2, 3), (3, 3), (3, 2) 

. Xpy €» x divides y; (2, 4), (2, 5), (2, 6) 

xpy €» x is odd; (2, 3), (3, 4), (4, 5), (5, 6) 

ХО yw 1/2), 2.1) 2000059), (4,3) п 


по ср 


If p is a binary relation on S, then р will consist of a set of ordered pairs of ће 
form (51, 5). A given first component s; or second component s; can be paired in 
various ways in the relation. The relation is one-to-one if each first component and 
each second component appears only once in the relation. The relation is one-to- 
many if some first component s, appears more than once; that is, one s, is paired 
with more than one second component. It is many-to-one if some second compo- 
nent s, is paired with more than one first component. Finally, it is many-to-many 
if at least опе s, is paired with more than one second component and at least one 
5 is paired with more than one first component. Figure 5.1 illustrates these four 
possibilities. Note that not all values in $ need be components in ordered pairs of p. 
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| 


One-to-one One-to-many 


»p— 4 


Many-to-one Many-to-many 
Figure 5.1 


These ideas extend to relations from a set S to a set 7. The relation of Example 5 
is one-to-many because the first component 2 appears more than once; 2 from set 
S is associated with both 4 and 7 from set T: 


PRACTICE 2 | Identify each of these relations on S, where S = (2, 5, 7, 9}, as one-to-one, one-to-many, 
many-to-one, or many-to-many. 


ао) уок О; 2)) 
beer (БШ (еду ч 
E EO Sa 989).(2. 7) 


Suppose В is the set of all binary relations on a given set S. If p and с belong 
to B, then they are subsets of S X S. As such, we can perform set operations 
of union, intersection, and complementation that result in new subsets of S X S, 
that is, new binary relations, which we will denote by p U e, po, and р’, 
respectively. Thus 


х(р U о)у e»xpyorxa y 
х(рПо)у e xpy and xa y 
xp'y«» notxpy 


PRACTICE 3 | Let p and ø be two binary relations on N defined by x py &»x = y and xo y &» x < y. 
Give verbal descriptions for parts (a), (b), and (c); give a set description for part (d). 


a. What is the relation p U a? 

b. What is the relation р’? 

c. What is the relation с’? 

d. What is the relation p N с? н 


REMINDER 


Reflexive —Every x is 
related to itself. 
Symmetric — |f x is related 
to y, then y is related to x. 
Transitive — lf x is related 
to y and y is related to z, 
then x is related to z. 


© DEFINITION 


DEFINITION 
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The following facts about the operations U, N and ' on relations are immedi- 
ate consequences of the basic set identities found in Section 4.1. The set S? (which 
is, after all, a subset of S?) is being viewed here as a binary relation on S. 


laapUe-cUp Ib. oe 7» ср 

2a.(p ОЈ о) Uy » pU (с U y) 2b. (pNa)Ny = pN(aN у) 

3a. p U (c C0 y) = (p U о) (рО у) 3b. рђћ(с U у) = PAPU (py) 
4a. p U Ф = р 4b. pS? = p 

5a. p U p' = S? 5b. pN p' = Ø 


Properties of Relations 


A binary relation on a set S may have certain properties. For example, the 
relation p of equality оп S, (х,у) € p €» x = y, has three properties: (1) for 
апу x E S,x = x, or (x, x) e p; (2) for any x,y € S, if x = y then y = x, or 
(x, y) € p — (y, x) € p; and (3) for any x,y,z € S, if x = y and y = z, then 
x = z, or (х, у) e p and (y, z) e p] > (x, 2) є p. These three properties make 
the equality relation reflexive, symmetric, and transitive. 


REFLEXIVE, SYMMETRIC, AND TRANSITIVE RELATIONS 
Let p be a binary relation on a set S. Then 


p is reflexive means (Vx)(x € S — (x, x) € p) 
pis symmetric means (Vx)Vy)x e 5 Лує 5 Л (х, у) єр (у, х) є р) 
p is transitive means (Vx)(Vy(Vz(x e S^yeS^zeS^ 

(х,у) E€ p(y, 2) єр (х, Zz) є р) 


Consider the relation = оп the set №. This relation is reflexive because for апу 
nonnegative integer x, x = x. It is also a transitive relation because for any non- 
negative integers x, y, and z, if х= y and y = 2, then x = 2. However, = is 
not symmetric; 3 = 4 does not imply 4 = 3. In fact, for any x, y є №, if both 
x € y and y = х, then x = y. This characteristic is described by saying that = is 
antisymmetric. ө 


ANTISYMMETRIC RELATION 
Let p be a binary relation on a set S. Then p is antisymmetric means 


(aye eSAyeSA,y) epAy,x) ерх = у) 


Let S = g(N). Define a binary relation p on Sby A pB <A С B. Then pis reflexive 
because every set is a subset of itself. Also, p is transitive because if A is a subset of 
В and B is a subset of C, then А is a subset of C. Finally, p is antisymmetric because 
if А is a subset of B and В is a subset of A, then А and В are equal sets. ө 


REMINDER 


Antisymmetric — lf x is re- 
lated to y and y is related 
to x, then x = y. 
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АП four relational properties involve the implication connective. The univer- 
sal quantifiers mean that the implications must be true for arbitrary choices of 
variables. Recall that to prove an implication true, we assume that the antecedent 
18 true and prove that the consequent must also be true. For the reflexive property, 
the antecedent just chooses an arbitrary element in S; the consequent says that 
this element must be related to itself. For a relation p on a set to be reflexive, then, 
every element in the set must be related to itself, which specifies certain ordered 
pairs that must belong to p. 

However, in the symmetric, transitive, and antisymmetric properties, the an- 
tecedent does not say only that the elements are in 5. To prove that a relation is 
symmetric, for example, we must show that if x and y are arbitrary elements in S 
and if in addition x is related to y, then it must be the case that y is related to x. This 
says that if certain ordered pairs are found in p, then certain other ordered pairs 
must also be in p in order for p to be symmetric. In other words, knowledge of the 
set S is critical to determining whether reflexivity holds, while to determine the 
other properties, it is sufficient just to look at the ordered pairs in p. 

Atany rate, the question of whether a given relation ona set S has a certain prop- 
erty requires a yes or no answer. The property either holds or it doesn't. 


PRACTICE 4 | Герб 11,2, 3}. 


e 


. Ifa relation p on S is reflexive, what ordered pairs must belong to р? 


b. Ifarelation p on Sis symmetric, what ordered pairs must belong to p? (This is a trick question; see 
the answer at the back of the book.) 


e 


. Ifa relation p on S is symmetric and if (a, Б) є p, then what other ordered pair must belong to р? 


d. Ifa relation p on S is antisymmetric and if (a, b) and (b, a) belong to p, what must be true? 
e. Is the relation p = {(1, 2)} on S transitive? (Hint: Remember the truth table for implication.) ш 


The properties of symmetry and antisymmetry for binary relations are not 
precisely opposites. Antisymmetric does not mean “not symmetric.” A relation 
is not symmetric if some (x, y) belongs to the relation but (y, x) does not. More 
formally, not symmetric means 


(ух) (Уу) х е5 Лує 5 Л (х,у) єр (у, х)р])' 
o (xaylx E SAy ESA (х, у) єр (у, х)р] 
o (Ax\(Ay[@ е5 лує 8 Л (х, у) єр) VG.» єр] 
o (39(3yl( eSAy ESA (х, у) єр) Л (у, х) € p] 


Relations can therefore be symmetric and not antisymmetric, antisymmetric and 
not symmetric, both, or neither. 

The equality relation on a set 5 is both symmetric and antisymmetric. However, 
the equality relation on S (or a subset of this relation) is the only relation having 
both these properties. To illustrate, suppose p is a symmetric and antisymmet- 
ric relation on SS, and let (x, y) € p. By symmetry, it follows that ( y, x) € p. But 
by antisymmetry, x — y. Thus, only equal elements can be related. The relation 
р = (0,2), (2, 1), (1, 3)} on the set S = (1, 2, 3} is neither symmetric—(1, 3) be- 
longs but (3, 1) does not—nor antisymmetric—(1, 2) and (2, 1) belong, but 1 = 2. 
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PRACTICE 5 | Test each binary relation on the given set 5 for reflexivity, symmetry, antisymmetry, and 
transitivity. 


> 
+ 


N 
2 


Б њоро сь 


ЕХАМРІЕ 8 


© DEFINITION 


S xpy «х + yis even 

S (positive integers); x py < x divides y 

S = set of all lines in the plane; x py <> x is parallel to y or x coincides with y 
SENS pr x = у 
S 

S 

S 

S 


2 


The discussion on recursion in Prolog (Section 1.5) noted that a recursive rule 
should be used when the predicate being described is one that is inherited from one 
object to the next. The predicate in-food-chain used there has this property because 


in-food-chain-(x, y) A in-food-chain (y, 2) > in-food-chain (х, 2) 


Now we see that this is simply the transitive property. e 


Closures of Relations 


If a relation p on a set S fails to have a certain property, we may be able to extend 
p to a relation p* on S that does have that property. By "extend," we mean that the 
new relation p* will contain all the ordered pairs in p plus the additional ordered 
pairs needed for the desired property to hold. Thus p C p*. If p* is the smallest 
such set, then p* 15 called the closure of p with respect to that property. 


CLOSURE OF A RELATION 
A binary relation p* on a set S is the closure of a relation p on S with respect to 
property P if 


1. p* has property P. 
2.9 C pt. 
3. p* isa subset of any other relation on S that includes p and has property P. 


We can look for the reflexive closure, the symmetric closure, and the transi- 
tive closure of a relation on a set. Of course, if the relation already has a property, 
it is its own closure with respect to that property. 


Las = „7. 3} and p = {(1, у (CIL; 2), Cl, 3, (0 1D (2; 3)). Then р is not reflex- 
ive, not symmetric, and not transitive. The closure of p with respect to reflexivity is 


a = UN („2 Gah Gr TU etu ey 5 (б ey 


РВАСТ!СЕ 6 
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This relation is reflexive and contains p. Furthermore, any reflexive relation on 5 
would have to contain the new ordered pairs we've added—(2, 2) and (3, 3)—so 
no smaller reflexive relation can exist; that 15, any reflexive relation containing p 
must have the relation above as a subset. 

The closure of p with respect to symmetry is 


ge = ОООО ООС) 


Here it is also clear that we have added just those new pairs required—(2, 1) and 
(3, 2)—for the relation to be symmetric. 

For both reflexive closure and symmetric closure, we only had to inspect the 
ordered pairs already in p to find out what ordered pairs we needed to add (assum- 
ing we knew what the set S was). The reflexive or symmetric closure of the rela- 
tion could be found in one step. Transitive closure may require a series of steps. 
Inspecting the ordered pairs in our example p, we see that we need to add (3, 2) 
(because of (3, 1) and (1, 2)), (3, 3) (because of (3, 1) and (1, 3)), and (2, 1) (be- 
cause of (2, 3) and (3, 1)). This gives the relation 


ОЕЕО ОСО (959 C Di 


However, this relation is still not transitive. Because of the new pair (2, 1) and the 
old pair (1, 2), we need to add (2, 2). This gives the relation 


DECIR) ОООО) 


which is transitive and also the smallest transitive relation containing p. It is the 
transitive closure of p. ө 


As ш Example 9, one way to find the transitive closure of a relation is to inspect 
the ordered pairs in the original relation, add new pairs if necessary, inspect the 
resulting relation, add new pairs if necessary, and so on, until a transitive relation is 
achieved. This is a rather ad hoc procedure, and we will give a better algorithm in 
Chapter 7. There we will also see that the transitive closure of a binary relation is 
related to “reachability in a directed graph,” which has many applications. 


Does it make sense to look for the antisymmetric closure of a relation on a set? Why or why 


not? 


PRACTICE 7 | Find the reflexive, symmetric, and transitive closure of the relation 


{(a, a), (b, b), (c, c), (а, с), (a, а), (b, d), (c, a), (d, а)} 


on the set S = (a, b, c, d]. п 
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For the rest of this section we will concentrate on two types of binary relations 
that are characterized by which properties (reflexivity, symmetry, antisymmetry, 
and transitivity) they satisfy. 


Partial Orderings 


* DEFINITION PARTIAL ORDERING 
A binary relation on a set S that is reflexive, antisymmetric, and transitive is 
called a partial ordering on S. 


From previous examples and Practice 5, we have the following instances of 
partial orderings: 


On №, xpy e» x = y. 
On (№, 4p B €» A C B. 
On Z^, xpy x divides y. 


On (0, 1}, xpy o x = y’. 


If p is a partial ordering on S, then the ordered pair (S, p) is called a partially 
ordered set (also known as a poset). We will denote an arbitrary, partially ordered 
set by (S, <); in any particular case, < has some definite meaning such as “less 
than or equal to,” “is a subset of,” “divides,” and so on. (The symbol for a generic 
partial ordering, <, is designed to resemble the inequality symbol =, which, as 
we’ve just noted, is a partial ordering on the set N or on any other set in which a 
less-than-or-equal-to relation makes sense.) 

Let (S, <) be a partially ordered set, and let A C S. Then < is a set of or- 
dered pairs of elements of S, some of which may be ordered pairs of elements 
of A. If we select from < the ordered pairs of elements of A, this new set is called 
the restriction of < to A and is a partial ordering on A. (Do you see why the 
three required properties still hold?) For instance, once we know that the relation 
“х divides y" is a partial ordering on Z* , we automatically know that “x divides y" is 
a partial ordering on 11, 2, 3, 6, 12, 18}. 

We want to introduce some terminology about partially ordered sets. Let 
(S, <) be a partially ordered set. If x < y, then either x = y or x z£ y. If x y 
but x = y, we write x < y and say that x is a predecessor of y or y is a successor 
of x. A given y may have many predecessors, but if x « y and there is no z with 
x < т < y, then x is an immediate predecessor of y. 


PRACTICE 8 | Consider the relation “x divides y" on (1, 2, 3, 6, 12, 18}. 


a. Write the ordered pairs (x, y) of this relation. 
b. Write all the predecessors of 6. 
c. Write all the immediate predecessors of 6. m 


If S is finite, we can visually depict a partially ordered set (5, <) by using 
a Hasse diagram. Each of the elements of S is represented by a dot, called a 
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node, or vertex, of the diagram. If x is an immediate predecessor of y, then the 
node for y is placed above the node for x and the two nodes are connected by a 
straight-line segment. 


Consider @({1, 2}) under the relation of set inclusion. This is a partially ordered set, 
a restriction of the partially ordered set (9(N), C). The elements of e((1, 2}) are 
@, {1}, {2}, and (1, 2}. The binary relation C consists of the following ordered 
pairs: 


(©. 0), C1, (55, 02), (29. C1, 25, 1, 2), (2, Шр}, 
(ОООО Ор) 


The Hasse diagram of this partially ordered set appears in Figure 5.2. Note that 
although Ø is not an immediate predecessor of {1, 2}, it is a predecessor of {1, 2} 
(shown in the diagram by the chain of upward line segments connecting @ with 


1152]: ә 
{1,2} 
{1} {2} 
REMINDER 
Two nodes in a Hasse 
diagram should never be @ 
joined by a horizontal line. Figure 5.2 


PRACTICE 9 | Draw the Hasse diagram for the relation “х divides y" on (1, 2, 3, 6, 12, 18}. - 


The Hasse diagram of a partially ordered set conveys all the information 
about the partial ordering. We can reconstruct the set of ordered pairs making up 
the partial ordering just by looking at the diagram. The lines in the diagram tell 
us immediate (predecessor, successor) pairs. We can fill in the rest by using the 
reflexive and transitive properties. Thus, given the Hasse diagram in Figure 5.3 of 
a partial ordering < on a set (a, b, c, d, e, f }, we can conclude that < is the set 


(а, a), (b, D), (с, c), (d, d), (е, е), (7.7), (a, Б), (a, с), (a, d), (а, е), (d, e)} 


Figure 5.3 
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Two elements of S may be unrelated in a partial ordering of S. In Example 10, 
{1} and {2} are unrelated; so are 2 and 3, and 12 and 18 in Practice 9. In Figure 5.3, 
fis not related to any other element. A partial ordering in which every element of 
the set is related to every other element is called a total ordering, or chain. The 
Hasse diagram for a total ordering on a four-element set looks like Figure 5.4. 
The relation = on N is a total ordering, although we can’t draw a Hasse diagram 
because N is an infinite set. 


Figure 5.4 


Again, let (S, X) bea partially ordered set. If there is a y € S with y < x for 
all x € S, theny is a least element of the partially ordered set. A least element, 
if it exists, is unique. To show its uniqueness, assume that y and z are both least 
elements. Then y < z because y is least and z < y because z is least; by anti- 
symmetry, y = z. An element y є S is minimal if there is no x € S with x < y 
. In the Hasse diagram, a least element is below all others, while a minimal 
element has no elements below it. Similar definitions apply for greatest element 
and maximal elements. 


PRACTICE 10 | Define greatest element and maximal element in a partially ordered set (S, X). = 


In the partially ordered set of Practice 9, 1 is both least and minimal; 12 and 18 are 
both maximal, but there is no greatest element. e 


A least element is always minimal and a greatest element is always maximal, 
but the converses are not true (see Example 11). In a totally ordered set, however, 
a minimal element is the least element and a maximal element is the greatest 
element. 


PRACTICE 11 | Draw the Hasse diagram for a partially ordered set with four elements in which there are 
two minimal elements but no least element, two maximal elements but no greatest element, 
and each element is related to exactly two other elements. 


Partial orderings satisfy the properties of reflexivity, antisymmetry, and tran- 
sitivity. Another type of binary relation, which we study next, satisfies a different 
set of properties. 


* DEFINITION 


REMINDER 


A partial ordering is anti- 
symmetric; an equivalence 
relation is symmetric. 


* DEFINITION 
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Equivalence Relations 


EQUIVALENCE RELATION 
A binary relation on a set S that is reflexive, symmetric, and transitive is called an 
equivalence relation on S. 


We have already come across the following examples of equivalence relations: 


On any set S, x py €» x = y. 

On №, x py €» x + y is even. 

On the set of all lines in the plane, x py < x is parallel to y or coincides with y. 
On (0, 1}, xpy o x = y’. 

On {x|x is a student in your class}, xpy €» x sits in the same row as y. 

On {1,2,3} p = {(1, 1), (2, 2), (3, 3), (1, 2), Q2, 1)). 


We can illustrate an important feature of an equivalence relation on a set 
by looking at 5 = {x|x is a student in your class}, x py < “x sits in the same row 
as у” Let's group together all those students in set 5 who are related to one an- 
other. We come up with Figure 5.5. We have partitioned the set S into subsets in 
such a way that everyone in the class belongs to one and only one subset. 


S 


Figure 5.5 


PARTITION OF A SET 
A partition of a set S is a collection of nonempty disjoint subsets of S whose 
union equals S. 


Any equivalence relation, as we will see, partitions the set on which it is 
defined. The subsets making up the partition, often called the blocks of the parti- 
tion, are formed by grouping together related elements, like the students in the 
classroom. 

For p an equivalence relation on a set S and x € S, we let [x] denote the set 
of all members of S to which x is related, called the equivalence class of x. Thus, 


[x] = {yly e S^xpy) 


(Because p is symmetric, we could just as well have said that [x] = {yly € S 
^ ypx}.) 
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In the case where хру <> “х sits in the same row as у,” suppose that John, Chuck, 
Jose, Judy, and Ted all sit in row 3. Then [John] = {John, Chuck, Jose, Judy, Ted}. 
Also [John] = [Ted] = [Judy], and so on. These are not distinct classes, but the 
same class with multiple names. An equivalence class can take its name from any 
of the elements in it. e 


Now we state the result about equivalence relations and partitions. For some 
practice with formal theorems and proofs, we give this result as a formal theorem, 
then analyze the structure of the proof and complete part of the proof. 


* THEOREM ON EQUIVALENCE RELATIONS AND PARTITIONS 
An equivalence relation р оп а set S determines a partition of S, and a partition of 
a set S determines an equivalence relation on S. 


Partial Proof: The theorem makes two separate statements: 


a. An equivalence relation on S determines a partition of S. 
b. A partition of 5 determines an equivalence relation on S. 


To prove part (a), we must show that the distinct equivalence classes of members 
of S under equivalence relation p satisfy the definition of a partition. To satisfy the 
definition of a partition, we must show that 


1. the union of these distinct classes equals S. 
ii. the distinct classes are disjoint. 


To prove part (a. 1), we must show something about the union of the distinct 
equivalence classes formed by p. Equivalence classes are sets of elements of S, so 
their union is a set; let's denote this set by U. We must show that U — S, which is 
a set equality. To prove this set equality, we will prove set inclusion in each direc- 
tion; in other words, 


l. UC S 
2.SCU 


For this part of the proof, we are finally down to two small statements that are 
easy to prove, as follows: 


a.i.l: Let x € U. Then x belongs to an equivalence class. Every equivalence 
class is a subset of S, so x Є S. 


a.i.2: Let x є S. Then x px (reflexivity of p); thus, x є [x], and every member 
of S belongs to some equivalence class, hence to the union of classes U. 


This completes the proof of part (a.i). For part (a.ii), let [x] and [z] be two 
equivalence classes. We want to show that distinct classes are disjoint, or 


[x] 2 [z] > [x]  [z] = Ø (a.ii) 
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If we assume that [x] # [z], we must then show that [x] A [z] does not contain any- 
thing, which might be hard to do. So instead, we'll prove the contrapositive of a. ii: 


[х] [z] # 2 > [x] = [z] (contrapositive of a.ii) 


Therefore, we assume that [х] A [z] = Ø and that there is a y € S such that 
y € [x] A [z]. What does this tell us? 


yelx]n [z] (assumption) 

yelxl» e [z] (definition of N) 
Xpy,zpy (definition of [x] and [z]) 
Xpy, y pz (symmetry of p) 

хр2 (transitivity of p) 


Now we can show that [x] = [z] by proving set inclusion in each direction: 


To show (3), [z] С [x], let q € [z] (we know [z] ¥ Ø because y € [z].) Then 


zpq (definition of [z ]) 
хр2 (from above) 
xpq (transitivity of p) 
q є [x] (definition of [x]) 
[z] € [x] (definition of C) 


Practice 12 asks for a proof of (4), [x] C [z]. Once this proof is supplied, it 
completes (3) and (4), which leads to the conclusion [x] = [2]. This completes the 
proof of the contrapositive of part (а. ii) and therefore proves part (а. ii), which in 
turn completes the proof of part (a). Whew! 

Practice 13 asks for a proof of part b. 

End of Partial Proof 


PRACTICE 12 | For the foregoing argument, supply the proof that [x] C [z]. 


PRACTICE 13 | Prove part (b) of the theorem. Given a partition of a set S, define a relation p by 


xpy €» x is in the same block of the partition as у 


and show that p is an equivalence relation on S, that is, show that p is reflexive, symmetric, and transitive. m 


| EXAMPLE 13 | The equivalence relation on N given by 


xpy<>x + yis even 
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partitions N into two equivalence classes. ТЁх is an even number, then for any even 
number y, x + y is even and y є [x]. All even numbers form one class. If x is an 
odd number and y is any odd number, x + y is even and у є [x]. АП odd numbers 
form the second class. The partition can be pictured as in Figure 5.6. Notice again 
that an equivalence class may have more than one name, or representative. In this 
example, [2] = [8] = [1048], and so on; [1] = [17] = [947], and so on. 


N 


Figure 5.6 LJ 


PRACTICE 14 | For each of the following equivalence relations, describe the corresponding equivalence 
classes. 


а. On the set of all lines in the plane, x py < x is parallel to y or x coincides with y. 
b. On the set №, xpy e» x = y. 
ee Oni 2a o 001620), (2, 2): 4,9 922). (2 MINIS a 


Partitioning a set into equivalence classes is helpful because it is often conve- 
nient to go up one level of abstraction and treat the classes themselves as entities. 
We will conclude this section with two important examples where this is the case 
(you actually saw the first example somewhere around the fourth grade). 


Let S = (a/b|a, b e Z, b = 0}. Sis therefore the set of all fractions. The fractions 
1/2 and 2/4 are different fractions—they have different numerators and different 
denominators, but they are said to be “equivalent.” Formally, a/b is equivalent 
to c/d, denoted by a/b ~ c/d, if and only if ad = bc. We will show that the bi- 
nary relation ~ on S is indeed an equivalence relation. First, a/b ~ a/b because 
ab = ba. Also, if a/b ~ c/d then ad = bc, or cb = da and c/d ~ a/b. Hence, 
~ is reflexive and symmetric. To show that ~ is transitive, let a/b ~ c/d and 
c/d ~ e/f. Then ad = bc and cf = de. Multiplying the first equation by f and the 
second by b, we get adf = bcf and bcf = bde. Therefore, adf = bde, or af = be 
(why is it legitimate to divide by d here?). Thus, a/b ~ e/f, and ~ is transitive. 
Some sample equivalence classes of S formed by this equivalence relation are 


1 223 
2 xd ? 0: =й? = у D 4 6 
EI - b = e} 
10 "ep uq mp He Эў sm 
The set Q of rational numbers can be regarded as the set of all equivalence 
classes of S. A single rational number, such as [1/2], has many fractions represent- 


ing it, although we customarily use the reduced fractional representation. When 
we add two rational numbers, such as |1/2] + [3/10], we look for representatives 
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from the classes having the same denominator and add those representatives. Our 
answer is the class to which the resulting sum belongs, and we usually name the 
class by using a reduced fraction. Thus, to add [1/2] + [3/10], we represent [1/2] 
by 5/10 and [3/10] by 3/10. The sum of 5/10 and 3/10 is 8/10, and [8/10] is 
customarily named [4/5]. This procedure is so familiar that it is generally writ- 
ten as 1/2 + 3/10 = 4/5; nonetheless, equivalence classes of fractions are being 
manipulated by means of representatives. ө 


We will define a binary relation of congruence modulo 4 on the set Z of inte- 
gers. An integer x is congruent modulo 4 to y, symbolized by x = 4y, or x = y 
(mod 4), if x — y is an integral multiple of 4. Congruence modulo 4 is an 
equivalence relation on Z. To construct the equivalence classes, note that [0], for 
example, will contain all integers differing from 0 by a multiple of 4, such as 4, 8, 
— 12, and so on. The distinct equivalence classes are 


ШЕЕ о 

ШЕ = Salo, 9] 

РДЕ ЕЕ 

B] = Е, ө 


There is nothing special about the choice of 4 in Example 15; we can give а 
definition for congruence modulo п for any positive integer л. 


* DEFINITION CONGRUENCE MODULO n 
For integers x and y and positive integer n, 


x = y (mod n) if x — y is an integral multiple of n 


This binary relation is an equivalence relation on Z for any positive inte- 
ger п (see Exercise 46). This equivalence relation and the resulting equivalence 
classes can be used for integer arithmetic on a computer. An integer is stored as 
a sequence of bits (05 and 15) within a single memory location. Each computer 
allocates a fixed number of bits for a single memory location (this number varies 
depending on the architecture of the computer—how its memory space is laid 
out). The larger the integer, the more bits required to represent it. Therefore each 
machine has some limit on the size of the integers it can store. Suppose that n — 1 
is the maximum size and that x and y are integer values with 0 = x =n — 1, 
0 <= у <= n- l.Ifthesum x + y exceeds the maximum size, it cannot be stored. 
As an alternative, the computer may perform addition modulo n and find the 
remainder r when x + y is divided by n. 
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The equation 
xty=qnt+r,05r<n 


symbolizes this division, where q is the quotient and r is the remainder. This 
equation may be written as 


x ty-r-qn 


which shows that (x + y) — r is an integral multiple of n, or that (x + y) =r 
(mod л). The integer r may not be x + y, but it is in the equivalence class [x + y], 
and since 0 =r < n, it is also in the range of integers that can be stored. (The 
system may or may not issue an integer overflow message if x + y is too large to 
store and addition modulo n must be used.) The situation is analogous to your car's 
odometer, which records mileage modulo 100,000; when mileage reaches 102,758, 
for example, it is displayed on the odometer as 2,758. 


PRACTICE 15 | What are the equivalence classes corresponding to the relation of congruence modulo 5 


0) 
on Z? ш 
PRACTICE 16 | If 4 is the maximum integer that can be stored on a (micromicro) computer, what will 
be stored for the value 3 + 4 if addition modulo 5 is used? = 


Table 5.1 summarizes important features of partial orderings and equivalence 
relations. 


TABLE 5.1 


Partial Orderings and Equivalence Relations 


Type of Binary Relation 


Reflexive | Symmetric | Antisymmetric | Transitive Important Feature 


Partial ordering 


Predecessors and 
successors 


Yes No Yes Yes 


Equivalence relation 


Yes Yes No Yes Determines a partition 


SECTION 5.1 REVIEW 


TECHNIQUES 


* Test an ordered pair for membership in a binary 


relation. 


W Test a binary relation for reflexivity, symmetry, 
antisymmetry, and transitivity. 
* Find the reflexive, symmetric, and transitive 


closure of a relation. 


* Draw the Hasse diagram for a finite partially 
ordered set. 
* Findleast, minimal, greatest, and maximal elements 
in a partially ordered set. 
Ù Find the equivalence classes associated with an 
equivalence relation. 
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MAIN IDEAS * Binary relations can have properties of reflexivity, 


* А binary relation on a set 5 is formally a subset of 
S X S; the distinctive relationship satisfied by the 
relation's members often has a verbal description 
as well. 

* Operations on binary relations on a set include 
union, intersection, and complementation. 


symmetry, transitivity, and antisymmetry. 

* Finite partially ordered sets can be represented 
graphically. 

* An equivalence relation on a set S determines a 
partition of S, and conversely. The blocks of the 
partition are equivalence classes, which may them- 
selves be treated as entities. 


EXERCISES 5.1 


1. For each of the following binary relations p on N, decide which of the given ordered pairs belong to p. 


a. 
b. 
с. 
а. 


хру<ҷх+у < 7; (1, 3), (2, 5), (3, 3), (4, 4) 

xpy €»x = y + 2; (0, 2), (4, 2), (6, 3), (5, 3) 

xpy €» 2x + Зу = 10; (5, 0), (2, 2), (3, 1), (1, 3) 

хру < y is а perfect square; (1, 1), (4, 2), (3, 9), (25, 5) 


2. For each of the following binary relations р on Z, decide which of the given ordered pairs belong to р. 


a. 
b. 
с. 
а. 


е. 


хру © х|у; (2, — 6), (3, 5), (8, 4), (4, 8) 

xpy +> х and y аге are relatively prime; (5, 8), (9, 16), (6, 8), (8, 21) 

xpy €» gcd(x, у) = 7; (28, 14), (7, 7), (10, 5), (21, 14) 

xpy €» x? + у? = z for some integer 2; (1, 0), (3, 9), (2, 2), (—3, 4) 

x py < х is a number from the Fibonacci sequence; (4, 3), (7, 6), (7, 12), (20, 20) 


3. Decide which of the given items satisfy the relation. 


а. 
b. 
с. 
а. 


p à binary relation ор Z, xpy +> x = —y; (1, — 1), (2, 2), (—3, 3), (—4, —4) 

p à binary relation оп №, x py «€» x is prime; (19, 7), (21, 4), (33, 13), (41, 16) 

p à binary relation on Q, xpy<> x = 1/y; (1, 2), (3, — 5), (4, 1/2), (1/2, 1/3) 

p a binary relation on N X №, (х, у)р (u, v) x + u = y + v; (1, 2), (3, 2), ((4, 5), (0, 1)) 


4. Decide which of the given items satisfy the relation. 


a. 


pabinaryrelationon S X C,where S = {states in the United States), C = {cities in the United States}, 
xpy €» y is the capital of x; (Indiana, Indianapolis), (Illinois, Chicago), (Kansas, Kansas City), 
(Kentucky, Louisville), (North Dakota, Bismarck) 


. p a binary relation on А X P, where A = {artists}, P = {paintings}, x py €» x painted y; (DaVinci, 


Mona Lisa), (Grant Wood, American Gothic), (Remington, Ridden Down), (Picasso, Blue Dancers), 
(van Gogh, Starry Night) 


. p à binary relation on C X M, where С = {composers}, M = {music}, xpy €x composed у; 


(Bernstein, West Side Story), (Presley, Blue Suede Shoes), (Gershwin, Rhapsody in Blue), (Beethoven, 
Moonlight Sonata), (Rogers and Hammerstein, Phantom of the Opera) 


. p a binary relation on A X B, where А = {authors}, В = {books}, хру < x wrote y; (Hemingway, 


The Old Man and the Sea), (Sawyer, Huckleberry Finn) (Poe, Moby Dick), (Orwell, 1984), (Tolstoy, 
Crime and Punishment) 


5. For each of the following binary relations on R, draw a figure to show the region of the plane it describes. 


a. 
b. 
с. 
а. 


xpyeyz2 
xpyex-y-l 
xpyexr ty < 25 
хру «х 2 у 
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6. For each of the accompanying figures, give the binary relation on R that describes the shaded area. 


y 


7. Identify each relation on N as one-to-one, one-to-many, many-to-one, or many-to-many. 
a. p = {(1, 2), (1, 4), (1, 6), (2, 3), (4, 3)) 
b. p = {(9, 7), (6, 5), (3, 6), (8, 5)} 
c. p = {(12, 5), (8, 4), (6, 3), (7, 12)} 
d. p = {(2, 7), (8, 4), (2, 5), (7, 6), (10, 1)} 
8. Identify each of the following relations on S as one-to-one, one-to-many, many-to-one, or many-to-many. 
а. 5 = № 
хру<ҷх = у + 1 
b. S = set of all women inVicksburg 
xpy €» x is the daughter of y 


c. S = e((1,2,3) 
ApB = |A| = |B| 
d. S= R 
хру<х = 5 
9. Let p and с Бе binary relations оп N defined by хру < “x divides y," хоу < 5x = y. Decide which of 
the given ordered pairs satisfy the following relations. 
a. p Ua; (2, 6), (3, 17), (2, 1), (0, 0) 
b. pN g; (3, 6), (1, 2), (2, 12) 
c. p’; (1, 5), (2, 8), (3, 15) 
d. o^; (1, 1), (2, 10), (4, 8) 


10. 


11. 


12. 


13. 


14. 
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Both p and c are binary relations from P to C where P = {people in theUnited States}, C = {cities in the 
United States}, xpy < x lives in y, and хоу < x works in y. Describe each of the following relations. 


a. pN c. pflc* 
b. pUa d. р N 


Let S = {1, 2, 3}. Test the following binary relations on S for reflexivity, symmetry, antisymmetry, and 
transitivity. 

а. р = {(1, 3), 3, 3), G, 1), @ 2), (2, 3), (1, 0, (1,2)} 

b. р = 10, D, (3,3), (2, D} 

c. p = {(1, 1), (1,2), (2, 3), (3, D, (1, 3)} 

d. p = {(1, 1), (1,2), (2,3), (1, 3)} 


Let S = {0, 1, 2, 4, 6}. Test the following binary relations on S for reflexivity, symmetry, antisymmetry, 
and transitivity. 


a. р = {(0, 0), (1, 1), (2, 2), (4, 4), (6, 6), (0, 1), (1,2), (2, 4), (4, 6) 


b. р = {(0, 1), (1, 0), (2, 4), (4, 2), (4, 6), (6, 4)} 

c. p = 10, 1), (1, 2), (0, 2), (2, 0), (2, 1), (1, 0), (0, 0), (1, 1), (2, 2)} 

d. p = {(0, 0), (1, 1), (2, 2), (4, 4), (6, 6), (4, 6), (6, 4)} 

e. p = Ø 

Test the following binary relations on the given sets S for reflexivity, symmetry, antisymmetry, and transitivity. 


a S=Q 
хру |x| = ly 


b. S=Z 

хру «x — yis an integral multiple of 3 
c. S=N 

хру €»x-yis even 
d. 5 = № 


xpy €» х is odd 


e. 5 = set of all squares in the plane 
Sip S; length of side of S, = length of side of S; 


Test the following binary relations on the given sets S for reflexivity, symmetry, antisymmetry, and 
transitivity. 
a. S — set of all finite-length strings of characters 
xpy «» number of characters in x — number of characters in y 
b. S = {0, 1, 2, 3, 4, 5} 
xpyoxty=5 
c. S = 0({1, 2, 3, 4, 5, 6, 7, 8, 9}) 
A pB<>|A| = [В| 
d. S = g({1, 2, 3, 4, 5, 6, 7, 8, 9}) 
АрВ © |A| = |B| 
e 5 = Мх М№ 
(х1, у)р(, у) ху = x, and yj = yy 
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15. Which of the binary relations of Exercise 13 are equivalence relations? For each equivalence relation, 


16. 


17; 


18. 


19. 


20. 


2 


— 


describe the associated equivalence classes. 
Which of the binary relations of Exercise 14 are equivalence relations? For each equivalence relation, 
describe the associated equivalence classes. 
Test the following binary relations on the given sets S for reflexivity, symmetry, antisymmetry, and 
transitivity. 
а. S=Z 
xpy<>x + y is a multiple of 5 
b. 5 = 2 
хру<х < у 
с. 5 = set of all finite-length binary strings 
xpy «€» xis a prefix of y 
d. S — set of all finite-length binary strings 
xpy < x has the same number of 15 as у 
Test the following binary relations on the given sets S for reflexivity, symmetry, antisymmetry, and 
transitivity. 


а. 5 = 2 
xpy €» x = ky for some integer k 
b. S=Z 


xpy €» there is a prime number p such that p|x and p|y 
c. S = g({1, 2, 3, 4, 5, 6, 7, 8, 9}) 
ApB € A(B = Ø 
d. S = 9([1, 2,3, 4, 5, 6, 7, 8, 9) 
ApB «A = В 
Let S be the set of people in the United States. Test the following binary relations on S for reflexivity, sym- 
metry, antisymmetry, and transitivity. 
a. x py < x is at least as tall as y. 
b. xpy < x is taller than y. 
с. xpy €» x is the same height as y. 
d. хру <x is a child of y. 


Let S be the set of people in the United States. Test the following binary relations on S for reflexivity, 
symmetry, antisymmetry, and transitivity. 


a. xpy < x is the husband of y. 

b. xpy «» x is the spouse of y. 

с. хру < x has the same parents as y. 
d. хру €» x is the brother of y. 


. For each case, think of a set S and a binary relation p on S (different from any in the examples or prob- 


lems) satisfying the given conditions. 

a. p is reflexive and symmetric but not transitive. 

b. p is reflexive and transitive but not symmetric. 

с. p is not reflexive or symmetric but is transitive. 

d p is reflexive but neither symmetric nor transitive. 


22. 


23. 
24. 
25. 


26. 


27. 
28. 
29. 


30. 


3 


— 
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Let p and с be binary relations on a set S. 

a. If p and с are reflexive, is p U ø reflexive? Is р N с reflexive? 

b. If p and с are symmetric, is р U с symmetric? Is р N с symmetric? 

c. If p and с are antisymmetric, is p U с antisymmetric? Is p N с antisymmetric? 

d. If p and ø are transitive, is p U с transitive? Is р N с transitive? 

Find the reflexive, symmetric, and transitive closure of each of the relations in Exercise 11. 
Find the reflexive, symmetric, and transitive closure of each of the relations in Exercise 12. 
Given the following binary relation 


S — set of all cities in the country 
x py <> Take-Your-Chance Airlines flies directly from x to y 


describe in words what the transitive closure relation would be. 
Two additional properties of a binary relation p are defined as follows: 


pis irreflexive means: (Vx)(x € S — (x, x) € p) 
р is asymmetric means: (Vx) Vy)x є 5 Лує 5 Л (х,у) єр (у, х) € p) 


а. Give an example of a binary relation p on set S = (1, 2, 3} that is neither reflexive nor irreflexive. 

b. Give an example of a binary relation p on set S = {1, 2, 3} that is neither symmetric nor asymmetric. 
c. Prove that if p is an asymmetric relation on a set S, then p is irreflexive. 

d. Prove that if p is an irreflexive and transitive relation on a set S, then p is asymmetric. 

e. Prove that if p is anonempty, symmetric, and transitive relation on a set S, then p is not irreflexive. 
Does it make sense to look for the irreflexive closure of a relation? (See Exercise 26.) Why or why not? 
Does it make sense to look for the asymmetric closure of a relation? (See Exercise 26.) Why or why not? 


Let S be an n-element set. How many different binary relations can be defined on S? (Hint: Recall the 
formal definition of a binary relation.) 


Let p be a binary relation on a set S. For A С 5, define 


#A = {x|x ESA (Му)(ує A > xpy)} 
A# = {x|x ESA (Wy) € 4 > ypx)} 


a. Prove that if p is symmetric, then #A = A. 

b. Prove that if 4 C B then #B C #A and B¥ C AF. 
c. Prove that А C (4). 

d. Prove that 4 C #(4#). 


. Draw the Hasse diagram for the following partial orderings. 


a. S = {a,b,c} 

p = {(a, a), (b, b), (с, с), (a, b), (b, с), (a, с)} 
b. S = {a, b, c, d} 

p = {(a, a), (b, b), (c, с), (d, d), (a, b), (a, с)} 
c. S = {Ø, {a}, (a, b}, 4с}, а, c}, {Ь}} 


ApBOACB 


32. 
33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 
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For Exercise 31, name any least elements, minimal elements, greatest elements, and maximal elements. 
Let (S, <) be a partially ordered set, and let А C S. Prove that the restriction of < to А is a partial 
ordering on А. 


a. Draw the Hasse diagram for the partial ordering “x divides y” on the set {2, 3, 5, 7, 21, 42, 105, 210}. 
Name any least elements, minimal elements, greatest elements, and maximal elements. Name a totally 
ordered subset with four elements. 

b. Draw the Hasse diagram for the partial ordering “x divides y” on the set {3, 6, 9, 18, 54, 72, 108, 162}. 
Name any least elements, minimal elements, greatest elements, and maximal elements. Name any 
unrelated elements. 

Draw the Hasse diagram for each of the two partially ordered sets. 

a. $ = {1, 2,3, 5, 6, 10, 15, 30} b. 5 = o({1, 2,3} 
xpy €» x divides y ApBoACB 

What do you notice about the structure of these two diagrams? 

For each Hasse diagram of a partial ordering in the accompanying figure, list the ordered pairs that belong 

to the relation. 


a. 5 b. 


is) 
— 
e 
сл 


> 
^5 


Let (S,p) and (7, с) be two partially ordered sets. A relation ju on S X T is defined by 
(51, 1) u (S5, £5) €» s, ps, and tja tj. Show that u is a partial ordering on 5 X T. 


Let p be a binary relation on a set S. Then a binary relation called the inverse of p, denoted by р !, is 


defined by хр !y < ypx. 

a. For p = {(1, 2), (2, 3), (5, 3), (4, 5)} on the set №, whatisp !? 

b. Prove that if p is a reflexive relation on a set S, then р! is reflexive. 

c. Prove that if p is a symmetric relation on a set S, then р! is symmetric. 

d. Prove that if p is an antisymmetric relation on a set S, then р! is antisymmetric. 

e. Prove that if p is a transitive relation on a set S, then р! is transitive. 

f. Prove that if p is an irreflexive relation on a set S (see Exercise 26), then р! is irreflexive. 


g. Prove that if p is an asymmetric relation on a set S (see Exercise 26), then р! is asymmetric. 


Prove that if a binary relation p on a set S is reflexive and transitive, then the relation рр! is an 


equivalence relation (see Exercise 38 for the definition of p~ !). 


a. Let (S, p) be a partially ordered set. Then р! can be defined as in Exercise 38. Show that (S, р!) isa 
partially ordered set, called the dual of (S, p). 
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b. If (S, p) is a finite, partially ordered set with the Hasse diagram shown, draw the diagram of the dual of 
(S, p). 


c. Let (S, p) be a totally ordered set and let X = {(x, x)|x є S]. Show that the setdifference p ! — X 
equals the set p'. 


41. A computer program is to be written that will generate a dictionary or the index for a book. We will 
assume a maximum length of n characters per word. Thus, we are given a set S of words of length at most 
n, and we want to produce a linear list of these words arranged in alphabetical order. There is a natural 
total ordering < on alphabetic characters (a « b, b < c, etc.), and we will assume our words contain 
only alphabetic characters. We want to define a total ordering < on S called a lexicographical ordering 
that will arrange the members of S alphabetically. The idea is to compare two words X and Y character by 
character, passing over equal characters. If at any point the X character alphabetically precedes the corre- 
sponding Y character, then X precedes Y; if all characters in X are equal to the corresponding Y characters 
but we run out of characters in X before characters in Y, then X precedes Y. Otherwise, Y precedes X. 

Formally, let X = (xi, x7, ..., xj) and Y = (у, у, ... , yj) be members of S with j = k. Let B (for 
blank) be a new symbol, and fill out X with k — j blanks on the right. X can now be written (x1, x», ..., х). 
Let B precede any alphabetical character. Then X < Y if 


x, 7 y, and x, X y, 


or 


Xi 5 ур 0 — у, +009 Xm = У„(т = k) 


Xm+1 7 Ym+1 and х, i x Ут+1 


Otherwise, Y < X. 


Note that because the ordering < on alphabetical characters is a total ordering, if Y < X by “other- 
wise," then there exists m = k such that x, = у, X3 = y5,..., Xm = Ym Xm 1 Æ Уап у Ж 41: 
Show that < on S as defined above is a total ordering. 


42. Apply the total ordering described in Exercise 41 to the words Боо, bug, be, bah, and bugg. Note why each 
word precedes the next. 


43. Exercise 41 discusses a total ordering on a set of words of length at most n that will produce a linear list 
in alphabetical order. Suppose we want to generate a list of all the distinct words in a text (for example, 
a compiler must create a symbol table of variable names). As in Exercise 41, we assume that the words 
contain only alphabetical characters because there is a natural precedence relation already existing 
(a < b, b < c, and so on). If numeric or special characters are involved, they must be assigned а 
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precedence relation with alphabetical characters (the collating sequence must be determined). If we 
list words alphabetically, it is a fairly quick procedure to decide whether a word currently being pro- 
cessed is new, but to fit the new word into place, all successive words must be moved one unit down 
the line. If the words are listed in the order in which they are processed, new words are simply tacked 
onto the end and no rearranging is necessary, but each word being processed has to be compared with 
each member of the list to determine if it is new. Thus, both logical linear lists have disadvantages. 

We describe a structure called a binary search tree; using this structure, a search process called a 
binary tree search can usually determine quickly whether a word is new and, if it is, no juggling is re- 
quired to fit it into place, thus eliminating the disadvantages of both linear list structures described earlier. 
Suppose we want to process the phrase “when in the course of human events.” The first word in the text 
is used to label the first node of a graph. Once a node is labeled, it drops down a left and right arc, putting 
two unlabeled nodes below the one just labeled. 


JN 


When the next word in the text is processed, it is compared with the first node. When the word being pro- 
cessed alphabetically precedes the label of a node, the left arc is taken; when the word follows the label 
alphabetically, the right arc is taken. The word becomes the label of the first unlabeled node it reaches. 
(If the word equals a node label, it is a duplicate, so the next word in the text is processed.) This procedure 
continues for the entire text. Thus, 


when 
then 
when 
in 
the 
then 
when 


in 
course the 


44. 


45. 


4 


a 
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until finally 


when 


course 


events 


By traversing the nodes of this graph in the proper order (described by always processing the left nodes 
below a node first, then the node, then the right nodes below it), an alphabetical listing “course, events, 
human, in, of, the, when" is produced. 


a. This type of graph is called a tree. (Unlabeled nodes and arcs to unlabeled nodes usually are not 
shown.) Turned upside down, the graph can be viewed as the Hasse diagram of a partial ordering <. 
What would be the least element? Would there be a greatest element? Which of the following ordered 
pairs would belong to <: (in, of), (the, of), (in, events), (course, of)? 

Here the tree structure contains more information than the partial ordering, because we are inter- 
ested in not only whether a word у; precedes a word w, in the partial ordering sense but also whether 
w, is to the left or right of w;. 


b. Build a binary search tree for the phrase “Old King Cole was a merry old soul.” Eliminate unla- 
beled nodes. Considering the upside-down graph as the Hasse diagram of a partial ordering, name the 
maximal elements. 


The alphabetical ordering defined in Exercise 41 can be applied to words of any finite length. If we define 
A* to be the set of all finite-length “words” (strings of characters, not necessarily meaningful) from the 
English alphabet, then the alphabetical ordering on A* has all words composed only of the letter А preced- 
ing all other words. Thus, all the words in the infinite list 


a, ad, aaa, aaaa, ... 


precede words such as “b” or “aaaaaaab.” Therefore this list does not enumerate A*, because we can 
never count up to any words with any characters other than a. However, the set A* is denumerable. Write 
a partial enumeration of A* by ordering words by length (all words of length 1 precede all words of length 
2, and so on) and then alphabetically ordering words of the same length. 


a. For the equivalence relation р = {(a, a), (b, Б), (c, с), (a, c), (c, a)}, what is the set [a]? Does it have 
any other names? 

b. For the equivalence relation p = {(1, 1), (2, 2), (1, 2), (2, 1), (1, 3), (3, 1), (3, 2), 2, 3), (3, 3), (4, 4), 
(5, 5), (4, 5), (5, 4)} what is the set [3]? What is the set [4]? 


. Prove that for any positive integer n, congruence modulo л is an equivalence relation on the set Z. 
4T. 
48. 
49. 


For the equivalence relation of congruence modulo 2 on the set Z, what is the set [1]? 
For the equivalence relation of congruence modulo 5 on the set Z, what is the set [—3]? 
Assume that x = y (mod n) and z = w (mod n) for some positive integer n. Prove that 
a. X + z = y + w (mod n) 

b x — 2 = у – w(modn) 

с. х=2 = y: w (mod n) 

d. х? = y" (mod n) fors 2 1,n = 2 
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50. Let p be a prime number. Prove that x? = уг (mod p) if and only if x = y (mod p) or x = —y (mod p). 


51. а. Given the partition (1, 2) and (3, 4} of the set S = (1, 2, 3, 4}, list the ordered pairs in the correspond- 
ing equivalence relation. 


b. Given the partition (a, b, c) and (d, e) of the set S = (a, b, c, d, е}, list the ordered pairs in the cor- 
responding equivalence relation. 


52. Let S be the set of all books in the library. Let p be a binary relation on 5 defined by x py <> “the color of 
x’s cover is the same as the color of y's cover.” Show that p is an equivalence relation on S and describe 
the resulting equivalence classes. 


53. Let S = N and let p be a binary relation on S defined by x py €» x? — y? is even. Show that p is an equiva- 
lence relation on S and describe the resulting equivalence classes. 


54. Let S = К and let p be a binary relation on S defined by xpy <+> x — y is an integer. 
a. Show that p is an equivalence relation on S. 
b. List 5 values that belong to [1.5]. 


55. Let S = № х N and let p be a binary relation on S defined by (x, y) p(z, w) +> y = w. Show that p is an 
equivalence relation on S and describe the resulting equivalence classes. 


56. Let S = N х Nand let p bea binary relation on S defined by (х, y)p(z, w) €» x + y = z + w. Show that 
p is an equivalence relation on S and describe the resulting equivalence classes. 


57. Let S be the set of all binary strings of length 8 and let p be a binary relation on 5 defined by хру < 
y starts with the same bit value (0 or 1) as x and y ends with the same bit value (0 or 1) as x. 


a. Show that p is an equivalence relation on S. 

b. How many strings are in the set S? 

c. Into how many equivalence classes does p partition S? Explain your answer. 
d. How many strings are in each equivalence class? 


58. The documentation for the Java programming language recommends that when a boolean “equals method" 
is defined for an object, it should be an equivalence relation. That is, if p is defined by x py «» x.equals(y) 
for all objects in the class, then p should be an equivalence relation. In a graphics application, a programmer 
creates an object called a point, consisting of two coordinates in the plane. The programmer defines an 
equals method as follows: If p and q are any two points in the plane, then 


p.equals(q) <> the distance from p to q is = c 


where c is a small positive number that depends on the resolution of the computer display. Is the program- 
mer's equals method an equivalence relation? Justify your answer. 


59. Let S be the set of all propositional wffs with n statement letters. Let p be a binary relation on S defined 
by РрО < “Р < Q is a tautology.” Show that p is an equivalence relation on 5 and describe the resulting 
equivalence classes. (We have used the notation P + О for PpQ.) 


60. Given two partitions тг; and m, of aset 5, 7, is a refinement of m, if each block of 77, is a subset of a block 

of 72. Show that refinement is a partial ordering on the set of all partitions of S. 
Exercises 61—72 all deal with partitions on a set. 

61. Let P, denote the total number of partitions of an n-element set, n = 1. The numbers P, are called Bell 
numbers. Compute the following Bell numbers. 
a. Pi 
b. Р, 
c. Р, 
d. P, 


62. 


63. 


64. 
65. 


6 


a 


67. 
68. 


69. 


70. 


7 


[EIN 


72. 
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From Exercise 61, you might be looking for a closed-form formula giving the value of P,. Although Bell 
numbers have been extensively studied, no closed-form formula has been found. Bell numbers can be 
computed via a recurrence relation. Let P, have the value 1. Prove that for n = 1, 


n-l 
P,= У С(п – 1, ЮР, 
к= 0 


(Hint: Use а combinatorial proof instead of an inductive proof. Let x be a fixed but arbitrary member 
of a set with п elements. In each term of the sum, п — К represents the size of the partition block that 
contains x.) 


Use the formula of Exercise 62 to compute P,, Р», P4, and Р,, and compare your answers to those in 
Exercise 61. 


Use the formula of Exercise 62 to compute P; and Р,. 


Let S(n, k) denote the number of ways to partition a set of n elements into k blocks. The numbers S(n, k) 
are called Stirling numbers. 


a. Find S(3, 2). 
b. Find S(4, 2). 


. Prove that for all n = 1, S(n, k) satisfies the recurrence relation 


S(n, 1) = 1 
S(n,n) = 1 
S(n + 1,k + 1) = S(n,k) + (k + DS(n,k + 1) forl sksn 


(Hint: Use a combinatorial proof instead of an inductive proof. Let x be a fixed but arbitrary member of 
a set with п + 1 elements, and put x aside. Partition the remaining set of п elements. A partition of the 
original set could be obtained either by adding {x} as a separate block or by putting x in one of the existing 
blocks.) 


Use the formula of Exercise 66 to rework Exercise 65. 


The recurrence relation of Exercise 66 is similar to Pascal’s formula, Equation (1) of Section 4.5. Use this 
relation to compute the numeric values in the first five rows of Stirling 5 triangle, which begins 


S(1, 1) 
SQ,1) 80,2) 
80,1) 50:2) 50,3) 


Prove that 
Р„= У, Sn, k 
k=1 


Use the formula of Exercise 69 and Stirling’s triangle (Exercise 68) to compute P,, Р», P3, and P4. 


. Find the number of ways to distribute 4 different-colored marbles among 3 identical containers so that no 


container is empty. 


Find the number of ways in which 5 different jobs can be assigned to 3 identical processors so that each 
processor gets at least 1 job. 
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73. Binary relations on a set S are ordered pairs of elements of S. More generally, an n-ary relation on a set S 
is a set of ordered n-tuples of elements of S. Decide which of the given items satisfy the relation. 


a. pa unary relation on 2, x € р €» x is a perfect square 


25, 39, 49, 62 


b. p a ternary relation on N, (x,y,z) E pex + у =: 


(1, 1, 2), (3, 4, 5), (0, 5, 5), (8, 6, 10) 
c. pa 4-агу relation on Z, (x, y,z, w) € p © y = |x| and w = x + 7° 
(—4, 4, 2, 0), (5, 5, 1, 5), (6, — 6, 6, 45), (—6, 6, 0, —2) 
74. A ternary relation p is defined on the set 5 = (2, 4, 6, 8) by (х, у, 2) E pox + y =z. List the 3-tuples 


that belong to p. 


75. If x 1s a real number, х = 0, then a number y such that x: y = 1 is called the multiplicative inverse of 
x. Given positive integers x and n, a positive integer y such that х · y = 1 (mod n) is called the modular 
multiplicative inverse of x modulo п. But 


x:y = 1 (modn) 
«»x:y — 1 = kn where kis an integer 
< ху —kn=1 
< 1 is a linear combination of x and и 
< gcd (x,n) = 1 
< x and n are relatively prime 


Thus, if x and n are not relatively prime, the modular inverse of x does not exist. If they are relatively 
prime, the modular inverse of x is the positive coefficient of x in the linear combination of x and n that 
equals 1. 

Use the Euclidean algorithm to find the modular multiplicative inverse of 21 modulo 25 (note that 21 
and 25 are relatively prime). 


76. Use the Euclidean algorithm to find the modular multiplicative inverse of 68 modulo 15 (see Exercise 75). 


SECTION 5.2 | TOPOLOGICAL SORTING 


If p is a partial ordering on a set S, then some elements of S are predecessors of 
other elements. If S is a set of tasks that are to be done, then the idea of x as a pre- 
decessor of y can be interpreted literally to mean that task x must be done before 
task y. Thus partial orderings and Hasse diagrams are natural ways to represent 
problems in task scheduling. 


| EXAMPLE | Ernie and his brothers run a woodworking shop in the hills of New Hampshire 
th 


at manufactures rocking chairs with padded cushion seats. The manufacturing 
process can be broken down into a number of tasks, some of which have certain 
other tasks as prerequisites. The following table shows the manufacturing tasks for 
a rocking chair, the prerequisite tasks, and the number of hours required to perform 
each task. 
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We can define a partial ordering on the set of tasks by 


Task Prerequisite Tasks Hours to Perform 
1. Selecting wood None 3. 0 
2. Carving rockers 1 4.0 
3. Carving seat 1 6.0 
4. Carving back 1 0 
5. Carving arms 1 3.0 
6. Selecting fabric None 1.0 
7. Sewing cushion 6 2.0 
8. Assembling back and seat 3,4 2.0 
9. Attaching arms 5,8 2.0 

10. Attaching rockers 2,8 3.0 

11. Varnishing 9,10 Бу. (0) 

12. Adding cushion Th Vi ORS 


x x y task x = task y or task x is a prerequisite to task y 


It is easy to see that this relation is reflexive, antisymmetric, and transitive. Also, 


x < y task x is a prerequisite to task y 


In the Hasse diagram for this partial ordering, the nodes are tasks; we'll add 
to each node the information about the time to perform the task. Also, as is tradi- 
tional, we'll orient the diagram so that if x < y, then x is to the left of y rather than 
below y. Thus the entire diagram runs from left to right rather than from bottom to 
top. Such a diagram for task scheduling is often called a PERT(program evalu- 
ation and review technique) chart, first developed for tracking the construction 
of Navy submarines but useful for managing any complex project with a num- 
ber of subtasks. The PERT chart for manufacturing rocking chairs is shown in 
Figure 5.7, with task numbers substituted for task names and arrows pointing to 
a task from its prerequisite task(s). The numbers in parentheses indicate the time 


required to perform the task. 


2(4.0) 


1(3.0) 


5(3.0) 


10(3.0) 


11(5.0) 


>e 


e 
6(1.0) 7(2.0) 


Figure 5.7 


>e 12(0.5) 
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PRACTICE 17 | Construct the PERT chart for building a house from the following task table. 


Task Prerequisite Tasks Days to Perform 

1. Clearing lot None 4 

2. Pouring pad 1 3 

3. Doing framing 2 7 

4. Shingling roof 3 6 

5. Adding outside siding 3 4 

6. Installing plumbing and wiring 4,5 6 

7. Hanging windows and doors 3 5 

8. Installing wallboard 6 5 

9. Painting interior 718 5 = 


A project represented by a PERT chart must begin with the tasks at the left- 
most edge of the PERT chart and end with the tasks at the rightmost edge. An 
upper limit on the time required to complete the project can be obtained by adding 
the times for performing each task, but this does not take into account the fact 
that perhaps some tasks can be performed in parallel, such as tasks 2 through 5 in 
Example 16. To obtain the minimum time required to complete the project, we can 
move through the chart from left to right, computing for each node the minimum 
time to complete the work from the beginning through the work at that node. Ifa 
node x has multiple nodes as prerequisites, all the prerequisite tasks must be com- 
pleted before we can begin work on x; thus we must add the time for task x to the 
maximum completion time of the prerequisite nodes. 


| | EXPL | Let's compute the time for completing each task in Example 16. 


Task 1: 330 

Task 2: 3.0+4.0=7.0 

Task 3: 3,07 09.0 = 9,0 

Task 4: 3.0+ 7.0 = 10.0 

Task 5: 3.0+3.0=6.0 

Task 6: 1.0 

Task 7: 1.0+2.0= 3.0 

Task 8: max(time to complete task 3, time to complete task 4) 


+ time to perform task 8 
= шер, й, ШЙ) аг 2%) = че Эй a2 50, 
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Task 9: max(time to complete task 5, time to complete task 8) 
+ time to perform task 9 
= max(6.0, 12.0) + 2.0 = 12.0 + 2.0 = 14.0 


Task 10: ^ max(time to complete task 2, time to complete task 8) 
+ time to perform task 10 
= max(7.0, 12.0) + 3.0 = 12.0 + 3.0 = 15.0 


Task 11: max(time to complete task 9, time to complete task 10) 
+ time to perform task 11 
= max(14.0, 15.0) + 5.0 = 15.0 + 5.0 = 20.0 


Task 12: max(time to complete task 7, time to complete task 11) 
+ time to perform task 12 
= max(3.0, 20.0) + 0.5 = 20.0 + 0.5 = 20.5 


Therefore the minimum number of hours to manufacture a rocking chair is 20.5. 
From node 12, we can travel back in the chart, selecting at each point of mul- 
tiple prerequisites the node that contributed the maximum value. This gives the 
sequence of nodes 


2 JUL, ЛО). 844, 1 
or, reversing this sequence, 
1, 4, 8, 10, 11, 12 


The sum of the times to perform each task in this sequence is 20.5. If any of 
these tasks takes longer to perform than its allotted time, the entire project will 
take longer than 20.5 hours. This sequence of nodes is a critical path through the 
PERT chart—performing these tasks in the allotted time is critical to completing 
the entire project on time. ө 


The critical path їп а PERT chart represents the minimum time to completion 
of the entire project. If a task not on the critical path takes longer than its allotted 
time to perform, then the critical path may shift to include this node, because it 
then becomes the bottleneck slowing down completion of the total project. In a 
complex project, the critical path must continually be recomputed to determine 
where best to allocate resources to move the project forward. 


PRACTICE 18 | Compute the minimum time to completion and the nodes on the critical path for the house- 
building project of Practice 17. 


Given a partial ordering p ona finite set, there is always a total ordering o that 
is an extension of p, meaning that if x py, then хосу. The process of topological 
sorting finds such a total ordering from a partial ordering. This is indeed a sorting 
process in the sense that the objects end up being totally ordered, but since they 
must be partially ordered to begin with, it is a very specialized sorting process. 

Recall that in a finite partially ordered set, an element is minimal if it has 
no predecessors. In a finite nonempty partially ordered set, at least one minimal 
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element must exist. To see this, let x belong to the set. If x is not minimal, then 
there is a y in the set with y px, y # x. If y is not minimal, then there is a z in the 
set with z py, z = y, and so on. Because the set is finite, this process cannot go on 
indefinitely, so one such element must be minimal. A minimal element in a Hasse 
diagram has no elements below it; a minimal element in a PERT chart has no ele- 
ments to its left. 

The accompanying pseudocode algorithm for topological sorting operates on 
a partially ordered set (5, р). Minimal elements (picked at random if there is a 
choice of minimal elements at any stage) are repeatedly removed from the ordered 
set until the set is empty. Each removal of a minimal element leaves a finite par- 
tially ordered set, so that another minimal element may be found. 


ALGORITHM TOPOLOGICALSORT 


TopSort(finite set S; partial ordering p on 5) 
//find a total ordering on S that is an extension of p 
Local variable 
integer i //enumerates tasks in total ordering 
i=1 
while 5 A © 
pick a minimal element x; from S; 
== 4х) 
і=і+ 1 
end while 
/ [xi < x, < x; < < х, is now a total ordering that extends р 
МИШӘ ПОЛ р Way осо оз 
end function TopSort 


The ordering x, < x; < x; <: < x, produced by this algorithm is a total 
ordering. To see that it is an extension of p, suppose that x; px;. Then x; precedes 
x; and x; must be chosen as a minimal element and removed from the set before x; 
can be chosen as a minimal element. Therefore i < j and х; < x,. 


| EXAMPLE 18 | П One topological sort of the partial ordering of Example 16 is 


Oy 1L, 775 By By Dy Shy Bo 10), 9, Mi, 12 


In Figure 5.7, either 6 or 1 is minimal and may be chosen as the first element. If 6 
is chosen and removed from the set, then, as shown in Figure 5.8, either 1 or 7 is 
minimal. If 1 is then chosen and removed from the set (Figure 5.9), then 2, 3, 4, 5, 
and 7 are all minimal and any one can be chosen next. The process continues until 
all nodes have been chosen. If Ernie's brothers all move to the city and he is left 
to build rocking chairs alone, the topological sort gives an order in which he can 
perform tasks sequentially. 
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1(3.0) 


Figure 5.8 


10(3.0) 


2(4.0) 


3(6.0) 
— 


4(7.0) 


11(5.0) 


5(3.0) 
© >e 12(0.5) 
7(2.0) 
Figure 5.9 


PRACTICE 19 | Find another topological sort for the partial ordering of Example 16. = 


PRACTICE 20 | Find a topological sort for the partial ordering of Practice 17. = 


The algorithm given here for topological sorting is still somewhat imprecise, 
as we have not given a mechanical method for finding a minimal element. Another 


algorithm will be described in Section 7.4. 


SECTION 5.2 REVIEW 


MAIN IDEAS 


* PERT charts are diagrams of partially ordered 
sets representing tasks and prerequisites among 
tasks. 

* A topological sort extends a partial ordering on a 
finite set to a total ordering. 


TECHNIQUES 


(2) Construct a PERT chart from a task table. 
* Find the critical path in a PERT chart. 
(0) Do a topological sort on a partially ordered set. 
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EXERCISES 5.2 


1. The following tasks are required in order to assemble a bicycle. As the manufacturer, you must write a 
list of sequential instructions for the buyer to follow. Will the sequential order given below work? Give 
another sequence that could be used. 


Task Prerequisite Tasks 
1. Tightening frame fittings None 

2. Attaching handle bars to frame 1 

3. Attaching gear mechanism 1 

4. Mounting tire on wheel assembly None 

5. Attaching wheel assembly to frame 1,4 

6. Installing brake mechanism 259,5 

7. Adding pedals 6 

8. Attaching seat 1 

9. Adjusting seat height 7,8 


2. Givea list of tasks and prerequisite tasks for cooking and serving a hamburger. 
3. Construct a PERT chart from the following task table. 


Task Prerequisite Tasks Time to Perform 
A E 3 
B C,D 5 
C A 2 
D A 6 
E None 2 
F A, G 4 
G E 4 
H В, Е 1 


4. Construct a PERT chart from the following task table. 


Task Prerequisite Tasks Time to Perform 
1 2 4 
2 3 2 
3 8 5 
4 3 2 
5 4,7 2 
6 5 1 
7 3 3 
8 None 5 


10. 


11. 
12. 
13. 
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. Compute the minimum time to completion and the nodes on the critical path for the problem in Exercise 3. 
. Compute the minimum time to completion and the nodes on the critical path for the problem in Exercise 4. 


. For the problem in Exercise 3, great improvements in productivity have knocked down the time to per- 


form task D from 6 units to 1 unit. Recompute the minimum time to completion and the nodes on the 
critical path. 


. For the problem in Exercise 4, an extra quality-control step has been added to task 4, which now requires 


4 units of time to perform. Recompute the minimum time to completion and the nodes on the critical path. 


. Do a topological sort on the partially ordered set shown. 


G A 
B 
F 
e 
H D E 
Do a topological sort on the partially ordered set shown. 
B өр 


Find а topological sort for ће problem in Exercise 3. 
Find a topological sort for the problem in Exercise 4. 


Given the following task chart for fixing an Asian dinner, find a total ordering in which the tasks can be 
performed sequentially. 


Task Prerequisite Tasks 
1. Chop onions 9 
2. Wash lettuce 11 
3. Make dressing 11 
4. Do stir fry 10 
5. Toss salad 2,3 
6. Cut up chicken None 
7. Grate ginger 9 
8. Chop bok choy 9 
9. Marinate chicken 6 
10. Heat wok 1, 7, 8, 11 
11. Prepare rice None 
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14. A U.S. journalist, on being posted to a bureaucratic foreign country, was faced with the following tasks 


before she 


could begin work. 


. Obtain a residence permit from the Public Security Bureau 


Prerequisite Tasks 


. Obtain a health certificate from the local hospital 


. Obtain a journalist work card from the Foreign Ministry 


. Obtain a customs certificate from the Customs Office 


. Post an announcement in the local newspaper about the pres- 
ence of her news company in the country 


. Obtain a journalist visa from the Public Security Bureau 


7. Obtain a foreign journalist housing contract from the local 


housing authority 


. Obtain a news organization permit from the Foreign Ministry. 


. Pick up her shipment of belongings from the United States 


Find a total ordering in which the tasks can be performed sequentially. 
15. Recall the problem posed at the beginning of this chapter: 


Your company has developed a program for use on a small parallel processing machine. According to 
the technical documentation, the program executes processes P1, P2, and P3 in parallel; these processes 
all need results from process P4, so they must wait for Process P4 to complete execution before they 
begin. Processes P7 and P10 execute in parallel but must wait until Processes P1, P2, and P3 have 
finished. Process P4 requires results from P5 and P6 before it can begin execution. P5 and P6 execute 
in parallel. Processes P8 and P11 execute in parallel but P8 must wait for Process P7 to complete, and 
Process P11 must wait for P10 to complete. Process P9 must wait for results from P8 and P11. You have 
been assigned to convert the software for use on a single processor machine. 


Use a topological sort to determine the order in which the processes should be executed sequentially. 


16. Given the following task chart for conducting a water quality study, find a total ordering in which the tasks 
can be performed sequentially. 


Task Prerequisite Tasks 
1. Plan schedule 7 
2. Collect new data 1,10 
3. Assemble team 7 
4. Collect prior water quality data 6 
5. Obtain equipment 7 
6. Identify streams None 
7. Decide on readings needed 4,8 
8. Review federal and state guidelines None 
9. Write report 11 
10. Distribute equipment 3,5 
11. Analyze new data 2 
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SECTION 5.3 | RELATIONS AND DATABASES 


A database is a storehouse of associated information about some enterprise. The 
user of a database can certainly retrieve some specific fact stored in the database. 
But a well-designed database is more than simply a list of facts. The user can per- 
form queries on the database to retrieve information not contained in any single 
fact. The whole becomes more than the sum of its parts. 

To design a useful and efficient computerized database, it is necessary 
to model or represent the enterprise with which the database is concerned. A 
conceptual model attempts to capture the important features and workings of 
the enterprise. Considerable interaction with those who are familiar with the 
enterprise may be required to obtain all the information necessary to formulate 
the model. 


Entity-Relationship Model 


One high-level representation of an enterprise is the entity-relationship model. 
In this model, important objects, or entities, in the enterprise are identified, 
together with their relevant attributes or properties. Then the relationships be- 
tween these various entities are noted. This information is represented graphi- 
cally by an entity-relationship diagram, or E-R diagram. In an E-R diagram, 
rectangles denote entity sets, ellipses denote attributes, and diamonds denote 
relationships. 


The Pet Lovers of America Club (PLAC) wants to set up a database. PLAC has 
bought mailing lists from commercial sources, and it is interested in people who 
own pets and in some basic information about those pets, such as the name, type 
of pet (dog, cat, and so on), and the breed. 

Figure 5.10 shows an E-R diagram for the PLAC enterprise. This diagram 
says that persons and pets are the entities. Persons have the attributes of Name, 
Address, City, and State. Pets have the attributes of PetName, PetType, and Breed. 
The diagram also shows that persons own pets. Thinking of the entities as sets, the 
Person set and the Pet set, the relationship “owns” is a binary relation from Person 
to Pet—the ownership relation is captured by (person, pet) ordered pairs. The “1” 
and “N” on the connecting lines indicate that this binary relation is one-to-many; 
that is, in this particular enterprise, one person can own many pets, but no pet has 
multiple owners. (Pets with multiple owners would result in a many-to-many rela- 
tion.) Also, in this example, some persons may own no pets, and some pets may 
have no owners. 

The fact that no pet has multiple owners is one of the “business rules” of 
the enterprise. Such business rules are important to identify when designing 
a database, because they can determine various features of the database, as we 
will see. 
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Pet 
Figure 5.10 ө 


Relational Model 


Another representation of an enterprise, called a relational model, can be devel- 
oped from the E-R model. Both the entity sets and the relationships of the E-R 
model become relations (in the mathematical sense) in the relational model. The 
relations are described by tables. A relational database consists of collections of 
such tables. 

An entity set table is named for the entity set. Each row in the table contains 
the values of the n attributes for a specific instance of that entity set. Thus the rela- 
tional table may be thought of as a set of n-tuples (rows), and an individual row is 
called a tuple. True to the idea of a set, no duplicate tuples exist, and no ordering 
of the tuples is assumed. The ordering of the attributes is unimportant, except that 
consistency must be maintained; that is, each column in the table contains values 
for a specific attribute in all of the tuples. The number of attributes (columns) 
is called the degree of the relation. The number of n-tuples (rows) is called the 
cardinality of the relation; it is the cardinality (in the set-theoretic sense) of the 
set of rows. 

More formally, a database relation is a subset of D, X D, X ++- X D, where 
D, is the domain from which attribute А; takes its values. This means that the 
database use of the word relation is consistent with our definition of an n-ary 
relation on multiple sets (page 330). Beyond the data in the table itself, additional 
information, sometimes called metadata— data about data—is needed to specify 
the domain for each attribute. Is the domain the set of all possible strings, or do 
these strings have to follow a specific format? Is the domain the set of integers or 
does it have to be the set of integers within some specific range? Does the attribute 
represent a date? If so, the domain has to specify the specific date format to be 
used, for example, March 13, 2014 or 3/13/14 or 3/13/2014, and so forth. In addi- 
tion, each attribute domain D; is assumed to contain a special NULL value (empty 
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value), so a given tuple could have a NULL value for one or more of its attributes. 
Any tuple (xj, x», ..., x,) in a relation table must satisfy the n-ary predicate of the 
form (Vxj)(x; € Dj). 


| | EXAMPLE 20 | The Person relation in the PLAC database might contain the following data. 


Person 

Name Address City State 
Patrick, Tom 2425 Samset Sarasota EE 
Smith, Mary 1121 Ridge Rd. Rockville IL 
Collier, Jon 429 Via Rivio Venice [[Ё 
Jones, Kate 345 Forest St. Cleveland OH 
Smith, Bob 1201 45th St. Falls City MA 
White, Janet 110 Toledo Rd. Brookville GA 
Garcia, Maria 24 E. 56th St. New York City NY 


The four attributes for each tuple are Name, Address, City, and State. The meta- 
data specify that the domain for the Name attribute is the set of strings of the form 
LastName, FirstName; the domain for the State attribute is the set of legitimate 
two-character state abbreviations. The Pet relation could be 


Pet 

PetName PetType Breed 

Spot Dog Hound 

Twinkles Cat Siamese 

Lad Dog Collie 

Lassie Dog Collie 

Mohawk Fish Moorish idol 

Tweetie Bird Canary 

Tiger Cat Shorthair е 


Because there are no duplicate tuples in a relation, giving the value of all n 
attributes of a tuple clearly distinguishes that tuple from all others. However, there 
may be a minimal subset of the attributes that can be used to uniquely identify 
each tuple. This subset is called the primary key of the relation; if the subset con- 
sists of more than one attribute, then it is a composite primary key. In the table 
describing the relation (and in the E-R diagram), the primary key is underlined in 
the row of attribute names. No component of the primary key should ever have a 
NULL (empty) value. This entity integrity constraint merely confirms that each 
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tuple must have a primary key value in order to distinguish that tuple and that all 
attribute values of the primary key are needed in order to identify a tuple uniquely. 

Another business rule of the PLAC enterprise is that all people have unique 
names; therefore Mame is sufficient to identify each tuple and was chosen as the 
primary key in the Person relation. Note that for the Person relation as shown in this 
example, State could not serve as a primary key because there are two tuples with 
State value “IL.” However, just because Name has unique values in this instance 
does not preclude the possibility of duplicate names. It is the business rule that 
determines that names will be unique. (There is no business rule that says that ad- 
dresses or cities are unique, so neither of these attributes can serve as the primary 
key, even though there happen to be no duplicates in the Person relation shown.) 

The assumption of unique names is a somewhat simplistic business rule. The 
primary key in a relation involving people is often an identifying number that is a 
unique attribute. This used to be a Social Security number, but due to privacy con- 
cerns, institutions now often generate a local unique identifier such as a student ID 
number or employee ID number, or they use a driver’s license number. Because 
PetName is the primary key in the Pet relation of Example 20, we can surmise 
the even more surprising business rule that in the PLAC enterprise, all pets have 
unique names. A more realistic scenario would call for creating a unique attribute 
for each pet, sort of a pet Social Security number, to be used as the primary key. 
This key would have no counterpart in the real enterprise, so the database user 
would never need to see it; such a key is called a blind key or surrogate key. Blind 
keys are often generated automatically by the database system using a simple se- 
quential numbering scheme. 

An attribute in one relation (called the “child” relation) may have the same 
domain as the primary key attribute in another relation (called the “parent” rela- 
tion). Such an attribute is called a foreign key (of the child relation) into the parent 
relation. A relation for a relationship (that is, for a diamond in the E-R diagram) 
between entities uses foreign keys to establish connections between those entities. 
There will be one foreign key in the relationship relation for each entity participat- 
ing in the relationship. 


The PLAC enterprise has identified the following instance of the Owns relation- 
ship. The Name attribute of Owns is a foreign key into the Person relation where 
Name is a primary key; PetName of Owns is a foreign key into the Pet relation, 
where PetName is a primary key. The first tuple establishes the Owns relationship 
between Bob Smith and Spot; that is, it indicates that Bob Smith owns Spot. 


Owns 

Name PetName 
Smith, Bob Spot 
Smith, Mary Twinkles 
Jones, Kate Lad 
Jones, Kate Lassie 
Collier, Jon Tweetie 
White, Janet Tiger 
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Persons who do not own pets are not represented in Owns, nor are pets with 
no owners. The primary key of Owns is PetName. Recall the business rule that 
no pet has multiple owners. If any pet could have multiple owners, the composite 
primary key Name/PetName would have to be used. Name alone cannot serve as 
the primary key because people can have more than one pet (for example, Jones, 
Kate, does not identify a unique tuple.) e 


In a one-to-one or one-to-many relationship such as our example, a separate 
relationship table (like Owns), while not incorrect, is also not necessary. 


Because PetName in the Owns relation is a foreign key into the Pet relation, 
the two relations can be combined (using an operation called outer join over 
PetName) to form the PetOwner relation. 


PetOwner 

Name PetName PetType | Breed 
Smith, Bob Spot Dog Hound 
Smith, Mary Twinkles Cat Siamese 
Jones, Kate Lad Dog Collie 
Jones, Kate Lassie Dog Collie 
NULL Mohawk Fish Moorish idol 
Collier, Jon Tweetie Bird Canary 
White, Janet Tiger Cat Shorthair 


This PetOwner relation can replace both the Owns relation and the Pet relation 
with no loss of information. PetOwner contains a tuple with a NULL value for 
Name. This tuple does not violate entity integrity because Name is not a compo- 
nent of the primary key but instead is still a foreign key into Person. e 


Operations on Relations 


Two unary operations that can be performed on relations are restrict and project. 
The restrict operation creates a new relation made up of those tuples of the origi- 
nal relation that satisfy certain conditions. The project operation creates a new 
relation made up of certain attributes from the original relation, eliminating any 
duplicate tuples. The restrict and project operations can be thought of in terms 
of subsets. The restrict operation creates a subset of the rows that satisfy certain 
conditions; the project operation creates a subset of the columns that represent 
certain attributes. 
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| EXAMPLES | The operation 


Restrict PetOwner where PetType = “Dog” giving DogOwner 


results in the relation DogOwner. 


DogOwner 

Name PetName PetType | Breed 
Smith, Bob | Spot Dog Hound 
Jones, Kate | Lad Dog Collie 
Jones, Kate | Lassie Dog Collie 


The operation 
Project PetOwner over (Name, PetType) giving Preference 


results in the relation Preference. 


Preference 


Name PetType 


Smith, Bob Dog 


Smith, Mary | Cat 


Jones, Kate Dog 


NULL Fish 


Collier, Jon Bird 


White, Janet | Cat 


PRACTICE 21 | Write the relation that results from the operation 


Project Person over (Name, State) giving Locale ш 


Because relations are sets of n-tuples, the binary operations of union, inter- 
section, and set difference can be applied to two relations with the same basic 
structure. Thus in our example, two different tables containing information about 
pet owners, both laid out with the same structure, could be intersected to produce 
a relation containing all the common 4-tuples. 

Another binary operation, join, can be performed on two relations with a 
common attribute (column). Theoretically, this operation initially forms the Car- 
tesian product of all n-tuples (rows) in the first relation with all k-tuples (rows) in 
the second relation. It views the result as a set of (л + &)-tuples and then restricts 
to the subset of those where the common attribute has the same value, writing the 
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result as a set of (n + k — 1)-tuples (the common attribute is written only once). 
Join is therefore not really a separate operation but is defined as the result of doing 
a Cartesian product followed by a restrict. 


| EXAMPLE 24 | The operation 


Join Person and PetOwner over Name giving Listing 


results in the Listing relation. 


Listing 

Name Address City State | PetName |PetType | Breed 
Smith, Mary |1121 Ridge Rd. | Rockville | Twinkles Cat Siamese 
Collier, Jon |429 Via Rivio | Venice JL Tweetie Bird Canary 
Jones, Kate | 345 Forest St. | Cleveland | OH Lad Dog Collie 
Jones, Kate |345 Forest St. | Cleveland | OH Lassie Dog Collie 
Smith, Bob |1201 45th St. |Falls City МА | Spot Dog Hound 
White, Janet | 110 Toledo Rd. | Brookville | GA Tiger Cat Shorthair 


The restrict, project, and join operations can be applied in various combina- 
tions to formulate queries that the user wishes to perform on the database. For 
example, suppose the query is 


Give the names of all cats whose owners live in Illinois. (1) 


If the only existing relations are Person and PetOwner, the following sequence of 
operations will produce a relation that answers this query: 


Restrict PetOwner where PetType = “Cat” giving Results! 


Results1 

Name PetName PetType | Breed 
Smith, Mary | Twinkles Cat Siamese 
White, Janet | Tiger Cat Shorthair 


Restrict Person where State = “IL” giving Results2 


Results2 


Name Address City State 


Smith, Mary | 1121 Ridge Rd. | Rockville | IL 


Collier, Jon 429 Via Rivio Venice IL 
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Join Results2 and Results! over Name giving Results3 


Results3 


Name Address City State | PetName | PetType Breed 


Smith, Mary | 1121 Ridge Rd. | Rockville 


L Twinkles | Cat Siamese 


Project Results3 over PetName giving FinalResults 


FinalResults 


PetName 


Twinkles 


This query could also be performed by first doing the join operation of Example 
24 followed by the restrict and project operations, but the join table would be much 
larger. 


Relational algebra is a theoretical relational database language in which the re- 
strict, project, and join operations can be combined. The relational algebra equiva- 
lent of the sequence of operations we did to find the names of cats whose owners 
live in Illinois would be the statement 


project (join(restrict PetOwner where PetType = “Саї?) and 
(restrict Person where State = “IL”) over Name) 
over PetName giving Final Results. (2) 


SQL is an international standard relational database language; the preceding 
query would appear as the following SQL statement, where the lines are numbered 
only for discussion purposes: 


. SELECT PetName 

. FROM PetOwner, Person 

. WHERE PetOwner. Name — Person. Name 

. AND PetType = “Cat” 

. AND State = “IL”; (3) 


л > ошоо — 


SQL’s SELECT statement can actually perform relational algebra restricts, proj- 
ects, and joins, as shown here. Lines 4 and 5 represent the two restrict operations. 
Line 2 represents the Cartesian product between the two relations and line 3 iden- 
tifies the common attribute. Therefore lines 2 and 3 together represent the join. 
Line 1 represents the project operation. AND, OR, and NOT connectives are also 
available. ө 


Instead of using the relational algebra approach, in which the restrict, project, 
and join operations are used to process a query, we can use the relational calculus 
approach. In relational calculus, instead of specifying the operations to be done 
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in order to process a query, we give a set-theoretic description of the desired result 
of the query. We specify what we want, not how to get it. Sounds like Prolog (see 
Section 1.5). In fact, the description of the set may involve notation from predicate 
logic; remember that predicate logic is also called predicate calculus, hence the 
name relational calculus. Relational algebra and relational calculus are equivalent 
in their expressive power; that is, any query that can be formulated in one lan- 
guage can be formulated in the other. 


The relational calculus expression for the query asking for the names of all cats 
whose owners live in Illinois is 


Range of x is PetOwner 

Range of y is Person 

{x.PetName | x.PetType = “Cat” and 

exists y(y.Name = x.Name and y.State = “IL”)} (4) 


Here “Range of x is PetOwner” specifies the relation from which the tuple x may 
be chosen, and “Range of y is Person” specifies the relation from which the tuple 
y may be chosen. (The use of the term range is unfortunate. We are really talking 
about domain in the same sense we talked about the domain of an interpretation in 
predicate logic—the pool of potential values.) The notation “exists у” stands for 
the existential quantifier (Эу). e 


Expressions (1) through (4) all represent the same query expressed in English 
language, relational algebra, SQL, and relational calculus, respectively. 


PRACTICE 22 | Using the relations Person and PetOwner, express the following query in relational algebra, 
SQL, and relational calculus form: 


Give the names of all cities where dog owners live. н 


REMINDER 


Any comparison involving 
a NULL value results in 
NULL. 


NULL Values and Three-valued Logic 


The value of an attribute in a particular tuple may be unknown, in which case the 
attribute is assigned a NULL value. For example, we might have the tuple 


Bruno, Dog, NULL 


in the Pet table if Bruno is a dog of unknown breed. (Note that Bruno’s breed 
might be unknown in some absolute sense, or it might simply be unknown to the 
person entering the data.) 

Because NULL means “unknown value,” any comparisons between a NULL 
value and any other value must result in NULL. Thus 


“Poodle” = NULL 


results in NULL; since the NULL value is unknown, it is also unknown whether 
it has the value “Poodle.” 
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Ordinary comparisons (does 2 = 2? does 2 = 5?) result in True or False 
values, but when a NULL value is involved, the result, as we have seen, will be 
NULL. This introduces a three-valued logic where expressions can have values 
of True, False, or NULL. Truth tables can be written for three-valued logic (see 
Exercise 53 of Section 1.1). 
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Most database management systems follow these rules of three-valued logic 
until a final truth value decision must be made, and then a NULL value gets set to 
False. But this can have unexpected consequences. For example, if Bruno is added 
to the Pet table and the following SQL query is executed 


SELECT PetName 

FROM Pet 

WHERE PetType = “Dog” 
AND NOT (Breed = “СоШе”); 


we may expect to see Bruno’s name in the resulting relation, since Bruno’s breed 
is not “Collie.” But as Bruno's attribute values are compared with the criteria 
specified in the SOL statement, we get 


PetType = “Dog” AND NOT (Breed = “СоШе”) 
“Dog” = “Dog” AND NOT (NULL = “СоШе”) 
True AND NOT NULL 

True AND NULL 

NULL 


which then is set to False, so Bruno does not satisfy this query. On second thought, 

because Bruno’s breed is NULL, he might actually be a collie; this query result 

reflects the fact that it cannot be said with certainty that Bruno is a noncollie dog. 
But consider the SQL query 


SELECT PetName 

FROM Pet 

WHERE PetType = “Dog” 
AND Breed = NULL; 
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Surely this describes Bruno. However, remember that the result of any compari- 
son involving NULL is NULL, so 


PetType = “Dog” AND Breed = NULL 
“Dog = “Dog” AND NULL = NULL 
True AND NULL 

NULL 


which then is set to False. Contrary to intuition, Bruno does not satisfy this query 
either. The only true fact about Bruno is that he is a dog. 
The SQL query 


SELECT PetName 

FROM Pet 

WHERE PetType = “Оор” 
AND Breed Is NULL; 


is a completely different query from the preceding one. The WHERE clause is 
asking whether the Breed attribute for any tuple has the value NULL. This query 
would produce the following result because Bruno is the only tuple in the Pet table 
with a NULL value for Breed. 


IsNull 


PetName 


Bruno 


Database Integrity 


New information must be added to a database from time to time, obsolete infor- 
mation deleted, and changes or updates made to existing information. In other 
words, the database will be subjected to add, delete, and modify operations. An 
add operation can be carried out by creating a second relation table with the new 
information and performing a set union of the existing table and the new table. 
Delete can be accomplished by creating a second relation table with the tuples to 
be deleted and performing a set difference that subtracts the new table from the 
existing table. Modify can be achieved by a delete (of the old tuple) followed by an 
add (of the modified tuple). 

These operations must be carried out so that the information in the database 
remains in a correct and consistent state that agrees with the business rules. En- 
forcing three "integrity rules" will help. Data integrity requires that the values 
for an attribute do indeed come from that attribute's domain. In our example, for 
instance, values for the State attribute of Person must be legitimate two-letter 
state abbreviations (or the NULL value). Entity integrity, as we discussed earlier, 
requires that no component of a primary key value be NULL. These integrity 
constraints clearly affect the tuples that can be added to a relation. 

Referential integrity requires that any values for foreign keys of child rela- 
tions into parent relations either be NULL or have values that match values in the 
corresponding primary keys of the parent relations. The referential integrity con- 
straint affects both add and delete operations (and therefore modify operations). 
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For instance, we could not add a tuple to PetOwner with a non-NULL Name value 
that does not exist in the Person relation, because this would violate the Owns 
relation as a binary relation on Person X Pet. Also, if the Bob Smith tuple is de- 
leted from the Person relation, then the Bob Smith tuple must be deleted from the 
PetOwner relation or the Name value *Bob Smith" changed to NULL (a business 
rule must specify which is to occur) so that PetOwner’s foreign key Name does not 
violate referential integrity. This prevents the inconsistent state of a reference to 
Bob Smith in PetOwner when Bob Smith no longer exists as a “Person.” 


SECTION 5.3 REVIEW 


TECHNIQUES 


W) Carry out restrict, project, and join operations in a 
relational database. 

Formulate relational database queries using rela- 
tional algebra, SQL, and relational calculus. 


EXERCISES 5.3 


MAIN IDEAS 


* A relational database uses mathematical relations, 
described by tables, to model objects and relation- 
ships in an enterprise. 

* The database operations of restrict, project, and 
join are operations on relations (sets of tuples). 

* Queries on relational databases can be formulated 
using the restrict, project, and join operations, SOL 
statements, or notations borrowed from set theory 
and predicate logic. 


Exercises 1—4 refer to the Person, Pet, and PetOwner relations of Examples 20 and 22. 


1. Consider the following operation: 


Restrict Pet where PetType = “Cat” giving Kitties 


a. Write a query in English that would result in the information contained in Kitties. 


b. What is the cardinality of the relation obtained by performing this operation? 


c. Write an SQL query to obtain this information. 


2. Consider the following operation: 


Project Person over (Name, City, State) giving Census 


a. Write a query in English that would result in the information contained in Census. 


b. What is the degree of the relation obtained by performing this operation? 


c. Write an SQL query to obtain this information. 


3. Write the results of the following operation: 


Project Pet over (PetName, Breed) giving What Am I 


4. Write the results of the following operation: 


Restrict PetOwner where PetType = “Bird” OR PetType = “Cat” giving SomeOwners 
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Exercises 5—28 are all related to the same enterprise. 


5. A library maintains a database about its books. Information kept on each author includes the author's 
name and country of origin. Information kept on each book includes the ISBN, title, publisher, and sub- 
ject. Authors and books are the entities in this enterprise, and “writes” is a relationship between these enti- 
ties. Sketch an E-R diagram for the enterprise. In the absence of any business rules, what must be assumed 
about the binary relation “writes” regarding whether it is one-to-one, one-to-many, and so on? 


6. In a relational model of the library database, there is an author relation, a book relation, and a writes rela- 
tion. Give the table heading for each of the relation tables, underlining the primary key. Business rules 
state that authors are uniquely identified by name and books are uniquely identified by ISBN. Explain your 
choice of primary key for the Writes relation table. 


For Exercises 7—16, use the following relation tables and write the results of the operations. These tables are 
sorted by primary key (or in the case of the Writes table, by the first component of the primary key), which is 
not required but is useful for Exercise 25. Many database systems maintain data in sorted order by primary key 
using a tree structure (see Section 5.1, Exercise 43). 


Author 


Name Country 


Chan, Jimmy China 


East, Jane U. S. 


King, Dorothy England 


Kovalsco, Bert U.S. 


Lau, Won China 

Nkoma, Jon Kenya 

Quercos, Tom Mexico 
Book 
ISBN Title Publisher | Subject 
0-115-01214-1 Birds of Africa Loraine Nature 


0-364-87547-X | Early Tang Paintings Bellman Art 


0-56-000142-8 | Springtime Gardening | Swift-Key | Nature 


0-816-35421-9 | Springtime Gardening | Harding Nature 


0-816-53705-4 Baskets for Today Harding Art 


0-816-88506-0 | Autumn Annuals Harding Nature 
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Writes 

Name ISBN 

Chan, Jimmy 0-364-87547-X 
East, Jane 0-56-000142-8 


King, Dorothy 
King, Dorothy 
Kovalsco, Bert 


Lau, Won 


0-816-35421-9 
0-816-88506-0 
0-816-53705-4 
0-364-87547-X 


0-115-01214-1 


Nkoma, Jon 


7. Restrict Author where Country = “0. S." giving Results7 

8. Restrict Writes where Name = “Dorothy King” giving Results8 

9. Restrict Book where Publisher = “Bellman” or Publisher = “Swift Key” giving Results9 

10. Restrict Book where Publisher = “Harding” and Subject = “Art” giving Results 10 

11. Project Author over Name giving Results11 

12. Project Author over (Name, Country) giving Results12 

13. Project Book over (Publisher, Subject) giving Results13 

14. Project Book over (ISBN, Title, Subject) giving Results 14 

15. Join Book and Writes over JSBN giving Results 15 

16. Join Author and Writes over Name giving Results 16 
For Exercises 17—23, using the relation tables given before Exercise 7, express each query in relational algebra, 
SQL, and relational calculus forms. Also give the result of each query. 

17. Give the titles of all books about art. 

18. Give the titles of all books published by Harding. 

19. Give the names of all authors who publish with Harding. 

20. Give the names of all authors who have written nature books. 

21. Give the titles of all books written by U.S. authors. 

22. Give the titles, ISBNs, and publishers of all art books whose authors live in the United States. 

23. Give the authors’ names and book titles for all art books written by English authors. 

24. If the tuple 


= 


Fleur, Suzanne NULL 
gets added to the Author table, write the results of the SQL query 


SELECT Name 

FROM Author 

WHERE Country = “U. S.” 
OR Country = NULL; 
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25. Suppose a join operation over some attribute is to be done on two tables of cardinality p and q, 
respectively. 


a. The first step is usually to form the Cartesian product of the two relations and then examine the result- 
ing tuples to find those with a common attribute value. How many tuples result from the Cartesian 
product that then have to be examined to complete the join operation? 


b. Now suppose that the two tables have each been sorted on the common attribute. Explain how the join 
operation can be done more cleverly, avoid the Cartesian product, and examine (read) at most only 


(p + д) rows. 
c. To accomplish a join operation of Author and Writes over Name, how many rows must be examined? 


d. To accomplish a join operation of Book and Writes over /SBN, how many rows must be examined? 
(See Exercise 26 for why this operation would not be a good idea anyway.) 


26. One rule of thumb about good database design is “опе fact, one place." Suppose you try to combine the 
Book and Writes tables over /SBN into a single relation as was done with the PetName and Owns relation. 
This table would have a heading of the form 


How would the resulting table violate the “опе fact, one place" rule? How many tuples have to be updated 
if the publisher “Bellman” changes its name to “Bellman-Boyd’’? 


For Exercises 27 and 28, suppose that an additional attribute called RoyaltyPercent with a domain of integers 
between 0 and 100 is added to the Writes relation. The new Writes table appears here. Because the domain of 
RoyaltyPercent 15 numerical, arithmetic comparisons can be done on a given RoyaltyPercent value. 


Writes 

Name ISBN RoyaltyPercent 
Chan, Jimmy 0-364-87547-X 20 

East, Jane 0-56-000142-8 100 

King, Dorothy 0-816-35421-9 100 

King, Dorothy 0-816-88506-0 100 

Kovalsco, Bert 0-816-53705-4 100 

Lau, Won 0-364-87547-X 80 

Nkoma, Jon 0-115-01214-1 100 


27. a. Write an SQL query to give the author's name, the title and ISBN of the book, and the royalty percent 
for all authors with a royalty percent of less than 100. 


b. Write the results of the query. 


28. What database integrity errors would be caused by attempting each of the following actions? 
a. Adding a tuple in the Writes table: Wilson, Jermain 0-115-01214-1 40 
b. Modifying a tuple in the Writes table: Chan, Jimmy 0-364-87547-Х Sixty 
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Exercises 29-36 are all related to the same enterprise. 


29. A corporation sponsors a yearly campaign to solicit monetary contributions from its employees for a lo- 
cal charity, and the company decides to use a database to keep track of the data. Employee data already 
include employee ID, first name, last name, and department. Employees sign a contribution pledge on 
a particular date, that specifies the total amount they wish to donate and the number of equal biweekly 
payroll deductions (starting with the next pay period) they want to use to pay off the total. The payroll 
department needs to know details about each payment, including the contribution pledge the payment is 
for, the payment date and the amount deducted. An employee can make multiple pledges. 


a. Do you agree that the following “data decomposition” is consistent with the enterprise description? 
If not, what should be added or what should be removed? 


Entity Attributes 


Employee EmployeelD FirstName LastName Department 


Contribution | ContributionID | EmployeelD ContributionDate | TotalAmount | NumberofPayments 


Payment ContributionID | PaymentDate | PaymentAmount 


b. Identify a primary key for each of the Employee, Contribution, and Payment entities and explain your 
choice. 


30. Draw an E-R diagram based on Exercise 29. 


3]. A "universal relation" contains all the data values in one relation. The table represents a report that might 
be distributed to the campaign manager. The universal relation as of 1/16/2014 is shown here. 


Employee | First Last Department | Contribution | Contribution | Total Ы са Payment Payment 

ID Name | Name ID Date Amount Payments | Pate Amount 
1 Mary |Black | Accounting 101 1/1/2013 $300.00 3 1/15/2013 | $100.00 
1 Mary  |Black |Accounting 101 1/1/2013 | $300.00 3 1/31/2013 $100.00 
1 Mary  |Black Accounting 101 1/1/2013 | $300.00 3 2/15/2013 | $100.00 
1 Mary Black | Accounting 105 6/1/2013 | $210.00 3 6/15/2013 | $70.00 
1 Mary |Black | Accounting 105 6/1/2013 | $210.00 3 6/30/2013 | $70.00 
1 Mary Black | Accounting 105 6/1/2013 | $210.00 3 7/15/2013 | $70.00 
2 June Brown | Payroll 107 6/1/2013  |$300.00 2 6/15/2013 $150.00 
2 June Brown | Payroll 107 6/1/2013  |$300.00 2 6/30/2013 | $150.00 
2 June Brown | Payroll 108 1/1/2014 | $600.00 12 1/15/2014 | $50.00 
3 Kevin |White |Accounting 102 1/1/2013 | $500.00 2 1/15/2013 | $250.00 
3 Kevin |White |Accounting 102 1/1/2013 | $500.00 2 1/31/2013 | $250.00 
3 Kevin |White |Accounting 109 1/1/2014 | $500.00 2 1/15/2014 | $250.00 
4 Kelly Chen |Payroll 104 4/15/2013 |$100.00 1 4/30/2013 $100.00 
6 Conner | Smith | Sales 103 1/1/2013 | $150.00 2 1/15/2013 | $75.00 
6 Conner | Smith | Sales 103 1/1/2013 | $150.00 2 1/31/2013 | $75.00 
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a. Given this universal relation, create and populate with data the three relation tables for the three entities 
described in Exercise 29. Underline the primary key in each table. 


b. Describe any foreign keys in the relation tables. 


c. Consider the form of the Employee IDs. This is probably what kind of key? 


32. a. If Mary Black moves from the Accounting Department to the Sales Department, how many tuples must 
be updated in the universal relation? 


b. The three relation tables from Exercise 31 should follow the “опе fact, one place" rule (see Exercise 
26). With the same change to Mary Black's department, how many tuples must be updated in the data- 
base using the three relation tables of Exercise 31? 


Exercises 33—36 make use of the three relation tables from Exercise 31. 


33. Write an SQL query to give the employee ID, pay dates, and payment amounts for all pay dates with 
amounts > $100. Give the result of the query. 


34. Write an SQL query to give the contribution ID, pay date, and payment amount for all payments by Mary 
Black. Give the result of the query. 


35. Write an SQL query to give the first and last names and payment amount of all employees who had a 
payroll deduction on 1/15/2013. Give the result of the query. 


36. Write an SQL query to reproduce the universal relation of Exercise 31 from the three relation tables. 


SECTION 5.4 


FUNCTIONS 


In this section we discuss functions, which are really special cases of binary rela- 
tions from a set S to a set 7. This view of a function is a rather sophisticated one, 
however, and we will work up to it gradually. 


Definition 

Function 1s a common enough word even in nontechnical contexts. A newspaper 
may have an article on how starting salaries for this year's college graduates have 
increased over those for last year's graduates. The article might say something 
like, “Тһе salary increase varies depending on the degree program,” or, “The 
salary increase is a function of the degree program." It may illustrate this func- 
tional relationship with a graph like Figure 5.11. The graph shows that each degree 
program has some figure for the salary increase associated with it, that no degree 
program has more than one figure associated with it, and that both the physical 
sciences and the liberal arts have the same figure, 1.5%. 


3.096 
2.5% 
2.0% 
1.5% 
1.0% 
0.5% 

0% 


Engineering Physical Computer Liberal arts Business 
science science 


Figure 5.11 
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Of course, we also use mathematical functions in algebra and calculus. The 
equation g(x) = x? expresses a functional relationship between a value for x and 
the corresponding value that results when the value for x is used in the equation. 
Thus an x value of 2 has the number 2? = 8 associated with it. (This number 
is expressed as g(2) = 8.) Similarly, g(1) = 1? = 1, g(—1) = (-1)° = —1, and 
so on. For each x value, the corresponding g(x) value is unique. If we were to 
graph this function on a rectangular coordinate system, the points (2, 8), (1, 1), and 
(—1, —1) would be points on the graph. If we allow x to take on any real number 
value, the resulting graph is the continuous curve shown in Figure 5.12. 


(2, 8) 


Figure 5.12 


The function in the salary increase example could be described as follows. We set 
the stage by the diagram in Figure 5.13, which indicates that the function always 
starts with a given degree program and that a particular salary increase is associ- 
ated with that degree program. The association itself is described by the set of or- 
dered pairs {(engineering, 2. 25%), (physical sciences, 1. 5%), (computer science, 
2. 75%), (liberal arts, 1. 5%), (business, 2. 0%)}. 


Salary increases by percent 
Degree programs 


Computer science 


Degree 


Figure 5.13 


Real numbers Real numbers 


Figure 5.14 


For the algebraic example g(x) = x°, Figure 5.14 shows that the function 
always starts with a given real number and associates a second real number with it. 
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The association itself is described by {(x, g(x))|g(x) = 2°}, or simply g(x) = x^. 
This set includes (2, 8), (1, 1), (— 1, — 1), but because it is an infinite set, we cannot 
list all its members; we have to describe them. 

From the above examples, we can conclude that there are three parts to a 
function: (1) a set of starting values, (2) a set from which associated values come, 
and (3) the association itself. The set of starting values is called the domain of the 
function, and the set from which associated values come is called the codomain 
of the function. Thus both the domain and codomain represent pools from which 
values may be chosen. (This usage is consistent with our use of the word domain 
when discussing predicate wffs in Section 1.2. There the domain of an interpreta- 
tion is a pool of values that variables can assume and to which constant symbols 
may be assigned. Similarly, the domain D, of an attribute A; in a database relation, 
discussed in Section 5.3, is a pool of potential values for the attribute.) 

The picture for an arbitrary function fis shown in Figure 5.15. Here fis a 
function from S to T, symbolized f: 5 — T. S is the domain and T is the codo- 
main. The association itself is a set of ordered pairs, each of the form (s, f) where 
seS,teT,andtis the value from T that the function associates with the value 
s from S; t = f(s). Hence, the association is a subset of S X T (a binary relation 
from 5 to Т). But the important property of this relation is that every member of 
S must have one and only one 7 value associated with it, so every s є S will ap- 
pear exactly once as the first component of an (5, f) pair. (This property does not 
prevent a given 7 value from appearing more than once.) 


Domain $ Codomain Т 


Figure 5.15 


We are now ready for the formal definition of a function. 


TERMINOLOGY FOR FUNCTIONS 

Let S and T be sets. A function (mapping) f from S to T, f: S — T, is a subset of 
S X T where each member of S appears exactly once as the first component of 
an ordered pair. 5 is the domain and 7 15 the codomain of the function. If (s, 0) 
belongs to the function, then ¢ is denoted by /(s); t is the image of s under 

f. sis a preimage of ¢ under f, and fis said to map s to t. For А C 5, f(A) denotes 


Ola e 4}. 


A function from S to T' is a subset of S X T with certain restrictions on the 
ordered pairs it contains. That is why we spoke of a function as a special kind of 
binary relation. By the definition of a function, a binary relation that is one-to- 
many (or many-to-many) cannot be a function. Also, each member of S must be 
used as a first component. 

We have talked a lot about values from the sets S and 7, but as our example 
of salary increases shows, these values are not necessarily numbers, nor is the as- 
sociation itself necessarily described by an equation. 
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PRACTICE 23 | Which of the following formulas are functions from the domain to the codomain indi- 
cated? For those that are not, why not? 


. fiS —>T where S = T = {1, 2, 3}, = {(1, 1), (2, 3), (3, 1), (2, D} 

‚ g: Z — N where g is defined by g(x) = |x| (the absolute value of x) 

. h: N = № where A is defined by A(x) = x — 4 

d. f: S = T where S is the set of all people in your hometown, T is the set of all automobiles, and f 
associates with each person the automobile that person owns 

e. g: S — T where S = (2013, 2014, 2015, 2016}, T = {$20,000, $30,000, $40,000, $50,000, 
$60,000}, and g is defined by the graph in Figure 5.16. 

f. h: S — T where S is the set of all quadratic polynomials in x with integer coefficients, T = Z, and 

h is defined by A(ax? + bx +c) = b * c 
. f£: R > R where fis defined by f(x) = 4x — 1 
‚ g N № where g is defined by 


х + 31Ёх > 5 
so) = | 


о с Фф 


Боо 


x Их =5 


30 


Profit in thousands of dollars 


2013 2014 2015 2016 
Year 


Figure 5.16 Profits of the American Earthworm Corp. m" 


PRACTICE 24 | Let f: Z — Z be defined by f(x) = x’. 


a. What is the image of —4? 
b. What are the preimages of 9? п 


When we studied recursive definitions in Section 3.1, we talked about sequences, 
where a sequence S was written as 

S(1), S(2), S(3), ... 
Changing the notation to 


fü). Д2), fG), ... 


we see that a sequence is nothing but a list of functional values for a function f 
whose domain is the positive integers, and this is how a sequence is often defined. 


EXAMPLE 28 
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Indeed, the algorithms we gave for computing the values in such sequences were 
pseudocode that computes the function. 

Also in Section 3.1, we talked about recursive operations such as a” where a 
is a fixed nonzero real number and л = 0. This is also simply a function f(n) = a" 
whose domain is N. e 


The definition of a function includes functions of more than one variable. We 
can have a function f: S, X 5 X ··· X S, — T that associates with each ordered 
n-tuple of elements (s;, 5, ..., Sn), S; € S, a unique element of 7: 


Ай Ж NOX {1,2} —Z is given by 05 y, 2) =x +z. Then f(—4, 3,1) = 
(-4y + 1 = —64 + 1 = -63. А 


In Section 4.1 we defined a unary operation on a set 5 as associating a unique 
member of S, x”, with each member x of S. This means that a unary operation on 
S is a function with domain and codomain S. We also defined a binary operation 

о on a set S as associating a unique member of S, x ° y, with every (x, y) pair of 
elements of S. Therefore a binary operation on $ is a function with domain S X S 
and codomain S. e 


Again, domain values and codomain values are not always numbers. 


Let S be the set of all character strings of finite length. Then the association that pairs 
each string with the number of characters in the string is a function with domain S and 
codomain N (we allow the *empty string," which has zero characters). e 


Any propositional wff with n statement letters defines a function with domain 
1T, Е)" and codomain (T, Е}. The domain consists of all n-tuples of T-F values; 
with each n-tuple is associated a single value of T or F. The truth table for the wff 
gives the association. For example, if the wff is 4 V B', then the truth table 


A | B |в | A v B' 
т | т Е т 
т F | Т T 
Е | T F F 
F F | Т т 


says that the image of the 2-tuple (F, Т) under this function is Е. If we call this 
function f, then ДЕ, Т) = Е. ө 
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PRACTICE 25 | Let the function defined by the wff А ^ (B V C") be denoted by f. What is f(T, T, Е)? 


What is ДЕ, T, Е)? 


The next example defines two functions that are sometimes useful in analyz- 
ing algorithms. 


The floor function |x| associates with each real number x the greatest integer less 
than or equal to x. The ceiling function [x] associates with each real number x the 
smallest integer greater than or equal to x. Thus |2.8] = 2, [2.8] = 3,|—4.1] = —5, 
and [—4.1] = —4. Both the floor function and the ceiling function are functions 
from R to Z. e 


PRACTICE 26 


a. Sketch a graph of the function |x]. 
b. Sketch a graph of the function [х]. п 


For any integer x and any positive integer n, the modulo function, denoted by 
f(x) = x mod n, associates with x the nonnegative remainder when x is divided by 
n. We can write x as x = qn + r, 0 Sr < n, where q is the quotient and r is the 
remainder, so the value of x mod n is r. 


25 = («2 4^ flo Vial 2 = [| 
21= 3:7 + 05021 той7 = 0 
15 = 3:4 + 3 ѕо 15 mod 4 = 3 
17 = (—4):5 + 3 5o —17 mod 5 = 3 (it is true that 
17 = (=3)6 se (=) 
but remember that the 


remainder must be 
nonnegative) 


Section 5.6 discusses some of the many applications of the modulo function. ө 


The definition of a function f: S — T includes three parts—the domain set S, 
the codomain set T, and the association itself. Is all this necessary? Why can't we 
simply write an equation, like g(x) = x^, to define a function? 

The quickest answer is that not all functional associations can be described by 
an equation (see Example 30, for instance). But there is more to it—let’s limit our 
attention to situations where an equation can be used to describe the association, 
such as g: R > R where g(x) = x?. Even in algebra and calculus, it is common to 
say “consider the function g(x) = x°,” implying that the equation is the function. 
Technically, the equation only describes a way to compute associated values. The 
function л: R > R given by A(x) = x? — 3x + 3(x + 5)—15 is the same func- 
tion as g because it contains the same ordered pairs. However, the equation is 
different in that it says to process any given x value differently. 
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On the other hand, the function f: Z — R given by f(x) = x? is not the same 
function as g. The domain has been changed, which changes the set of ordered 
pairs. The graph of f(x) would consist of discrete (separate) points (Figure 5.17). 
Most of the functions in which we are interested have this feature. Even in situa- 
tions where one quantity varies continuously with another, in a digital computer 
we approximate by taking data at discrete, small intervals, much as the graph of 
g(x) (see Figure 5.12) is approximated by the graph of f(x) (see Figure 5.17). 


Hot o (2,8) 
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ey 


Figure 5.17 


Finally, let's look at the function k: R — C given by k(x) = x°. The equa- 
tion and domain is the same as for g(x); the codomain has been enlarged, but 
the change does not affect the ordered pairs. Is this function considered the same 
function as g(x)? It is not, but to see why, we’ll have to wait until we discuss the 
onto property of functions. Then we will see that g has the onto property while k 
does not, so we do not want to consider them the same function. 

In summary, a complete definition of a function requires giving its domain, 
its codomain, and the association, where the association may be given by a verbal 
description, a graph, an equation, or a collection of ordered pairs. 


EQUAL FUNCTIONS 
Two functions are equal if they have the same domain, the same codomain, and 
the same association of values of the codomain with values of the domain. 


Suppose we are trying to show that two functions with the same domain and 
the same codomain are equal. Then we must show that the associations are the 
same. This can be done by showing that, given an arbitrary element of the domain, 
both functions produce the same associated value for that element; that is, they 
map it to the same place. 


PRACTICE 27 | Let S = {1,2,3} and T= {1,4,9}. The function 3:5 Т is defined by 
f= {(1, 1), (2, 4), (3, 9)). The function g: S > T is defined by the equation 


Prove that f — g. 


Ў (4k — 2) 


gn) = na 
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REMINDER 


To show that a function 
is onto, pick an arbitrary 
element in the codomain 
and show that it has a 
preimage in the domain. 
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Properties of Functions 


Onto Functions 

Let f: S — T be an arbitrary function with domain S and codomain T (Figure 
5.18). Part of the definition of a function is that every member of S has an image 
under f and that all the images are members of 7; the set R of all such images is 
called the range of the function f. Thus, А = {f(s)|s € S}, or R = f(S). Clearly, 
R C T; the range R is shaded in Figure 5.19. If it should happen that А = Т, that 
Is, that the range coincides with the codomain, then the function is called an onto 
function. 


Domain 5 Codomain T 


Figure 5.18 


Domain 5 Codomain T 


Figure 5.19 


ONTO (SURJECTIVE) FUNCTION 
A function f: S — Т is an onto or surjective function if the range of f equals the 
codomain of f. 


In every function with range Ё and codomain Т, R C T. To prove that a given 
function is onto, we must show that T C R; then it will be true that R = T. We 
must therefore show that an arbitrary member of the codomain is a member of the 
range, that is, that it is the image of some member of the domain. On the other 
hand, if we can produce one member of the codomain that is not the image of any 
member of the domain, then we have proved that the function is not onto. 


The function g: R > R defined by g(x) = x? is an onto function. To prove that 
g(x) is onto, let r be an arbitrary real number, and let x = Vr. Then x is a real 
number, so x belongs to the domain of g and g(x) — (С)? = r. Hence, any mem- 
ber of the codomain is the image under g of a member of the domain. The function 
k: R > C given by k(x) = x? is not onto. There аге many complex numbers (i, for 
example) that cannot be obtained by cubing a real number. Thus, g and k are not 
equal functions. e 
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Let f: Q > Q be defined by f(x) = 3x + 2. To test whether fis onto, let д є Q. 
We want an x € Q such that f(x) = 3x + 2 = q. When we solve this equation for 
x, we find that x = (д — 2)/3 is the only possible value and is indeed a member 
of Q. Thus, q is the image of a member of @ under f, and fis onto. However, the 
function Л: Z — Q defined by A(x) = Зх + 2 is not onto because there are many 
values q € ©, for example 0, for which the equation 3x + 2 = q has no integer 
solution. e 


PRACTICE 28 | Which of the functions found in Practice 23 are onto functions? = 


PRACTICE 29 | Suppose a function f: (T, F}" — (T, F} is defined by a propositional wff P (see Example 31). 


Give the two conditions on P under each of which f will fail to be an onto function. н 


One-to-One Functions 

The definition of a function guarantees a unique image for every member of the 
domain. A given member of the range may have more than one preimage, how- 
ever. In our very first example of a function (salary increases), both physical sci- 
ences and liberal arts were preimages of 1.5%. This function was not one-to-one. 


* DEFINITION ONE-TO-ONE (INJECTIVE) FUNCTION 
A function f: S — T is one-to-one, or injective, if no member of T' is the image 
under f of two distinct elements of S. 


The one-to-one idea here is the same as for binary relations in general, as 
discussed in Section 5.1, except that every element of S must appear as a first com- 
ponent in an ordered pair. To prove that a function is one-to-one, we assume that 
there are elements s; and s; of S with f(s,) = f(s;) and then show that s, = s. To 
prove that a function is not one-to-one, we produce a counterexample, an element 
in the range with two preimages in the domain. 


REMINDER 


To show that a function 

f is one-to-one, assume 
f(s) = f(S2) and show that 
S4 = Sp. 


The function g: R > R defined by g(x) = x° is one-to-one because if x and y are 
real numbers with g(x) = g(y), then x? = y? and x = y. The function f: К —> К 
given by f(x) = x? is not one-to-one because, for example, f(2) = f(—2) = 4. 
However, the function Л: № > № given by A(x) = x? is one-to-one because if х 
and y are nonnegative integers with h(x) = А(у), then x? = y?; because x and y are 
both nonnegative, x — y. e 


PRACTICE 30 | Which of the functions found in Practice 23 are one-to-one functions? 
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The floor function and the ceiling function of Example 32 are clearly not one-to- 
one. This is evident also in the graphs of these functions (Practice 26), which have 
a number of horizontal sections, indicating that many different domain values in R 
are mapped by the function to the same codomain value in Z. e 


Figure 5.20 gives simple illustrations about functions and their properties. In 
each case, the domain is on the left and the codomain is on the right. 


єс = 


Not a function Not a function Function, not one-to-one, not onto 
Function, not one-to-one, onto Function, one-to-one, not onto Function, one-to-one, onto 
Figure 5.20 

Bijections 


Ф DEFINITION BIJECTIVE FUNCTION 
A function f: S — T is bijective (a bijection) if it is both one-to-one and onto. 


ЭТОТ The function g: R — R given by g(x) = x? is a bijection. The function in part 
(g) of Practice 23 is a bijection. The function f: К > R given by Дх) = x? is 
not a bijection (not one-to-one), and neither is the function k: R — C given by 
k(x) — x? (not onto). e 


Composition of Functions 


Suppose that f and g are functions with f: S— T апа g: T — U. Then for any 
s € S, f(s) is a member of T, which is also the domain of g. Thus, the function g 
can be applied to f(s). The result is g(/(s)), a member of U (Figure 5.21). Taking 
an arbitrary member s of S, applying the function f, and then applying the function 
gto f(s) isthe same as associating a unique member of U with s. In short, we have 
created a function S — U, called the composition function of f and g and denoted 


by g ° f (Figure 5.22). 
T 


REMINDER 


To prove that a function is 
a bijection requires prov- 

ing two things— onto and 
one-to-one. 


S 


Figure 5.21 
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Figure 5.22 


* DEFINITION COMPOSITION FUNCTION 
Let f: S— T and g: T— U. Then the composition function, g ° f, is a function 
from 5 to U defined by (g ° f Xs) = g(f(s)). 


Note that the function g ° f is applied right to left; function fis applied first 
and then function g. 

The diagram in Figure 5.23 also illustrates the definition of the composition 
function. The corners indicate the domains and codomains of the three functions. 
The diagram says that, starting with an element of S, if we follow either path g ° f 
or path f followed by path g, we get to the same element in U. Diagrams illustrating 
that alternate paths produce the same effect are called commutative diagrams. 


Figure 5.23 


It is not always possible to take any two arbitrary functions and compose 
them; the domains and ranges have to be “compatible.” For example, if f: S — T 
and g:W — Z, where T and W are disjoint, then (g ° f)(s) = g(f(s)) is undefined 
because f(s) is not in the domain of g. 


PRACTICE 31 | Let f: R — R be defined by f(x) = x’. Let g: R > R be defined by g(x) = |x]. 


a. What is the value of (g ° f) (2.3)? 
b. What is the value of (f° g)(2.3)? п 


From Practice 31 we see that order is important in function composition, 
which should not be surprising. If you make a deposit in your checking account 
and then write a large check, the effect is not the same as if you write a large check 
and later make a deposit! Your bank is very sensitive to these differences. 
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Function composition preserves the properties of being onto and being one- 
to-one. Again, let f: S— T and g: T — U, but also suppose that both f and g 
are onto functions. Then the composition function g ° f is also onto. Recall that 
g9f:S— О, so we must pick an arbitrary и є U and show that it has a 
preimage under g°f in S. Because g is onto, there exists t€ T such that 
g(t) = и. And because f is onto, there exists s Є S such that f(s) = t. Then 


(e° f(s) = gCf(s)) = g(t) = и, and g ° f is an onto function. 


PRACTICE 32 | Let f: 5 — T and g: T — U, and assume that both fand g are one-to-one functions. 


Prove that g ° f is a one-to-one function. (Hint: Assume that (g ° f)(s,) = (g ° f)(S2).) п 


We have now proved the following theorem. 


* THEOREM ON COMPOSING TWO BIJECTIONS 
The composition of two bijections is a bijection. 


Inverse Functions 


Bijective functions have another important property. Let f: S — T be a bijection. 
Because fis onto, every t є T has a preimage in 5. Because fis one-to-one, that 
preimage is unique. We can associate with each element t of T a unique member of 
S, namely, that s є S such that f(s) = t. This association describes a function g, 
g: T — S. The picture for f and g is given in Figure 5.24. The domains and codo- 
mains of g and f are such that we can form both g ° f. S — S and f° g: T — T. If 
s E S, then (g »f)(s) = g(f(s)) = g(t) = s. Thus, g ° f maps each element of S 
to itself. The function that maps each element of a set S to itself, that is, that leaves 
each element of S unchanged, is called the identity function on S and denoted by 
is. Hence, g °/ = is. 


d 


Figure 5.24 


PRACTICE 33 | Show that f° g = ir. 


© DEFINITION 


© THEOREM 
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We have now seen that if f is a bijection, f: S — T, then there is a function 
g:T—S with gof=is and f° g = іг. The converse is also true. To prove 
the converse, suppose f:.S — T and there exists g: T — S with g° = i; and 
fe g = іт. We can prove that f is a bijection. To show that fis onto, let t € T. 
Then t = i,t) = (f° D = f(g(t)). Because g: T — S, g(t) € S, and g(t) is the 
preimage under f of t. To show that f is one-to-one, suppose /(51) = f(s;). Then 
gCf(s)) = g(f()) and (gef)s) = (6° /)(52) implying is(s;) = is(s2), or 


S| = s5. Thus, fis a bijection. 


INVERSE FUNCTION 
Let f be a function, f: S — T. If there exists a function g: T — S such that 
g^f-ig,and f° g = ir, then g is called the inverse function of f, denoted 


by f~. 


We have proved the following theorem. 


ON BIJECTIONS AND INVERSE FUNCTIONS 
Let f. S T. Then fis a bijection if and only if f^ ! exists. 


Actually, we have been a bit sneaky in talking about the inverse function 
of f. What we have shown is that if f is a bijection, this is equivalent to the ex- 
istence of an inverse function. But it is easy to see that there is only one such 
inverse function. When you want to prove that something is unique, the stan- 
dard technique is to assume that there are two different such things and then 
obtain a contradiction. Thus, suppose f has two inverse functions, fj ' and f; ! 
(existence of either means that fis a bijection). Both f~ ' and f; ! are functions 
from T to S; if they are not the same function, then they must act differently 
somewhere. Assume that there is a t є T such that f, (f) # f; (t). Because fis 
one-to-one, it follows that (A) # f(f; (0), or (fef; (t) 4 (fef; DO. 
But both fof! and fof} | are ip, so t ¥ t, which is a contradiction. We are 
therefore justified in speaking of f! as the inverse function of f. If f is a bijec- 
tion, so that f ! exists, then fis the inverse function for f^ !; therefore, / ' is 
also a bijection. 


PRACTICE 34 | f: К — R given by f(x) = 3x + 4 isa bijection. 


Describe f !. 
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We’ve introduced a lot of terminology about functions. Table 5.2 gives an 
informal summary of these terms. 


TABLE 5.2 

Term Meaning 

function Mapping from one set to another that associates with each member of the starting 
set exactly one member of the ending set 

domain Starting set for a function 

codomain Ending set for a function 

image Point that results from a mapping 

preimage Starting point for a mapping 

range Collection of all images of the domain 


onto (surjective) 
one-to-one (injective) 
bijection 

identity function 


inverse function 


Range is the whole codomain; every codomain element has a preimage 
No two elements in the domain map to the same place 

One-to-one and onto 

Maps each element of a set to itself 


For a bijection, a new function that maps each codomain element back where it 
came from 


Ф DEFINITION 


Permutation Functions 


Bijections that map a set to itself are given a special name. 


PERMUTATIONS OF A SET 

For a given set A, S4 = {f| fi A — A and fis a bijection}. S, is thus the set of 
all bijections of set A into (and therefore onto) itself; such functions are called 
permutations of A. 


If fand g both belong to S4, then they each have domain = range = A. There- 
fore the composition function г ° f is defined and maps A — A. Furthermore, 
because f and g are both bijections, our theorem on composing bijections says 
that g ° f is a bijection, a (unique) member of S4. Thus, function composition is a 
binary operation on the set Sy. 

In Section 4.4 we described a permutation of objects in a set as being an or- 
dered arrangement of those objects. Is this now a new use of the word “permuta- 
поп”? Not exactly; permutation functions represent ordered arrangements of the 
objects in the domain. If А = (1, 2, 3, 4}, one permutation function of A, call it 
fis given by f = {(1, 2), (2, 3), (3, 1), (4, 4)}. We can also describe function fin 
array form by listing the elements of the domain in a row and, directly beneath, 
the images of these elements under f. Thus, 


_fl 2 3. 4 
з 22:11) 


The bottom row 18 an ordered arrangement of the objects in the top row. 
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A shorter way to describe the permutation f shown in array form is to use 
cycle notation and write f = (1, 2, 3)—understood to mean that f maps each 
element listed to the one on its right, the last element listed to the first, and an 
element of the domain not listed to itself. Here 1 maps to 2, 2 maps to 3, and 3 
maps to 1. The element 4 maps to itself because it does not appear in the cycle. 
The cycle (2, 3, 1) also represents f. It says that 2 maps to 3, 3 maps to 1, 1 maps 
to 2, and 4 maps to itself, the same information as before. Similarly, (3, 1, 2) 
also represents f- 


PRACTICE 35 


a. Let A = (1, 2, 3, 4, 5}, and let f € S, be given in array form by 


Ee час 
f= К 23 J 
Write f in cycle form. 


b. Let A = {1, 2, 3, 4, 5}, and let g € S, be given in cycle form by g = (2, 4, 5, 3). Write g in 
array form. = 


If fand g are members of 5, for some set A, then g ° fe S4, and the ac- 
tion of g ° f on any member of A is determined by applying function f and then 
function g. If fand g are cycles, g » f is still computed the same way. 


If A = (1,2, 3,4} and f, g € S, are given by f= (1, 2, 3) and g = (2, 3), then 
gf = (2,3) » (1, 2, 3). But what does this composition function look like? Let's 
see what happens to element 1 of A. Working from right to left (first f, then о), 
1 — 2 under f and then 2 — 3 under о, so 1 — 3 under g ° f. If we want to write 
g^f asa cycle, we see that it can start with 


(1,3 


and we next need to see what happens to 3. Under f, 3 — 1 and then under g, 
] — 1 (because 1 does not appear in the cycle notation for g), so 3 — 1 under 
g ° f . Thus we can close the above cycle, writing it as (1, 3). But what happens to 
2 and 4? If we consider 2, 2 — 3 under f and then 3 — 2 under о, so 2 — 2 under 
g ° f . Similarly, 4 — 4 under f and 4 — 4 under g, so 4 — 4 under g ° f. We con- 
clude that g » f — (1, 3). e 


In Example 39, if we were to compute f° g = (1, 2, 3) ° (2, 3), we would get 
(1, 2). (We already know that order is important in function composition.) If, how- 
ever, f and g are members of S, and f and g are disjoint cycles—the cycles have 
no elements in common—then fo g = gof. 
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PRACTICE 36 | Let A = (1, 2, 3, 4, 5}. Compute g » f and f° g for the following cycles in S4. 


a. f — (5, 2, 3); g = (3, 4, 1). Write the answers in cycle form. 
b. f= (1, 2, 3, 4); g = (3, 2, 4, 5). Write the answers in array form. 
c. f= (1,3); g = (2, 5). Write the answers in array form. н 


Let A = (1, 2, 3, 4} and consider the cycle f € S, given by f = (1, 2). If we 
compute f° f= (1, 2) ° (1, 2), we see that each element of А is mapped to itself. 
The permutation that maps each element of A to itself is the identity function on 
A, i4, also called the identity permutation. 

If A is an infinite set, not every permutation of A can be written as a cycle. But 
even when 4 is a finite set, not every permutation of A can be written as a cycle; 
for example, the permutation g ° f of Practice 36(b) cannot be written as a cycle. 
However, every permutation on a finite set that is not the identity permutation can 
be written as a composition of one or more disjoint cycles. The permutation 


( 2- 3 4 `) 
4 2 5 Y 3 
of Practice 36(b) is (1, 4) » (3, 5) or (3, 5) » (1, 4). 


PRACTICE 37 | Write 


W—— 
2 А S 1 3 ($ 


as a composition of disjoint cycles. 


Among the permutations of A, some will map certain elements of A to them- 
selves, while others will so thoroughly mix elements around that no element in 4 
is mapped to itself. A permutation on a set that maps no element to itself is called 
a derangement. 


| EXAMPLE% | 7) The permutation fon A = {1, 2, 3, 4, 5} given in array form by 
12 ATS 
QUE SI CASES 
is a derangement. Members of S, that are not derangements, if written as a cycle 


or a product of cycles, will have at least one element of A that is not listed. Thus 
g € S, defined as g = (1, 4) ° (3, 5) maps 2 to itself, so g is not a derangement. @ 
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How Many Functions 


Suppose S and Т are finite sets, say |S'| = m and |T| = n. What can we say about 
the number of functions with various properties that map S to 7? First, let's just 
count the number of functions f: 5 — Т, assuming no special properties about 
the functions. The multiplication principle can be used here because we can think 
of defining a function by assigning an image to each of the т elements of S. This 
gives us a sequence of m tasks. Each task has n outcomes because each element of 
5 can map to any element in Т. Therefore the number of functions is 


InXnXnX-:-Xn|-n" 


m factors 


How many one-to-one functions are there from S to 7? We must have m = n 
or we can't have any one-to-one functions at all. (All the elements of S must be 
mapped to Т, and if т > n there аге too many elements in 5 to allow for a one- 
to-one mapping. Actually, this is the pigeonhole principle at work.) We can again 
solve this problem by carrying out the sequence of tasks of assigning an image to 
each element in S, but this time we cannot use any image we have used before. By 
the multiplication principle, we have a product that begins with the factors 


n(n—l(n-—2)-- 
and must contain a total of m factors, so the result is 


n(n— (n —2):-::[n - (m — D] = nn - Dn—2):--(n—m- 1) 


n! 
(T m P(n,m) 

How many onto functions are there from S to 7? This time we must have 
m > n so that there are enough values in the domain to provide preimages for 
every value in the codomain. (By the definition of a function, an element іп S can- 
not be a preimage of more than one element in Т) Our overall plan is to subtract 
the number of non-onto functions from the total number of functions, which we 
know. To count the number of non-onto functions, we'll use the principle of inclu- 
sion and exclusion. 

Enumerate the elements of set T'as tj, ... , £,, For each i, 1 = i = n, let A; de- 
note the set of functions from S to T that do not map anything to element t;. (These 
sets are not disjoint, but every non-onto function belongs to at least one such set.) 
By the principle of inclusion and exclusion, we can write 


[А, U *-* U A,] = p» | A,| z > |4,04 || + = |4,;0.4, А, 


1=і=п ]xi«cjzn 1=1</<&=п 


- (А 74, (1) 


* THEOREM 
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For any i, |4,| is the number of functions that do not map anything to t; but 
have no other restrictions. By the multiplication principle, we can count the num- 
ber of such functions by counting for each of the т domain elements its n — 1 
possible images. The result is that [4 = (n — 1)". Therefore the first summation 
in Equation (1) adds together terms that are all of the same size. There is one such 
term for each distinct individual set А, out of the n sets, so there are C(n, 1) such 
terms. 

For any i and j, 4; Aj is the number of functions that do not map anything 
to t; or 5, leaving n — 2 possible images for each of the m elements of S. Thus 
|4,74 P — (n — 2)". The second summation adds one such term for each distinct 
group of two sets out of л, so there are C(n, 2) such terms. 

A similar result holds for all the intersection terms. If there are k sets in the 
intersection, then there аге (n — k)” functions in the intersection set and there are 
С(п, k) distinct groups of k sets to form the intersection. Equation (1) can thus be 
written as 


А, U+ U AJ = Сп, D(n — 1)" — Cn, 2) — 2)" + C(n, Зп — 3)" 
=+ + (HDC, n) — п)" (2) 


Now the expression on the left of Equation (2) represents the number of all func- 
tions that fail to map to at least one of the elements of Т, that is, all the non-onto 
functions. If we subtract the value of this expression from the total number of 
functions, which we know is n”, we will have the number of onto functions. Thus 
the number of onto functions is 


n" — C(n, Dn = 1)" + C, 2)(n — 2)" — Cn, 3)(n — 3)" 
4o (DC, n — D[n = (п = DI? + (1) Сп, nn — ny 


where we've added the next-to-last term. The last term is zero, so the final answer is 


n" — C(n, 1)(п — 1)" + C(n, 2)(n — 2)" — C(n, 3)(n — 3)" 
+... + (=D! С(п, n — D" 


We'll summarize these results. 


ON THE NUMBER OF FUNCTIONS WITH FINITE DOMAINS 
AND CODOMAINS 
If |S| 2 m and |T| = n, then 


1. The number of functions f: 5 — T is n". 
2. The number of one-to-one functions f: S — Т, assuming that m = n, is 
n! 
(n — m)! 
3. The number of onto functions f: S > T, assuming that m = n, is 


n” = Cy Kn = 1)" Cun2)n-— 2)? = C3 (n3) 
+... + (= 1)" С(и, п — Day" 


| Section 5.4 Functions 399 


Let S = {А,В,С} and T = (a, b}. Find the number of functions from S onto T. 
Here m = 3 and n = 2. By our theorem on the number of functions, there are 


2—CO,)0y»28-2-1-26 


such functions. e 


PRACTICE 38 | One of the six onto functions in Example 41 can be illustrated by the following diagram: 
A 
uw 2 a 
B 
EM o b 
C 


Draw diagrams for the remaining five onto functions. и 


If A is a set with |А| = п, then the number of permutations of А is n!. This 
number can be obtained by any of three methods: 


1. Acombinatorial argument (each of the n elements in the domain must map 
to one of the л elements in the range with no repetitions) 

2. Thinking of such functions as permutations on a set with n elements and 
noting that P(n, n) = n! 

3. Using result (2) in the previous theorem with m — n 


We propose to count the number of derangements on A. Our plan is similar 
to the one we used in counting onto functions. We'll use the principle of inclusion 
and exclusion to compute the number of permutations that are not derangements 
and then subtract this value from the total number of permutation functions. 

Enumerate the elements of set A as a), ..., a,. For each i, 1 =i = n, let А; 
be the set of all permutations that leave a; fixed. (These sets are not disjoint, but 
every permutation that is not a derangement belongs to at least one such set.) By 
the principle of inclusion and exclusion, we can write 


|A; Eier del А, Е p» |44 ш by |4,04 + > I4, A; Ad 
1=і=лп 1=1<ј=п ]xi«j«kzn 
esses ee D" аса (3) 


For any i, |A,| is the number of permutations that leave a; fixed. By the mul- 
tiplication principle we can count the number of such functions by counting for 
each of the n domain elements, beginning with a,, its possible images. There is 
only one choice of where to map a; because it must map to itself; the next element 
can map anywhere except to a;, so there are n — 1 outcomes; the next element can 
map anywhere except the two images already used, so there are n — 2 outcomes, 
and so on. Continuing, there are 


(D = Dn -2):::0)7 (0 — D! 
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elements in A; for each i. Therefore the first summation in equation (3) adds to- 
gether terms that are all of the same size. The number of such terms equals the 
number of ways to pick one set А; out of the л such sets, or C(n, 1). 

In the second summation, the terms count the number of permutations on n 
elements that leave two of those elements fixed. There are 


DA) = 2): (0) = (n 2)! 


such functions in a given 4; А, and C(n, 2) ways to choose the two sets out of 
n. In general, if there are k sets in the intersection, then k elements must be held 
fixed, so there are (л — k)! functions in the intersection set, and there are C(n, k) 
ways to choose the К sets to form the intersection. Therefore equation (3) becomes 


I4, U +++ ОА, = Cn, D(n — 1)! — Cin, 2)(n — 2)! + Сп, 3)(n — 3)! 

— +++ + (—1)"*!C(n, nn — n)! 
This expression represents the number of all possible nonderangement permu- 
tations. We subtract this value from the total number of permutation functions, 


which is n!: 


n! — С(п, 1)(п — 1)! + C(n,2)X(n — 2)! — C(n, 32 — 3)! 
+++ (= "Си, п)(п — n)! 


Rewriting this expression, 


n! n! n! 


: 1)! 4 : 2)! 3)! 
Па D tasca M за — ay @ 3) 
szl ' 
Free + ( 1) ET 
n! n! n n! 
= nl + } + (- 1" — 
H 2л! 3) n! 
11 d . 
= mı w ш ылыа. z (4) 


! ! 
»( 1,1 SEE Le 1=2 


Written in array form, the two derangements are 


nO oe 
231 © 3 


к=з 
N чо 
p EN 
е 


© DEFINITIONS 
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Equivalent Sets 


EQUIVALENT SETS AND CARDINALITY 
A set S 15 equivalent to a set T if there exists a bijection f: S — T. Two sets that 
are equivalent have the same cardinality. 


The notion of equivalent sets allows us to extend our definition of cardinality 
from finite to infinite sets. The cardinality of a finite set is the number of elements 
in the set. If S is equivalent to T, then all the members of S and Т are paired off 
by fin a one-to-one correspondence. If S and 7 are finite sets, this pairing off can 
happen only when S and Т are the same size. With infinite sets, the idea of size 
gets a bit fuzzy, because we can sometimes prove that a given set is equivalent to 
what seems to be a smaller set. The cardinality of an infinite set is therefore given 
only in a comparative sense; for example, we may say that an infinite set А has (or 
does not have) the same cardinality as the set N. 


PRACTICE 39 | Describe a bijection f: Z — N, thus showing that Z is equivalent to № (Z and № have the 


same cardinality) even though N C Z. 


* THEOREM 


If we have found a bijection between a set 5 and №, we have established a one-to- 
one correspondence between the members of S and the nonnegative integers. We 
can then name the members of S according to this correspondence, writing s, for 
the value of S associated with 0, s, for the value of S associated with 1, and so on. 
Then the list 


505 $1, 85, ... 


includes all the members of S. Since this list constitutes an enumeration of S, S 15 
a denumerable set. Conversely, if $ is denumerable, then a listing of the members 
of S exists and can be used to define a bijection between S and N. Therefore a set 
is denumerable if and only if it is equivalent to N. 

For finite sets, we know that if S has n elements, then (5) has 2" elements. 
Of course, 2" > n, and we cannot find a bijection between a set with n elements 
and a set with 2" elements. Therefore S and g(S) are not equivalent. This result is 
also true for infinite sets. 


CANTOR'S THEOREM 
For any set S, S and g(S) are not equivalent. 


Proof: We will do a proof by contradiction and assume that S and (S) are equiva- 
lent. Let f be the bijection between S and (S). For any member s of S, f(s) is a 
member of (S), so f(s) is a set containing some members of S, possibly contain- 
ing s itself. Now we define a set X = (x € S|x € f(x)}. Because X is a subset of 
S, it is an element of (S) and therefore must be equal to f(y) for some y є S. 
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Then y either is or is not a member of X. If y € X, then by the definition of X, 
y € f(y), but since f(y) = X, then у € X. On the other hand, if y € X, then since 
X = f(y), y € f(y), and by the definition of X, y € X. In either case, there is a 
contradiction, and our original assumption is incorrect. Therefore S and g(S) are 
not equivalent. End of Proof 


The proof of Cantor's theorem depends on the nature of set X, which was care- 
fully constructed to provide the crucial contradiction. In this sense, the proof is 
similar to the diagonalization method (see Example 23 in Chapter 4) used to prove 
the existence of an uncountable set. Indeed, the existence of an uncountable set 
can be shown directly from Cantor's theorem. 


The set № is, of course, a denumerable set. By Cantor's theorem, the set (N) is 
not equivalent to N and is therefore not a denumerable set, although it is clearly 


infinite. 


SECTION 5.4 REVIEW 


TECHNIQUES 


* Test whether a given relation is a function. 
QJ Test a function for being one-to-one or onto. 
* Find the image of an element under function com- 
position. 
W Write permutations of a set in array or cycle 
form. 
* Countthe number of functions, one-to-one functions, 
and onto functions from one finite set to another. 


EXERCISES 5.4 


1. The accompanying figure represents a function. 


MAIN IDEAS 


* The concept of function, especially bijective 
function, is extremely important. 

* Composition of functions preserves bijectiveness. 

* The inverse function ofa bijection is itself a bijection. 

* Permutations are bijections on a set. 


a. What is the domain? What is the codomain? What is the range? 


b. What is the image of 5? of 8? 
c. What are the preimages of 9? 
d. Is this an onto function? Is it one-to-one? 
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2. The accompanying figure illustrates various binary relations from К to R. Which are functions? For those 
that are functions, which are onto? Which are one-to-one? 


(a) (b) 
(c) (d) 


3. Using the equation f(x) = 2x — 1 to describe the functional association, write the function as a set of 
ordered pairs if the codomain is R and the domain is 


a S = {0,1,2}. 
b. S = {1, 2,4, 5}. 
c. S = {V7, 1.5}. 


4. Using the equation f(x) = x^ + 1 to describe the functional association, write the function as a set of 
ordered pairs if the codomain is Z and the domain is 


a. S= {1,5}. 
b. S= {-1, 2, -2}. 
c. S = {-V12, 3}. 
5. Iff: Z > Z is defined by f(x) = 3x, find f(A) for 
a. A = {1,3,5}. 


b. A = {x|x € Zand (3y)(y є Z and x = 2y)}. 
6. Iff: R К is defined by f(x) = x’, describe 

a. f(N). 

b. f(Z). 

c. f(R). 


10. 


p 
— 


12. 


13. 


14. 


| 404 Relations, Functions, and Matrices 


. The function f: {all English words} — Z. In each case, find /(5). 


a. S = {dog, cat, buffalo, giraffe}, f(x) = the number of characters in x 
b. 5 = (goose, geese, moose, Mississippi}, f(x) = the number of double-letter pairs in x 
c. S = {cheetah, seal, porpoise, koala}, f(x) = the number of e’s in x 


. The function f: {binary strings} — {binary strings}. In each case, find /(5). 


a. S = {000, 1011, 10001}, f(x) = the second bit in x 
b. S = {111, 100, 0111}, f(x) = the binary string that is the sum of the first and last bit 
c. S = (001, 11, 101}, f(x) = the binary string that is equal tox + 1 


. True or false: 


a. An onto function means that every element in the codomain must have a unique preimage. 
b. A one-to-one function means that every element in the codomain must have a unique preimage. 


с. А one-to-one function means that no two elements in the domain map to the same element in the 
codomain. 


d. An onto function means that (the range) M (the codomain) = @. 


True or false: 

a. If every element in the domain has an image, it must be an onto function. 

b. If every element in the codomain has an image, it must be an onto function. 

c. If every element in the codomain has a preimage, it must be an onto function. 
d. If the domain is the larger than the codomain, it can't be a one-to-one function. 


. Let S = (0, 2, 4, 6} and T = (1,3, 5, 7}. Determine whether each of the following sets of ordered pairs 


is a function with domain S and codomain 7. If so, is it one-to-one? Is it onto? 
a. (0, 2), (2, 4), (4, 6), (6, 0)} 

b. {(6, 3), (2, 1), (0, 3), (4, 5)} 

c. {(2, 3), (4, 7), (0, 1), (6, 5) 

d. {(2, 1), (4, 5), (6, 3)} 

e. {(6, 1), (0, 3), (4, 1), (0, 7), (2, 5)} 


For any bijections in Exercise 11, describe the inverse function. 


Let S = the set ofall U. S. citizens alive today. Which of the following are functions from domain S to the 
codomain given? Which functions are one-to-one? Which functions are onto? 


a. Codomain = the alphabet, f(person) = initial of person’s middle name 
b. Codomain = the set of dates between January | and December 31, f(person) = person’s date of birth 
с. Codomain = 9-digit numbers, f(person) = person's Social Security number 


Let S = the set of people at a meeting, let Т = the set of all shoes in the room. Let f(x) = the left shoe x 
is wearing. 

a. Is this a function? 

b. Is it one-to-one? 

c. Is it onto? 


15. 


1 


a 


17. 


18. 


19. 


20. 


2 


— 


22. 


23. 


24. 


25. 
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Which of the following definitions describe functions from the domain to the codomain given? Which 
functions are one-to-one? Which functions are onto? Describe the inverse function for any bijective 
function. 


a. f: Z = N where fis defined by f(x) = x? + 1 

b. g: N — О where g is defined by g(x) = 1/x 

c. Л: Z X RI О where Л is defined by A(z, п) = z/(n + 1) 
d. f: {1, 2, 3} > (p, q, г} where f = {(1, 4), (2, r), (3,р)} 

e. g: N — N where g is given by g(x) = 2” 

f£ h: R? > R? where Л is defined by h(x, y) = (y + 1,x + 1) 


. Which of the following definitions describe functions from the domain to the codomain given? Which 


functions are one-to-one? Which functions are onto? Describe the inverse function for any bijective 
function. 

a. f: Z > № where fis defined by f(x, y) = x? + 2y? 

x/2 ifxis even 

x + lifxis odd 

c. е: R  R where g is defined by g(x) = 1/V/(x + 1) 

x t lifxis even 

x — lifxis odd 

e л: № > № where ^ is given by A(x, y,z) = х - y —z 

f. g: № > NP where g is defined by g(x, y) = (y, x, 0) 

Let f: К К be defined by f(x) = x", where n is a fixed, positive integer. For what values of n is f 
bijective? 


b. f: № — № where fis defined by f(x) = { 


d. f: № = № where fis defined by f(x) = { 


Let f: R — R be defined by f(x) = n2*, where n is a fixed, positive integer. For what values of n is fan 
onto function? 

Let A = (x, у} and let А* be the set of all strings of finite length made up of symbols from А. A function 
f: A* — Z is defined as follows: For s in A*, f(s) = the length of s. Is f one-to-one? Prove or disprove. 
Is fonto? Prove or disprove. 

Let А = (x, у} and let А* be the set of all strings of finite length made up of symbols from А. A function 
f: A* — Z is defined as follows: For s in A*, f(s) = the number of x's minus the number of y’s. Is f one- 
to-one? Prove or disprove. Is f onto? Prove or disprove. 


.Let A = {х,у} and let A* be the set of all strings of finite length made up of symbols from A. A function 


f: A* — A* is defined as follows: For s in A*, f(s) is the string obtained by writing the characters of s in 
reverse order. Is fone-to-one? Prove or disprove. Is fonto? Prove or disprove. 

Let A = (x, у} and let A* be the set of all strings of finite length made up of symbols from A. A function 
f: A* — A* is defined as follows: For s in A*, f(s) = xs (the single-character string x followed by s). Is f 
one-to-one? Prove or disprove. Is fonto? Prove or disprove. 

Let P be the power set of {a,b,c}. A function f: P— Z is defined as follows: For А in Р, 
f(4) = the number of elements in A. Is fone-to-one? Prove or disprove. Is f onto? Prove or disprove. 

Let P be the power set of (a, b} and let S be the set of all binary strings of length 2. A function f: P > S 
is defined as follows: For A in P, f(A) has a 1 in the high-order bit position (left end of string) if and 
only ifa is in A. f(A) has a 1 in the low-order bit position (right end of string) if and only if b is in A. Is f 
one-to-one? Prove or disprove. Is f onto? Prove or disprove. 

Let S = (x|x e Rand x = 1}, and Т = (x|x e Rand 0 < x = 1}. Find a function f 5 T that is a 
bijection. 


26. 


27. 


28. 


29. 
30. 
31. 
32. 


33. 


34. 


33. 
36. 
37. 


38. 


39. 
40. 
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Let S = (a, b, c, d} and T = (x, у, z}. 

a. Give an example of a function from S to T that is neither onto nor one-to-one. 
b. Give an example of a function from S to T that is onto but not one-to-one. 

c. Can you find a function from 5 to T that is one-to-one? 


Compute the following values. 
a. [3.4] 

b. [| —0.2] 

c. [0.5] 

Compute the following values. 
a. |—5—1.2] 

b. [-5-D3]] 

с. [2 * 3.7] 

d. [1 1/2 + 1/3 + 1/4] 


What can be said about x if |x| = [x]? 

Prove that [x] + 1 = [x + 1]. 

Prove that |х| = —| —x]. 

The ceiling function f(x) = [x]: R — Z. Prove or disprove: 

a. fis one-to-one 

b. fis onto 

Prove or disprove: 

a. |х| = x 

b. |2х| = 2|x| 

Prove or disprove: 

а. |х| + ly] = lx + y] 

b. |2х| = |x| + |х + 1/2] 

Prove that if 2% < n < 2**! then k = |log n| and k + 1 = [log n]. (Here log n means log; n.) 
Prove that if 2^ = n < 2^*! then |log n| + 1 = [log (n + 1)]. (Here log n means log; n.) 


Compute the value of the following expressions. 

a. 31 mod 11 

b. 16 mod 8 

c. 22 mod 6 

d. —7 mod 3 

a. List five values x such that x mod 7 = 0. 

b. List five values x such that x mod 5 — 2. 

Prove or disprove: For any integers x and y, x mod 10 + y mod 10 = (x + y) mod 10. 


Prove that x = y (mod n) if and only if x mod n = y mod n. (Recall the definition of congruence modulo 
n from Section 5.1.) 


. Let S be a set and let А be a subset of S. The characteristic function of A is a function c4: S — (0, 1} with 


c4(x) = 1 exactly when x € A. 
a. Let S = (1,2, 3, 4, 5} and A = (1,3, 5}. Give the ordered pairs that belong to c4. 
b. Prove that for any set S and any subsets А and В of S, cán p(x) = cx): eg(x). 
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c. Prove that c,(x) = 1 — c4(x). 


d. Is it true that for any set S and any subsets А and В of S, c4 u (x) = c4(x) + свх)? Prove or give a 
counterexample. 


42. Ackermann 5 function, mapping № to N, is a recursive function that grows very rapidly. It is given by 


А(0, п) = n + 1 forallne N 
A(m, 0) = A(m — 1, 1) forall m € № m > 0 
A(m, n) = A(m — 1, А(т, n — 1) forallme N,ne №, т> 0, п > 0 


a. Compute (show all steps) the value of A(1, 1). 
b. Compute (show all steps) the value of A(2, 1). 
c. The value of A(4, 0) = 13 = 27 — 3, still a small value. But A(4,1) = 2v — 3. Compute this value. 
d. Write a likely expression for the value of A(4, 2). 
43. Another rapidly growing function is ће Smorynski function, which also maps № to №. The definition is 


S(0, n) = n" forall n є № 
S(m, п) = S(m — 1, S(m — 1, n)) forall m E N, n e №, т> 0 


a. How does S(0, п) compare to A(0, п)? (See Exercise 40.) 
b. Find (show all steps) an expression for the value of S(1, л). 


c. A googolplex is a very large number, which if written in standard form (such as 1,000,000 ... ), even 
in 1-point font, would take more room to write than the diameter of the known universe. Look up the 
definition of the googolplex and write it as S(m, n) for a specific value of т and n. 


44. The Dwyer function also maps № to № and grows very rapidly, but it has a closed-form definition: 


D(m, n) — | и 


2" ! 


a. Compute the values of D(1, 1), D(2, 1), D(3, 1) and D(4, 1). 


b. Verify that D(2, 1) = (2*1 + 3)D(1, D, that DG, 1) = (2*2 + 3)D(2,1), and that D(4, 1) = 
(2*3 + 3)D(, 1). 


c. Verify that D(m, 1) satisfies the recurrence relation 
Dim + 1,1) = (2m + 3)D(m, 1) with D(0, 1) = 1 


(Hint: When you evaluate (т + 1, 1) and D(m, 1) do not divide the denominator factorial into the nu- 
merator factorial. Instead, think of the numerator factorial as a product of even and odd factors. 


d. Find (use a spreadsheet) the smallest value of m for which 


D(m, 1) < m" 


"This is the most common of several versions of Ackermann's function, all of which are recursive with extremely rapid growth rates. To watch the 
tedious recursiveness of computations of Ackermann's function, go to http://www. gfredericks.com/sandbox/arith/ackermann 
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45. Let S = {1, 2, 3, 4}, T = {1, 2, 3, 4, 5, 6}, and U = (6, 7, 8, 9, 10}. Also, let 


46. 


4T. 


48. 


49. 


50. 
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f = {(1,2), (2, 4), (3, 3), (4, 6)) be a function from S to T, and let 

g = ((1, 7), (2, 6), (3, 9), (4, 7), (5, 8), (6, 9)} be a function from T to U. Write the ordered pairs in the 

function g ° f. 

a. Letf: R — Z be defined by f(x) = |x]. Let g: Z — N be defined by g(x) = x°. What is (g ° f)(—4.7)? 

b. Let тар the set of books into the integers where fassigns to each book the number of words in its title. 
Let g: Z — Z be given by g(x) = 2x. What is (g ° f) (this book)? 


c. Let f map strings of alphabetical characters and blank spaces into strings of alphabetical consonants 
where f takes any string and removes all vowels and all blanks. Let g map strings of alphabetical 
consonants into integers where g maps a string into the number of characters it contains. What is (g ° f) 
(abraham lincoln)? 


Let f: № — N be defined by f(x) = x + 1. Let g: № — № be defined by g(x) = Зх. Calculate the value 
of the following expressions. 
‚ @°/)б) 
‚(7° 8X3) 
- (g °f)@) 
- (f° gx) 
- Cfo f )(x) 
(в ° g)(x) 
The following functions тар К to R. Give an equation describing the composition functions g ° f and 
f° g ineach case. 


a. f(x) = 6x°, а(х) = 2x 

b. f(x) = (к — 1)/2, g(x) = 4x 

с. f(x) = [x], g0) = Ix] 

Let f: S— T and о: T — U be functions. 
a. Prove that if g ° f is one-to-one, so is f. 


ao gg 


rh о 


b. Prove that if g о f is onto, so is g. 

c. Find an example where g ° f is one-to-one but g is not one-to-one. 

d. Find an example where g ° f is onto but fis not onto. 

a. Let f be a function, f: S — Т. If there exists a function g: T — S such that g ° f = is, then g is called a 
left inverse of f. Show that fhas a left inverse if and only if fis one-to-one. 

b. Let f be a function, f: S — T. If there exists a function g: T — S such that fo g = ip, then g is called 
a right inverse of f. Show that fhas a right inverse if and only if fis onto. 


c. Let f: № — № be given by f(x) = Зх. Then fis one-to-one. Find two different left inverse functions 
for f. 


d. Let f: № — № be given by f(x) = Hi Then fis onto. Find two different right inverse functions 
for f. 


. For each of the following bijections f: К > R, find f~'. 
a. f(x) = 2x 
b. f(x) =x 


c. f(x) = (x + 4/3 


52. Let f and g be bijections, f: S — T and g: T —^ U. Then f~ ' and =! exist. Also, g » f is a bijection from 


Sto U. Showthat(gof) = !eg'!. 


53. 


54. 


59; 


56. 


57. 


58. 


59. 


60. 
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Let A = (1,2, 3, 4, 5}. Write each of the following permutations on A in cycle form. 
1 2 3 4 5 
drj G 1 5 4 >) 
b. f= {(1, 4), (2, 5), (3, 2), (4, 3), (5, D} 
Let A {a, b, c, d}. Write each of the following permutations on А in array form. 
a. f = {(a, c), (b, b), (c, d), (d, a)} 
b. f= (c,a, b, 4) 
c. f = (d, Б, a) 
d. f = (a, b) ° (b, d) ° (c, a) 
Let A be any set and let S, be the set of all permutations of A. Let f, g, h є S4. Prove that the functions 
h ° (g° f) and (h° g) »f are equal, thereby showing that we can write Л ° g ° f without parentheses to 
indicate grouping. 
Find the composition of the following cycles representing permutations on А = (1, 2, 3, 4, 5}. Write 
your answer as a composition of one or more disjoint cycles. 
а. (2,4, 5, 3) e (1,3) 
b. (3, 5, 2) ° (2, 1, 3) ° (4, 1) (By Exercise 55, we can omit parentheses indicating grouping.) 
с. (2,4) ° (1, 2, 5) ° (2, 3, 1) ° (5, 2) 
Find the composition of ће following cycles representing permutations оп A = (1, 2, 3, 4, 5, 6, 7, 8}. 
Write your answer as a composition of one or more disjoint cycles. 
a. (1, 3, 4) ° (5, 1, 2) 
b. (2, 7, 8) ° (1, 2, 4, 6, 8) 
с. (1, 3, 4) ° (5, 6) ° (2, 3, 5) ° (6, 1) 
а. (2, 7, 1,3) ° (2, 8, 7, 5) ° (4, 2, 1, 8) 


Find the composition of the following cycles representing permutations on N. Write your answer as a 
composition of one or more disjoint cycles. 


а. (3, 5, 2) ° (6, 2, 4, 1) ° (4, 8, 6, 2) 

b. (1, 5, 13, 2, 6) ° (3, 6, 4, 13) ° (13, 2, 6, 1) 

с (1,2) ° (1, 3) ° (1, 4) e (1, 5) 

Find ће composition of the following cycles representing permutations on A = (a, b, с, d, е}. Write your 
answer as a composition of one or more disjoint cycles. 


a. (a, d, c, e) ° (d, c, b) ° (e, c, a, d) ° (a, c, b, d) 
b. (e, b, a) ° (b, e, d) ° (d, a) 
c. (b, e, d) ^ (d, a) ° (e, a, c) ° (a, c, b, e) 


Find a permutation on an infinite set that cannot be written as a cycle. 


. The function f written in cycle form as f = (4, 2, 8, 3) is a bijection on the set N. Write f~' in cycle form. 
62. 


The “pushdown store,” or “stack,” is a storage structure that operates much like a set of plates stacked on 
a spring in a cafeteria. All storage locations are initially empty. An item of data is added to the top of the 
stack by a “push” instruction, which pushes any previously stored items farther down in the stack. Only 
the topmost item on the stack is accessible at any moment, and it is fetched and removed from the stack 
by a “pop” instruction. 

Let’s consider strings of integers that are an even number of characters in length; half the characters 
are positive integers, and the other half are zeros. We process these strings through a pushdown store as 


63. 


64. 


65. 


66. 


67. 


68. 


69. 
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follows: As we read from left to right, the push instruction is applied to any nonzero integer, and a zero 
causes the pop instruction to be applied to the stack, thus printing the popped integer. Thus, processing 
the string 12030040 results in an output of 2314, and processing 12304000 results in an output of 3421. 
(A string such as 10020340 cannot be handled by this procedure because we cannot pop two integers from 
a stack containing only one integer.) Both 2314 and 3421 can be thought of as permutations, 


1 2 3 4 1 2 3 4 
D sie ae 
respectively, on the set A = {1, 2, 3, 4}. 
a. What permutation of A = {1, 2, 3, 4} is generated by applying this procedure to the string 12003400? 
b. Name a permutation of A = (1, 2, 3, 4} that cannot be generated from any string where the digits 1, 2, 
3, and 4 appear in order, no matter where the zeros are placed. 
Let S = {2, 4, 6, 8} and T = (1,5, 7}. 
a. Find the number of functions from S to 7. 
b. Find the number of onto functions from S to 7. 
Let S = {Р, О, R} and T = {k, l, т, n}. 
a. Find the number of functions from S to 7. 
b. Find the number of one-to-one functions from S to T. 
a. For |S| = 2, 3, and 4, respectively, use the theorem on the number of functions to show that the number 
of one-to-one functions from S to S equals the number of onto functions from S to S. 
b. Argue that for |S| = n, f: 5 — S is one-to-one if and only if fis onto. 
c. Find an infinite set S and a function f: 5 — S such that fis one-to-one but not onto. 
d. Find an infinite set S and a function f: 5 — S such that fis onto but not one-to-one. 


Let A = (a, b, c, 4}. How many functions are in S,? How many of these functions are derangements? 
Write all the derangements in array form. 


Let |S| = n. In parts a-e, find the number of 
functions from S to S. 
. one-to-one functions from 5 to S. 
. functions from 5 onto S (see Exercise 65). 


a. 
b 
c 
d. permutations from S onto S. 
e. derangements from S onto S. 
f. 


Order the values obtained in parts (a) through (e) from smallest to largest and explain why this ordering 
is reasonable. 


a. A system development project calls for five different tasks to be assigned to Maria, Jon, and Suzanne. 
In how many ways can the assignment be done if each of the three workers must get at least one task? 


b. In how many ways can the projects be assigned 1f Maria must develop the test plan, which is one of the 
five tasks, but may do other tasks as well? (Hint: Consider the two cases where Maria does and does 
not do any of the other tasks.) 


In a programming class of seven students, the instructor wants each student to modify the program from a 
previous assignment, but no student should work on his or her own program. In how many ways can the 
instructor assign programs to the students? 


7 


[IN 


72. 


7 


чә 


74. 


75. 


76. 
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. Find a calculus book and look up the Maclaurin series representation for the function е". 


. Use the answer to part (a) to find a series representation for e' !. 


. Use a calculator to compute an approximate value for e^! to about 5 decimal places. 
p pp p 


aoe g& 


. How can the answer to parts (b) and (c) help you approximate the number of derangements of n objects 
when n is large, say, n > 10? (Hint: Look at Equation (4) in this section.) 


e. Apply this approach to Exercise 69 and compare the results. 
f. Approximately how many derangements are there of 10 objects? 


. Let f be a function, f: S > T. 


a. Define a binary relation p on S by x p y < f(x) = f(y). Prove that p is an equivalence relation. 
b. What can be said about the equivalence classes if fis a one-to-one function? 
c. For $ = T = Zand f(x) = 3x’, what is [4] under the equivalence relation of part (a)? 


Prove that S(m, n), the number of ways to partition a set of m elements into n blocks, is equal to 1/n! 
times the number of onto functions from a set with m elements to a set with л elements. (Hint: Consider 
Exercise 71.) 


. By the definition of a function f from 5 to T, fis a subset of 5 X Т where the image of every s є S under 


fis uniquely determined as the second component of the ordered pair (s, /) in f. Now consider any binary 
relation p from S to T. The relation p is a subset of S X T in which some elements of S may not appear 
at all as first components of an ordered pair and some may appear more than once. We can view p as a 
nondeterministic function from a subset of S to T. An s € S not appearing as the first component of an 
ordered pair represents an element outside the domain of p. For an s € S appearing once or more as a first 
component, p can select for the image of s any one of the corresponding second components. 

Let S = {1,2,3}, T = (a, b, c}, and U = (m, n, o, p}. Let p be a binary relation on S X T and с be 
a binary relation on T X U defined by 


p = (0,4, (1, Б), (2, Б), (2, с), (3, с)} 
с = {(a, m), (а, о), (а, р), (b, п), (b, р), (c, о)} 


Thinking of р апа o as nondeterministic functions from S to T and T to U, respectively, we can form 
the composition c ° р, a nondeterministic function from S to U. 


a. What is the set of possible images of 1 under ø ° p? 

b. What is the set of possible images of 2 under ø ° p? of 3? 

Let f be a function, f: S > T. 

a. Show that for all subsets А and B of S, f(A B) C f(A) O fB). 

b. Show that ДА N B) = f(A) (Y f(B) for all subsets А and B of S if and only if fis one-to-one. 


Let C be a collection of sets, and define a binary relation p on C as follows: For 5, Te С, Sp T < S is 
equivalent to T. Show that p is an equivalence relation on C. 


Group the following sets into equivalence classes according to the equivalence relation of Exercise 75. 


A = {2,4} 

B=N 

€ = {x|x e N and (3yY(y e N and x = 2*y)} 
D = {a, b, с, d} 

E = g({1, 2}) 

F=Q* 


| 412 Relations, Functions, and Matrices 


Exercises 77 and 78 involve programming with a functional language. Functional programming languages, as 
opposed to conventional (procedural) programming languages such as C++, Java, or Python, treat tasks in 
terms of mathematical functions. A mathematical function such as f(x) = 2x transforms the argument 5 into 
the result 10. Think of a program as a big function to transform input into output. A functional programming 
language contains primitive functions as part of the language, and the programmer can define new functions as 
well. Functional programming languages support function composition, allowing for complex combinations of 
functions. Using the functional programming language Scheme, we can define the doubling function by 


(define (double x) 
(* 2 x) 


The user can then run the program and type 
(double 5) 
which produces an immediate output of 10. 
77.a. Write a Scheme function to square a number. 
b. What is the output from the following user input? 
(double (square 3)) 


78. Scheme also supports recursion, plus the usual control structures of procedural languages, such as condi- 
tional and iterative statements. 


a. Given the Scheme function 


(define (mystery n) 
(cond ((= n 1) 1) 
(else (*n (mystery (- n 1)))))) 


what is the output of 
(mystery 4) 


b. The “mystery” function is better known as 


SECTION 5.5 | ORDER OF MAGNITUDE 


Function Growth 


Order of magnitude is a way of comparing the “rate of growth” of different 
functions. We know, for instance, that if we compute f(x) = x and g(x) = x? for 
increasing values of x, the g values will be larger than the f values by an ever 
increasing amount. This difference in the rate of increase cannot be overcome 
by simply multiplying the f values by some large constant; no matter how large a 
constant we choose, the g values will eventually race ahead again. Our experience 
indicates that the f and g functions seem to behave in fundamentally different 
ways with respect to their rates of growth. In order to characterize this difference 
formally, we define a binary relation on functions. 

Let S be the set of all functions with domain and codomain the nonnegative 
real numbers. We can define a binary relation on S by 


f p g < there exist positive constants no, с, and c; such that, for all x = ле, 
cig(x) = f(x) = oga) 
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| EXAMPLE% | ҮШ Let fand g be functions in 5 where f(x) = Зх? and g(x) = 200: + 140x + 7 


1 
Let n = 2, су = 100? апа c, = 1. Then for x > 2, 


1 
Tog 200% + 140x + 7) = 3x* = (1)(200x? + 140x + 7) 


or 
2x* + 14x + 0.07 = 3x* = 200x? + 140x + 7 (1) 


Therefore fp g. ө 


PRACTICE 40 


a. Verify the inequality in Equation (1) for the following values of x: 2, 3, 4, 5. (Use a calculator.) 

b. In Example 44, can n, have the value 1 if c, and c; remain the same? 

c. Find a different set of three values no, c,, and c; that will also work to show that f p g in 
Example 44. a 


The relation p is an equivalence relation on S. For example, to prove that 
fof, we can pick ny = c, = c; = 1 and have 


(ПЛ) = f(x) = Df) 


PRACTICE 41 


a. Prove that p is symmetric. 
b. Prove that p is transitive. и 


Given that р is an equivalence relation, it partitions S into equivalence classes. 
If fis in the same class as g, then fis said to have the same order of magnitude as g, 
denoted by f = O(g) and pronounced “fis order о” or sometimes “fis big theta of 
g?” Because of symmetry, this also means that g is the same order of magnitude as 
f. org = OX f). (The notation f = O(g) is a bit of a misuse ofthe equality symbol 
because O(g) is not some function identical to f. It is just a shorthand way of say- 
ing that f € [g] under the equivalence relation p defined above.) 


DEFINITION ORDER OF MAGNITUDE 
Let f апа g be functions mapping nonnegative reals into nonnegative reals. Then 
fis the same order of magnitude as g, written f = O(g), if there exist positive 
constants no, Cı, and c; such that for x = no, c.g) = f(x) = се (х). 
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We will usually try to find the simplest representative of a given equivalence 
class. Thus for the functions fand g of Example 44, we would say f = Ө(х?) and 
g = O(x)). A polynomial is always the order of magnitude of its highest-degree 
term; lower-order terms and all coefficients can be ignored. This is not surprising, 
since for large values of x, the highest-degree term will dominate the result. 


PRACTICE 42 Prove (by finding appropriate constants that satisfy the definition of order of magni- 


tude) that f= OQ?) and g = Ө(х?) for the functions f and g of Example 44. 


h(x) = x2 


To understand more intuitively what these equivalence classes mean, we'll 
draw some graphs. Let h(x) є S, where A(x) = x°. Figure 5.25 shows the graph 
of h(x). Now suppose we multiply the / values by the two constants c, = 1/2 and 
Cy = 2. The functions cj/(x) and cjh(x) are shown as dotted lines in Figure 5.26. 
These dotted lines form a kind of envelope around the Л(х) values, roughly trac- 
ing the shape of A(x). Changing the value of the constants changes the width of 
the envelope but not the basic shape. If h(x) is a function with Л, = O(A), then 
there is some positive constant nọ and some envelope around A such that for all 
domain values to the right of ло, the A, values must fall within this envelope, as 
shown in Figure 5.27. Therefore the Л; values can never stray too far from the Л 
values. The functions л; and л are roughly the same size—they are the same order 
of magnitude. 


Figure 5.25 


Figure 5.26 Figure 5.27 


Let f(x) = x and A(x) = x’. Figure 5.28 illustrates that for the constants c, = 1/2 
and c; — 2, f'soon falls below the envelope. Reducing the c, constant (lowering 
the bottom edge of the envelope) only postpones the problem. Formally, we can 
do a proof by contradiction to show that fis not Ө(х?). Suppose f = O(x?). Then 
there exist constants n, and c, with cy? = f(x) for x = пу. But this would imply 
that cx? € x or ex € 1 or x = 1/с| for all x = no. Because c, is fixed, we can 
always choose x large enough so that x > 1/c,, which is a contradiction. Therefore 
f(x) = x is not O(x?). 


> X 
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f(x) =x 


Figure 5.28 e 


If we imagine functions representing various forms of transportation, then 
functions that are the same order of magnitude (belong to the same equivalence 
class) represent the same mode of transportation. One class represents travel on 
foot, another class represents travel by automobile, a third represents travel by air. 
Speeds within a given mode are about the same; ignoring coefficients and low- 
order terms amounts to ignoring the difference between walking and running or 
between a Jeep and a Jaguar or between a Cessna and a Boeing 787. Walking (at 
any speed) is distinctly different from driving, which is distinctly different from 
flying. 

We can imagine a hierarchy of orders of magnitude. For example, the class 
O(x) is a lower order of magnitude than the class G(x)? because functions that are 
O(x) eventually fall below functions that are ex). Also, the class O(log x) is a 
lower order of magnitude than O(x) (see Exercise 15 at the end of this section). In 
our transportation analogy, walking is slower than driving is slower than flying. 

A sort of arithmetic can be developed using order of magnitude. For example, 
if f(x) = x and f(x) = x^, then the function Cfi + Ax) = fix) + f(x) = 
x + x? = 0(x). In general, if f; = Olg) and f; = Ole), then ff f = 
O(max(gi,g;)) (see Exercise 8). When expressed in abbreviated form, 
this leads to somewhat bizarre equations such as Ө(х) + O(x?) = O(x’) or 
Ө(х2) + Ө(х2) = Ө(х?). 


More on Analysis of Algorithms 


Order of magnitude is important in analysis of algorithms, which we discussed in 
Section 3.3. In analyzing an algorithm, we identify the important tasks the algo- 
rithm performs. Usually the number of times such tasks must be done in executing 
the algorithm will depend on the size of the input. For example, searching a list of 
п elements or sorting a list of п elements will require more work as п increases. 
Typically, we can express input size as a nonnegative integer, so the functions that 
express the amount of work will be functions with domain N. We found in Section 
3.3 that a sequential search of л elements requires и comparisons in the worst case, 
while a binary search requires 1 + log n comparisons in the worst case (assuming 
nis a power of 2). Rather than compute the exact functions for the amount of work 
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done, it is easier and often just as useful to settle for order-of-magnitude informa- 
tion. Sequential search is O(n) and binary search is O(log n) in the worst case. 
Thus binary search is an order-of-magnitude improvement over sequential search. 
The order of magnitude of an algorithm is also known as its computational com- 
plexity because it reflects the amount of work inherent in the algorithm. Table 5.3 
gives a summary of the worst-case order of magnitude for algorithms discussed in 
the text and exercises of Section 3.3. 


TABLE 5.3 

Algorithm Operation Worst-Case Order of Magnitude 
SequentialSearch on list size n comparisons O(n) 

BinarySearch on sorted list size n comparisons O(log п) 

cae eae of pattern of length m comparisons amn) 

Evaluation of a polynomial of degree n multiplications and additions O(n) 

BubbleSort of list size п comparisons en?) 

SelectionSort of list size n comparisons Ө(п?) 

MergeSort of list size n comparisons Ө(п log n) 

QuickSort of list size n comparisons enm) 


To appreciate the effect of order of magnitude in evaluating algorithms, 
suppose we have two algorithms A and A’ to do the same job but they differ 
in order of magnitude—say, A is O(n) and A’ is O(n’). Even if each step in а 
computation takes only 0.0001 second, this difference will affect total computation 
time as n grows larger. The first two rows of Table 5.4 give total computation 
times for А and A' for various values of input length. Now suppose a third al- 
gorithm A” exists whose order of magnitude is not even given by a polynomial 
function but by an exponential function, say 2". The total computation times for 
A" are shown in the third row of Table 5.4. 


TABLE 5.4 
Total Computation Time 
Size of Input n 
Algorithm | Order 10 50 100 
A n 0.001 second 0.005 second 0.01 second 
A' n? 0.01 second 0.25 second 1 second 
А” 2" 0.1024 весопа 3570 years 4 X 108 centuries 


© DEFINITION 
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Note that the exponential case grows at a fantastic rate! Even if we assume 
that each computation step takes much less time than 0.0001 second, the relative 
growth rates between polynomial and exponential functions still follow this same 
pattern. Because of this immense growth rate, algorithms not of polynomial order 
are generally not useful for large values of n. In fact, problems for which no poly- 
nomial time algorithms exist are called intractable. 

Sometimes algorithms that are not polynomial in the worst case may still be 
efficient for *average"—Aand useful—input cases.” Nonetheless, in attempting to im- 
prove efficiency, we should ask whether a different algorithm of a lower order of 
magnitude exists before we worry about the details of fine-tuning a given algorithm. 

If f(n) represents the work done by an algorithm on an input of size n, it may 
be difficult to find a simple function g such that f = O(g). Remember that if we 
can find such а g, then f and g are functions that eventually (for large enough n) 
have roughly the same shape. But we may still be able to find a function g that 
serves as an upper bound for f. In other words, while / may not have the same 
shape as g, f will never grow significantly faster than g. Formally, this is expressed 
by saying that f = O(g) (“fis big O of g”). 


BIG OH 

Let fand g be functions mapping nonnegative reals into nonnegative reals. Then 
fis big oh of g, written f = O(g), if there exist positive constants по and c such 
that for x = no, f(x) = cg(x). 


If f = O(g), then g is a ceiling for f and gives us a worst-case picture of the 
growth of f. In Section 3.3, we learned that if E(n) is the number of divisions 
required by the Euclidean algorithm to find gcd(a, b), where b < a = n, then 
E(n) = O(log n) 

The big oh notation f = O(g) says that f grows at the same rate or at a slower 
rate than g. But if we know that f definitely grows at a slower rate than g, then we 
can say something stronger, namely that fis little oh of g, written f= o(g). The 
relationship between big oh and little oh is this: If f = O(g), then either f = O(g) 
or f = o(g), much like a = b says that either a = b or a < b. 


The Master Theorem 


In Section 3.2, we learned that a solution for divide-and-conquer recurrence rela- 
tions of the form 


S(n) — (2) + g(n) forn = 2,n = 2" 


is given by 


log n 
S(n) те: c'£^S(1) + X ce 80 
i=1 


ie 


?This is the case with the well-known simplex method for solving linear programming problems, which are a 
generalization of systems of linear equations to systems of inequalities—see Section 5.7. 
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This type of recurrence relation arises in the analysis of an algorithm that splits 
the input in half and operates recursively on one or more of the halves. A more 
general divide-and-conquer recurrence relation divides the input into subproblems 
that are each of size n/b and then operates recursively on a of the subproblems. 
Each subproblem requires S(n/b) work with g(n) representing the work required 
to divide into subproblems or recombine the results of solving the subproblems. 
Such a recurrence relation would have the form 


Stn) = as( 2) + a(n forn = 2n = ir Q) 


We assume that n is an integral power of Б so that dividing п by b over and over 
always results in an integer. 

A result called the master theorem gives us order-of-magnitude results via a 
cookbook formula for the case when g(n) = п“. Thus, assume the recurrence rela- 
tions of interest look like this? 


so) = as(2) + тюл = 2,0 =” 3) 


Ф THEOREM MASTER THEOREM 
Consider the recurrence relation 


S(1) = 0 


S(n) = (7) + n fora = 2 


where n = b”, a, and b are integers, a = 1, b > 1, and c is a nonnegative real 
number. Then 


а <В S(n) = O(n’) 
2. ifa = bf S(n) = O(n‘ log n) 
3. ifa > Б S(n) = Ө(п!°“) 


| EXAMPLE 46 | С) A recurrence relation of the form 


Stn) = 48(2) +m forn = 2,n= 5" 


matches the form of the recurrence relation shown in the master theorem where 
а = 4, b = 5, and c = 3. Because 4 < 5°, Case 1 applies and the master theorem 
says that S(n) = O(n’). ө 


ЗА more complicated master theorem gives similar results for the case of Equation (2). 
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S(n) 


Note that the master theorem does not give exact solutions; it gives order-of- 
magnitude results. Also, the value of S(1) (a constant) plays no part in determin- 
ing the result. 


The worst-case recurrence relation for comparisons done by the BinarySearch al- 
gorithm on a sorted list of size п (Example 30 in Chapter 3) is 


С(1) =1 


@ = 45) + 1 гп = 2,п = 2" 


We found the exact solution to be C(n) = 1 + log n, certainly O(log л). We can 
find this directly from the master theorem with a = 1, b = 2, c = 0. Because 
a = b°, C(n) = O(n log п) = O(log n). ө 


Proof of the Master Theorem 


To prove the master theorem, we’ll go back to an expansion technique. We apply 
the “recipe” of Equation (3) over and over. The recipe is that S at some value is a 
times 5 at that value divided by 2, plus that value to the c power. Therefore 
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This expansion must stop when n/b* = 1, or n = b*, which means k = m. At that 
point, using summation notation, 


m=i Г nN 
S(n) = a"S(1) + | (+) 


= a"S(1) + | 


| 
~ 3 
ll | 
© = 
S 
ны, 
a 
bte c 


т—1 i 
= а"$(1) + n° У (=) (4) 


Because л = b", it follows that m = log,n, so Equation (4) can be written as 


m=] i 
S(n) = а! "s(1) ak п У (=) 
ї=0 
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and this equation in turn, using property 11 of the logarithm function (see 
Appendix C), can be written as 


т—1 i 
so) = nsa + nS (ж) (5) 
i=0 


We’ll see the expression log, a frequently, so we’ll temporarily give it a simpler 
name: Let w = log,a. Then, expanding the summation, Equation (5) can be 
written as 


S(n) = n"S(1) 4 2l | (=) | (2) EN (sy 7 


We can see that the expression in brackets represents the sum of the first m terms 
of a geometric sequence with first term equal to 1 and common ratio r equal to 
(a/b^). If r # 1, this sum (see Exercise 27 in Section 2. 2) has the value 


and 


S(n) = n"S(1) + n° 


: (6) 
:-(g) 

b 
The condition r # 1 means (a/b^) ¥ 1 or a # b^. This means that Equation (6) 
holds for both Case 1 and Case 3 of the master theorem, so to prove those cases 


we'll do some tiresome algebra on Equation (6). 
First, note that 


m log, n с login c „log, a c, log, a 
[4 d 4 na nn nn log, à " 
n = = = : = И =n 


be be ES bt log, n pls ne nt 


so Equation (6) becomes 


S(n) = n"S(1) 4 —% = n”S(1) + 


or 


S(n) = n"S(1) 4 ra rd H n" (7) 
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Case 1: a < Б“. From a < b° we get (taking log, of both sides) 
log,a<corw<c 


In the right side of Equation (7), n^ is the highest-power term and has a positive 
coefficient (because b° — a > 0). Therefore S(n) = O(n‘). 


Case 3: a > b°. From a > Б we get (taking log, of both sides) 
Іова > согу >с 


In the right side of Equation (7), п” is the highest-power term and has a positive 
coefficient (because a — b° > 0). Therefore S(n) = O(n") = G(n'* ^). 


Case 2: a = b“. This is an easy case to prove and is left for you to do (see 
Exercise 28). 


SECTION 5.5 REVIEW 


TECHNIQUES MAIN IDEAS 
e Determine whether two functions are the same ° Functions can be grouped into equivalence classes 
order of magnitude. according to their order of magnitude, which is a 
W) Use the master theorem to find an order-of- measure of their growth rate. 
magnitude expression for the solution to certain ° Big theta, big oh and little oh (0, О, о) are 
divide-and-conquer recurrence relations. notations for relating the growth rates of two 
functions. 


EXERCISES 5.5 


1. 


. Prove, by finding constants that satisfy the definition of order of magnitude, that f= O(g) i 
. Prove, by finding constants that satisfy the definition of order of magnitude, that f= O(g) i 


. Prove, by finding constants that satisfy the definition of order of magnitude, that f= O(g) 1 


Prove, by finding constants that satisfy the definition of order of magnitude, that f = O(g) if f(x) = x and 
g(x) = 17x + 1. 


. Prove, by finding constants that satisfy the definition of order of magnitude, that f= O(g) if 


f(x) = Зх — 7x and g(x) = x°/2. 


. Prove, by finding constants that satisfy the definition of order of magnitude, that f= O(g) if 


Л) = 29x! — 4x — 15 and g(x) = 15x? + x. 


= 


fe) = Vx + 100 and g(x) = Vx. 


= 


Л) = x + log x and g(x) = x. 


= 


f(x) = log(3x?) and g(x) = log x. 


. In this section, we noted that ^, = O(A) implies that from some point on, / is within an “envelope” of A. 


Can this envelope ever be entirely above or entirely below Л? Explain. 


. Prove that if fi, is a function that is O(g,) and f; is a function that is O(g;), then the function / + р, 


defined by (fi + 5)() = Лб) + fŒ), is O(max(g;, 82)), where (max(g;, g;))(x) = max(gi(), 22%). 


. Find the smallest integer n for which x log x is О(х"). 
10. 


Find the smallest integer n for which (x* + 4x)/(x + 2) is О(х”). 
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Exercises 11—18 require familiarity with ideas from calculus. As an alternative to the definition of order of 
magnitude, a limit test can be used to prove that f= O(g): 


x 
f= Gg) if lim =p where p is a positive real number 
хоо g X 


As an aid in finding the limit of a quotient, if lim... f(x) = © and іт, о g(x) = © and f and g are 
differentiable functions, then L' Hópital's rule says that 


ПО СИИ. 
im = lim 


хэ B(x) хэ®а'(х) 


Thus 2x? + 7 = O(x?) because 


We +7 4 

lim ~—— = lim = 2 

x00 x x90 2x 
11. Use the limit test to do Exercise 1 again. 
12. Use the limit test to do Exercise 2 again. 

As another limit test, 1f 
x 
int = 
хэ g(x) 


then f= o(g). 
13. 
14. 


se the second limit test to prove that x = o(x?). 


se the second limit test to prove that Vx = o(x). 


16. 
17. 


se the second limit test to prove that (In x)? = o(x^?) where In x is the natural log of x, log, x. 


U 
U 
15. Use the second limit test to prove that log x — o(x). 
U 
U 


se both limit tests to group the following functions into classes by order of magnitude and to order those 
classes. Here In x is the natural log of x, log, x. 


17 x log x, 200 log x, 2* — x2, Wx, 10x? — 3x + 5, 420x, 41 In x? 


18. Use both limit tests to group the following functions into classes by order of magnitude and to order those 
classes. Here In x is the natural log of x, log, x. 


х, Vx, log x, x^, x log x, 2x? + x, е", (log x)’, In x, x? + log x 


19. You ask three different people to give you a worst-case order-of-magnitude expression for the work done 
by a particular algorithm on an input of size n. You receive three answers: 


i. O(n’) 

ii. о(п?) 

iii. O(n’) 
Which is the most useful and why? 


20. 
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An algorithm to determine whether a propositional wff with n statement letters is a tautology works by 
assigning, one at a time, all possible sets of truth values to the statement letters. The unit of work for this 
algorithm is the examination of one set of truth values. Explain why this algorithm is @(2”) in the worst 
case. 


For Exercises 21—26, use the master theorem to determine an order-of-magnitude expression for the work done. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


S(n) = (2) + п? 
S(n) = (2) +n 
S(n) = Ө +n 
S(n) = (7) + 
S(n) = (2) + Vn 
S(n) = (2) + 


The worst-case recurrence relation for comparisons done by the MergeSort sorting algorithm is 


са) = 0 
C(n) = 4) + (п = 1) огл = 2, п = 2" 


a. Use the master theorem to find an order-of-magnitude expression for the solution to the related recur- 
rence relation. 


Сп) = xc(2) + onfornz2,n-2" 


b. Compare the result of part (a) with the exact solution to C(n) (Exercise 22 of Section 3.3). 


Prove the master theorem for the case where a = b°. (Hint: Start with Equation (5). Also remember that a 
change of base of logarithms only involves multiplication by a constant.) 


SECTION 5.6 | THE MIGHTY MOD FUNCTION 


In Section 5.4 we defined the modulo л function as follows: 
Ifx = qn+r,0 Sr < п, ћеп х mod n =r. 


In other words, x mod n is the nonnegative remainder (also called the residue of x 
modulo п) when x is divided by the positive integer n. This seemingly innocuous 
function turns up in a surprising number of applications, some of which will be 
explored in this section. 


EXAMPLE 48 


PRACTICE 43 
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Military time uses a 24-hour clock, where the hours run from 0 through 23. One 
half hour before midnight in military time is 23:30. Standard time uses АМ and PM 
for the first half of the day and the second half of the day, respectively, with the 
hours in each half running from 0 through 11. One half hour before midnight in 
standard time is 11:30 рм. Conversion from military time to standard time uses the 
modulo 12 function: 


8:00 military time: Compute 8 mod 12 = 8 gives 8:00 am standard time 
16:00 military time: Compute 16 mod 12 = 4 gives 4:00 рм standard time 


The АМ or РМ designation is determined by whether the quotient in the division is 
0 (AM) or 1 (PM). Counting modulo 12 begins at 0 and wraps around to 0 again 
when 12 is reached: 


(0 Ih 2p Ah S Oh ООЗЕ ООК 7 


This counting scheme is sometimes taught to young children as “clock arithmetic.” @ 


Section 5.1 defined an equivalence relation called congruence modulo л: 
x = y (mod n) if x — y is an integral multiple of n. 


Given the similar terminology, one would expect that congruence modulo л and 
the modulo л function would surely be related. 


a. Prove that x = y (mod n) if and only if (x — y) mod n = 0. 
b. Prove that x = y (mod n) if and only if x mod n = y mod n. и 


Exercises at the end of this section show how some arithmetic operations 
using mod n can be broken down for simpler calculation. One such result 
(Exercise 7) is particularly useful: 

(ху) mod n = (x mod n : y mod n) mod n (1) 
By Equation (1), 
220 mod 6 = (22 mod 6 · 10 mod 6) mod 6 = (4 · 4) mod 6 = 16mod6 = 4 


In addition, spreadsheet software usually contains a mod function that speeds up 
computations. 


Hashing 


A hash function is a function A: S — T where the domain S is a set of text strings 
or integer values, and the codomain T is the set of integers {0, 1, ...,/ — 1} 


EXAMPLE 49 
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where f is some relatively small positive integer. If the domain 5 consists of text 
strings, we can imagine them encoded in some way into integer values, perhaps 
by an algorithm as simple as converting each individual letter of a text string into 
its position in the alphabet (a — 1, b — 2, and so on) and adding up the resulting 
list of integers to get one integer value. Therefore we can assume that S consists of 
integer values to begin with. 

The function A therefore maps a potentially large set of integer values 5 into 
a relatively small window of integer values 7. Consequently Л isn't likely to be a 
one-to-one function because there may be many values хү, х›, X3, ... from S such 
that h(x,) = A(x) = h(x3), in which case we say that x,, x», and x; all “hashed” to 
the same value. The term “hash function” came about because the domain value 
x is often chopped up into pieces in the process of computing A(x). Whatever the 
function A(x) does to x, the last step is almost always to apply the mod ¢ function 
so the final value of h(x) falls in the codomain (0, 1, ...,/ — 1}. 


S is a set of positive integers, T is the set of integers (0, ... , £ — 1} and the hash 
function A(x) is given by 


h(x) = x modt 
If t — 10, for example, then values are computed modulo 10: 


h(7) = 7mod 10 = 7 
h(23) = 23 mod 10 = 3 
h(59) = 59 mod 10 = 9 

h(158) = 158 тоа 10 = 8 
h(48) = 48 mod 10 = 8 


Here 158 and 48 hash to the same value. e 


A hash function is often used as part of a search algorithm. We have already 
discussed two search algorithms. In sequential search, n elements are stored in an 
unordered (random) list and a given target value is compared one-by-one with the 
list elements. In binary search, п elements are stored in a sorted list. The given 
target value is compared with the midpoint of the list, the midpoint of half the list, 
and so on. In a search using a hash function, n elements are stored in an array (a 
one-dimensional table) called a hash table, where the array is indexed from 0 
through £ — 1; the table is of size t. The element x is passed as the argument to 
the hash function and the resulting A(x) value gives the array index at which the 
element is then stored. Later, when a search is carried out, the target value is run 
through the same hash function, giving an index location in the hash table, which 
is where to look for the matching stored element. 

However, because the hash function is not one-to-one, things are not quite 
that simple. As the search list is loaded into the hash table, different values may 
hash to the same array index, producing a collision. There are several collision 
resolution algorithms available. One is called linear probing—just keep going in 
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the array (looping back to the top of the array when you get to the bottom) and 
store element x in the next available empty slot. Another method, called chaining, 
builds a linked list for each array index; the list is initially empty but eventually 
contains all elements that hashed to that index value. 


Using the hash function of Example 49, the elements 7, 23, 59, 158 and 48 are 
loaded in order into the hash table. The size of the hash table is the same as the 
modulo value used (10). With linear probing, the resulting hash table looks like 
Figure 5.29(a); 48 hashes to index 8 (occupied), tries index 9 (occupied), and fi- 
nally finds a slot at index 0. With chaining, assuming elements are added to the 
front of the list, the hash table looks like Figure 5.29(b). 


0| 48 O| e 

1 1 e 

2) 2| с 

3| 23 9 —m| 23 |» 
4 4 P 

5 3| € 

6 6| e 

JW! 3 7 —m 7 Je 
8| 158 8 —P-| 48 | T—-| 158 |e 
9! 39 9 —>| 59 |e 
(a) (b) 

Figure 5.29 


After the hash table is built, we can perform searches for target elements. To 
search for a target element of 48, the linear collision resolution scheme would 
repeat the steps used to store the 48—hash the target 48 to 8, then check index 9 
and then index 0, where the element is found. The target would be compared with 
three different values (158, 59, 48) to reach a successful conclusion. To search for 
a target of 68, the target would hash to index 8 (no match); then the elements at 
index 9 and index 0 would be checked, also with no success. Advancing to index 1, 
the hash table contains an empty slot, so the search concludes (after four compari- 
sons, counting comparing the target against an empty cell) that 68 is not stored in 
the hash table. The chaining scheme would hash a target value of 48 to index 8 
and then search (sequentially) the resulting linked list, finding the target value at 
the beginning of the list after one comparison. To search for a target value of 68, 
the search algorithm would search the same linked list but would get to the end, 
concluding that 68 is not stored in the hash table. e 
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PRACTICE 44 | Show the resulting hash tables if 28 is the next value stored in the hash tables of Figure 


5.29(a) and (b). 


Voting records for the citizens of a certain precinct are stored in a hash table of 
size 197 using the voter's Social Security number as a key value (the value that is 
hashed). Using a hash function modulo the hash table size, the index to search first 
for data on a voter with Social Security number 


328356770 
18 


328356770 mod 197 = 125 e 


Two goals should be kept in mind in designing a hash function: 


1. Given an argument value x, A(x) can be computed quickly. 
2. The number of collisions will be reduced because h(x) does a good job of 
distributing values throughout the hash table. 


Use of a modulo function as the hash function accomplishes the first goal. The 
second goal is harder to achieve because data can always be found that produce a 
large number of collisions, but distribution seems to work better on the average if 
the table size (the modulo value) is a prime number. 

The average number of comparisons required to search for an element us- 
ing hashing depends not on the total number n of elements in the hash table, but 
rather on the ratio of 7 to the total table size t. If this ratio is low, then (using linear 
probing) there are lots of empty slots, so you won’t have to look very far to find 
a place to insert a new element into the table or, correspondingly, to search for a 
target element that is in the hash table. Similarly, if this ratio is low and chaining 
is used, the average length of any linked list you may have to (sequentially) search 
for a target element should be short. This ratio n/t is called the load factor of the 
hash table. In sequential search or binary search, more elements in the set to be 
searched increases the work (number of comparisons) required. But using hash- 
ing, more elements can be searched just as efficiently as fewer provided the hash 
table size grows accordingly so that the load factor stays low. 


Computer Security 


Computer security (or the more general term information assurance) is a topic 
of critical interest when our economy, defense, and indeed our entire way of life 
depend so much on computers and information. The mod function plays a part in 
many aspects of security. 


Cryptography 
Children often delight in sending “secret messages” using some encoding/ 
decoding scheme that they know and that (they believe) their parents don’t know! 
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In the adult world, military information, financial information, and company pro- 
prietary information that must be transmitted securely uses the same process. The 
original information (called the plaintext) is encrypted using an encryption key, 
resulting in coded text called the ciphertext. The ciphertext is transmitted and 
when it is received, it can be decoded using the corresponding decryption key. 
Encryption and decryption are inverse functions in the sense that 


decryption (encryption (plaintext)) = plaintext 


If the message is intercepted by someone with no knowledge of the decryption key, 
it will not be useful unless the encryption scheme can be broken. Cryptography 
is the study of various encryption/decryption schemes. The broader term 
cryptology includes not only cryptography but also the techniques used to ana- 
lyze and “break” coded messages. One of the most famous examples of breaking 
coded messages occurred during World War П when a British team that included 
mathematician Alan Turing, working at Bletchley Park, was able to break the 
supposedly invincible German “Enigma” code and decipher the plans of German 
submarine movement. 

Military use of cryptographic techniques can be traced back to Julius Caesar 
who sent messages to his generals in the field using a scheme now known as the 
Caesar cipher. Let us assume that plaintext messages use only the 26 capital let- 
ters of the alphabet, that spaces between words are suppressed, and that each letter 
is first mapped to its corresponding position in the alphabet: 


7 |8 |9 |10/11 |12 113 |14 |15 |16 |17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 


We'll denote this mapping as the bijection g: {A,...,Z}— {0,... , 25}. 
Then a positive integer key value / is chosen that shifts each number k posi- 
tions to the right with a “wrap-around” back to the beginning if needed (this is 
the mod function). Finally, the function g ' is applied to translate the resulting 
number back into a letter. This produces the final ciphertext character c that 
corresponds to an original plaintext character p. Informally, we could compute 
с from p by just shifting over, in a circular fashion, k positions in the list of let- 
ters, but the conversion to numbers allows us to write a mathematical function 
to carry out the encoding and decoding algorithmically. The encoding function 
is given by 


f(p) = g (lep) + к] mod 26) 


The decoding process is to find the number for c, shift left k positions with a 
"wrap-around" back to the end if needed, then change the resulting digit back into 
a character. The decoding function 15 


f «og (lele) – k] mod 26) 
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| EXAMPLES? | In a Caesar cipher with k = 3, 


E encodes іо g [4 + 3] mod 26) = g !(7 mod 26) = g (7) = Н 
Y encodes to g !([24 + 3] mod 26) = е (27 mod 26) g (1) = B 


And for k = 3, 


Н decodes to р !([7 — 3] mod 26) = g ‘(4 mod 26) = g (4) = E 
В decodes to g ([1 — 3] mod 26) = р !(-2 mod 26) = р (24) = Ү e 


Notice that if you intercept a message encoded using a Caesar cipher, you 
need to try only 26 possible key values to break the code (actually only 25 since 
k — 26 shifts each number back to itself). The Caesar cipher is not a very secure 
code; it may have served well, however, in an era when not many people could 
read at all. For additional security, the function g could be a less obvious bijection; 
the number of possible bijections g is 26! This would mean a total of 26!*25 pos- 
sibilities to try for g and К, too large a number even if you had your handy laptop 
with you in Gaul. As a cryptanalyst, you would narrow down the possibilities by 
making use of statistical characteristics of the language, such as individual letter 
frequency, frequency of certain letter pairs, and so on. 


PRACTICE 45 | Decode the following ciphertext that was encoded with a Caesar cipher using a key of 7: 
AOLJHAPUAOLOHA. Е 


The Caesar cipher is a simple substitution cipher, meaning that each plain- 
text character is coded consistently into the same single ciphertext character. 
Encryption techniques where a single plaintext character contributes to several 
ciphertext characters (and one ciphertext character is the result of several plaintext 
characters) introduce diffusion. The advantage to diffusion is that it hides the fre- 
quency statistics of individual letters, making analysis of an intercepted ciphertext 
message much more difficult. 

DES (Data Encryption Standard) is an internationally standard encryption 
algorithm developed in 1976. DES was developed to safeguard the security of 
digital information, so we may consider the plaintext to be a string of bits (a string 
of Os and 1s). Unlike the Caesar cipher that encodes each single plaintext charac- 
ter individually, DES is a block cipher. A block of 64 plaintext bits is encoded as 
a unit using a 56-bit key. This results in a block of 64 ciphertext bits. However, 
changing one bit in the plaintext or one bit in the key changes about half of the 
resulting 64 ciphertext bits, so DES exhibits high diffusion. One might expect 
this effect to require some extremely complex mathematical encoding function, 
but DES actually uses many simple operations over and over. The DES algorithm 
calls for 16 “rounds” to be done; the original 56-bit key is modified from one 
round to the next, as is the original 64-bit plaintext block. The modifications in- 
volve the following, among other things: 
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* Running bit strings through permutation functions f(i) = j such that the 
new bit value at position i in the string is the old bit value at position j 

* Combining bit strings from the plaintext and the key using a bitwise 
exclusive-ORoperation CD (two Os or two Is result in a 0 bit, while a single 
0 and a single 1 result in a 1 bit) 

* Changing the key by splitting the bit string in half and performing a circu- 
lar left shift on each half of 1 bit or 2 bits, depending on the round number 


It's easy to look at a bit string and see informally what the result of a circular 
left shift will be. But, as in the case of the Caesar cipher, we'd like to put the shift 
on a mathematical (algorithmic) footing, and the mod function comes into play. 


Consider a 5-bit binary string x, such as 11010. A 1-bit circular left shift of x would 
involve moving bits as follows— 


( NN NS 


—resulting in 


Thinking of 11010 as a binary number, each column represents a power of 2, as 
opposed to the powers of 10 that our decimal numbers represent. The decimal 
equivalent of x = 11010 is therefore 


1:270 1:23 + 0:22 + 1:2! + 0:20 = 16-8092 = 26 


and the decimal equivalent of y = 10101 is 16 + 4 + 1 = 21. 

To mathematically compute y from x, we can carry out the following steps. 
While all steps are done on binary strings, we will also show the decimal equiva- 
lents. Multiplication of a binary string by 2 moves the digits one column to the left, 
padding the rightmost column with 0; multiplication by 2^ ^ moves the digits four 
columns to the right, padding the four leftmost columns with 0, just as happens 
when multiplying a decimal number by 10 or by 10 ^, respectively. 


Binary Decimal 

x — 11010 26 

let p = x mod 2* = 01010 10 (26 mod 16) 
letg = p:2 = 10100 20 
lets = x © р = 10000 16 

lett = 5-2 ^ = 00001 1 


lety = q + ¢ = 10101 21 
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Here y is the result of a 1-bit left circular shift of x. This algorithm generalizes 
to an n-bit string x (p = xmod2" ,q— p-2,5 = (улп = эе * D, and 
y = q + t). A2-bit left circular shift can be accomplished by two 1-bit shifts. ө 


PRACTICE 46 | Use the algorithm of Example 53 to compute the 1-bit circular left shift of 1011 (write the 
bit strings for x, p, q, and so on). 


A ciphertext that is the result of a DES encryption can be decoded into plain- 
text by reversing all the steps of the encoding process, including applying the keys 
for each round in the reverse order in which they were used for encoding. Because 
the DES algorithm is well-known, the only “secret” part is the 56-bit key that is 
used. Hence a DES ciphertext can be decrypted by a brute force technique of try- 
ing all possible 56 bit keys, of which there are 2*°. Even assuming that on the aver- 
age one would hit the right key after trying about half the possibilities, this is still 
25 binary strings to generate and test in the DES algorithm. This number used to 
be considered so impossibly large that DES encoding was deemed perfectly se- 
cure, and the U.S. government insisted for many years that it was “unbreakable.” 
But with faster computers and using parallelism, it's possible to find the key in a 
matter of hours. 

In recognition of this potential weakness, a new encoding scheme called AES 
(Advanced Encryption Standard) was adopted by the U.S. National Institute of 
Standards and Technology in 2001 after a five-year design competition. AES is 
also a block encryption scheme, but it uses a key length of 128 bits or more. (AES 
also uses a form of the Euclidean algorithm, discussed in Section 2.3, the idea of 
relatively prime numbers, discussed in Section 2. 4, arithmetic modulo и, illustrat- 
ed in the first few exercises at the end of this section, and ideas from Section 9.1.) 

A disadvantage of both DES and AES is that they are symmetric encryp- 
tion (also called private key encryption) schemes. The same key is used to both 
encode and decode the message; for example, in a Caesar cipher, the key is the 
amount of shift k, and decoding is accomplished by shifting left instead of right. 
In a private key encryption scheme, both the sender and receiver must know the 
key. The problem of securely transmitting a message turns into the problem of 
securely transmitting the key to be used for the encryption and decryption. 

Asymmetric encryption (public key encryption) schemes use different 
keys for encoding and decoding. The decryption key cannot be derived in any 
practical way from the encryption key, so the encryption key can be made public. 
Anyone can send a message to the intended receiver in encrypted form using the 
receiver's public key, but only the intended receiver, who has the decryption key, 
can decode it. The best-known asymmetric encryption scheme is the RSA public 
key encryption algorithm, named for its developers Ron Rivest, Adi Shamir, 
and Len Adleman. RSA uses the mod function, as well as the Euler phi function 
discussed in Section 2.4, to produce secure public keys. The RSA method works 
as follows: 


1. Two large prime numbers p and q (on the order of 200 digits each) are 
chosen at random (such numbers can be found relatively easily), and 
р`9 = п 15 computed. 
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2. RSA, like DES and AES, is a block cipher, so let us assume that a block 
B of text has been encoded by some invertible function g into an integer T 
(much as is done in the Caesar cipher) where T is an integer, 0 < T < n. 

3. The Euler phi function (n) is computed. Because n is the prod- 
uct of two primes, we know from Equation (2) of Section 2.4 that 
p(n) = (p — Iq — D. 

4. A value е is chosen with 1 < e < (n) such that e and (п) are relatively 
prime, that is, gcd(e, ф(п)) = 1. (This will surely be true if e is itself a 
prime number.) 

5. gcd(e, ф(п)) = 1 means that 1 can be written as a linear combination of e 
and ф(л) 


d-e+f-g(n) = 1 
or 
d:e = 1 mod g(n) 


The Euclidean algorithm can be used to find unique values for d (and f ) 
such that 1 < d < q(n). 

6. The public key is the pair of values (л, e). T is encoded by computing 
T^mod n. 

7. The private key is the pair of values (n, d), of which only the “d” part 
is secret. The received message T° mod n is decoded by computing 
(T* mod n)! mod n. 


(7° mod n)? mod n = (T5)! mod n by Equation (1) 
= T“modn with d: e = 1 mod g(n) 


and it happens that this expression equals 7. (The proof depends on two 
classic results from number theory called Fermats5 little theorem and the 
Chinese remainder theorem. See Exercises 37-41.) 

8. Then g (Т) = B, the original text block. 


Because n and e are public, anyone who could factor n into its prime factors р 
and q could then reconstruct the decryption key d and decode the message. How- 
ever, there is no efficient algorithm to find the prime factors of a (large) n. Hence, 
while the RSA cryptosystem is not technically secure, it is practically so, although 
a number of demonstration "factor attacks" have been made using many people 
working together via PCs and the Internet. 

Although public key encryption solves the “Кеу distribution" problem of shared 
keys between sender and receiver, it is a relatively slow encryption/decryption 
method. Hence in applications such as financial transactions over the Internet, public 
key encryption is often used for secure transmission of keys for private key commu- 
nication, which is then used to transmit the actual message. The user's browser may 
send a request to the Web server for the server's public encryption key. Upon receipt, 
the browser uses this public key to send an encrypted message back to the server 
that contains a symmetric key for DES encryption. At this point both the browser 
and the server share a secret key that has been securely transmitted, and the rest of 
the transaction can be carried out using the faster DES encryption. 
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Using RSA encryption/decryption, let p = 17 and q = 13. Then л = 221 and 
q(n) = 16:12 = 192. Pick e = 11; e and ф(л), 11 and 192, are relatively prime. 
Using the Euclidean algorithm, with 11 and 192, 


192 = Ig hil se S or 5 = 192 — 17-11 
11= 2-5 +1 ог 1 = 11 = 225 
Se Sell + (0 


from which 
1=11-2:5 = 11 = 2/192 – 17-11] = 35-11 = 2: 192 


which gives a value for d of 35. If the integer Т = 8, then 8 is encoded as follows, 
where the tedious modulo arithmetic can be broken down using Equation (1). The 
encoding key 15 (и, е) and both values are used in the computation. 


8!! mod 221 = 8?- 8°. 8?- 8? mod 221 
= 512: 512: 512 · 64 mod 221 
= 70: 70 · 70 · 64 mod 221 
= 4900 · 4480 mod 221 
= 38 · 60 mod 221 
= 2280 тоа 221 
= 70 


To decode ће encrypted value 70, compute 


7055 mod 221 = (702)!7- 70 mod 221 = (4900)!7- 70 mod 221 

= 38!7-70 mod 221 = (38255 - 38 - 70 mod 221 
(1444)? · 38 - 70 mod 221 = (118)*- 38 · 70 mod 221 
= [(118?]*- 38 - 70 mod 221 = (13924)4 - 38 - 70 mod 221 
= 14-38-70 mod 221 = 2660 mod 221 = 8 


which was the original encoded integer 

This is not a realistic example because р and q (and л) are relatively small. 
But even here the computations are quite tedious. Again, a spreadsheet and its mod 
function will be helpful. e 


PRACTICE 47 | Assume you receive a ciphertext message of 166 that was encoded using your public key 
(п, e) from Example 54. Decode the message to obtain the original integer Т. 


Hashing for Password Encryption 

The user of a computer system typically has to enter a user ID and password to 
authenticate him or her as a legitimate user, a person entitled to use these comput- 
ing resources. The list of user IDs and corresponding passwords must be stored 
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somewhere in the computer system, and this information is obviously sensitive. 
Anyone with a copy of the password file has open access to the computer and could 
even use the ID/password that has the highest level of computer privileges (such as 
the system administrator ID/password). Therefore the password file, like the se- 
cret messages discussed earlier, must be protected. The operating system will, by 
default, store the password file with the highest level of protection so that only a 
system administrator account has access. But as an added protection, the file will 
be encrypted in some way. A password file might have entries something like this: 


User Ір | Encrypted PW 


jgarcia ax*79% 


wbriggs | ee&46# 


Unlike secret messages being transmitted, the password file is in constant use. 
The operating system must be able to quickly authenticate a given user by check- 
ing that the password the user enters matches the password stored for that user 
ID. When the user ID jgarcia is entered, one approach would be to find jgarcia 
in the password file, decrypt the corresponding encrypted password (ax*79% in 
this case) and see if the result matches what the user entered. But this means that, 
however fleetingly, a legitimate “clear” password would be stored within the com- 
puter for this user ID, and might be captured by someone hacking the system at 
that moment, perhaps even someone posing as the jgarcia user and guessing at the 
correct password. 

A better approach is to apply the encryption key (also stored in the system) to 
the password entered and then check the resulting encrypted password against the 
table entry of ax*79%. A match occurs only if the correct password was entered. If 
a hacker’s attempt fails, he or she does gain the knowledge that what they entered 
is not the jearcia password, but that is of no help in finding the correct password. 

Better still is to use a form of encryption that does not require storing an 
encryption key. A hash function is often used to encrypt passwords. The ideal 
cryptographic hash function / has two characteristics: 


1. Given x, it is easy to compute the hashed value A(x). 
2. Given a hashed value z, it is difficult to find a value x for which h(x) = z. 


Because of these characteristics, a hash function is also called a one-way 
encryption. The password file would now have entries such as 


User ID Hashed PW 


jgarcia h(password 1) 


wbriggs | h(password 2) 


Property (2) means that if the password file falls into the wrong hands, it is of 
little use in trying to break into the system, even if the hash algorithm being used 
is known. Hence securing the password file is no longer a concern. Property (2) 
also means that a cryptographic hash function is likely to be more complex than a 
hash function used to build a hash table for searching, as discussed earlier. There 
are many well-known encryption/hashing functions, most of which involve use of 
the modulo л function where л is some power of 2. 
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There is the slight possibility that user A and user B could pick different 
passwords that hash to the same value, or even that A and B pick the same pass- 
word. As far as ordinary usage is concerned, this doesn’t matter; both A and B 
will be authenticated as legitimate users. However, if A steals the password file 
(no longer securely protected), A could notice that user B’s password hashes to 
the same value as 4°, and if indeed the passwords are the same, А could log 
in as B and do damage to B’s files. Most password encryption schemes append 
some sort of timestamp to a password when the password is first created, and 
(password + timestamp) is encrypted. If A and B choose the same password, 
their timestamps (and hashed values) will differ. The timestamp is stored in 
the password file along with the user ID and when the user enters his or her 
password, the timestamp is appended and the result hashed and compared to 
the table entry. 


Miscellaneous Applications 


Identification Codes 
Sometimes codes are used for identification purposes, not the “secret code” 
purposes we discussed earlier. 


An International Standard Book Number, or ISBN, is a numeric identification 
code associated with a published book. The 10-digit ISBN standard was adopted 
in 1970, but it was replaced by a 13-digit ISBN standard in 2007. Books published 
since 2007 generally carry both a 10-digit code and a 13-digit code. 

The ISBN-10 is written as four blocks of digits separated by hyphens or blanks. 
Reading left to right, the first block of digits is a group identifier for a country, area, 
or language area participating in the ISBN system, the second block of digits iden- 
tifies the publisher within that group, and the third block is the number assigned by 
the publisher to this particular work. The final block consists of a single digit from 
0-9 or a single X to represent 10. If the first nine digits of the ISBN-10 are 


a 10503040 506070809 


then the tenth digit, or check digit, C, is computed by the formula 


9 


С= КОТ w 


nel 


—that is, sum up the product of each digit times its position in the list and to the 
result apply the modulo 11 function. 

For example, in the ISBN-10 0-394-80001-X, the 0 indicates the English 
language group, the 394 identifies the publisher as Random House, the 80001 
identifies the title The Cat in the Hat by Dr. Seuss, and the check digit X (10) is 
computed from 


[I e) d 2:031 de 8» 9 apod 3E Sea sr» sr 7 «(x S0 se 9» Il] aryl = 
98 mod 11 = 10 


The purpose of the check digit is to detect certain types of errors in the ISBN code, 
such as a mistyped digit or a transposition of two digits. 


PRACTICE 48 
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In the ISBN-13, there is an additional block of three digits in the front that 
identifies the type of industry. For a published book, this block is always 978. The 
next three blocks agree with the ISBN-10, but the final block, again a check digit, 
is computed differently. If the first 12 digits of the ISBN-13 are 


A143444 5A 6A 7Agdod 10411812 


then the 13th digit, the check digit C, is computed from the 12 digits by the formula 


6 5 
хо |тоо 0 
i=1 i=0 
—that is, three times the sum of all the even digits + the sum of all the odd digits 
plus the check digit should give a multiple of 10. 

The ISBN-13 for The Cat in the Hat is 978-0-394-80001-1. The check digit C 
(digit 13) is computed from 


Dar 9^7 «9 че 3° O ae 3 4F S59) 4p di ap Sew че 
ap 32 s- Oar 32 il ae C= 99 ab (C 


so C = 1 to make a total of 100, a multiple of 10. ө 


a. If an ISBN-10 begins with 0-534-37488, what is the check digit? 
b. If digits in this number are transposed to 0-534-37848, what is the check digit? ш 


The UPC-A (Universal Product Code) is the common bar code found on all 
goods in stores (food, magazines, and so on). It is an encoding of a 12-digit num- 
ber, where the first digit is a sort of classification digit, the next five digits are a 
manufacturer code, the next five digits are the product code, and the last digit is a 
check sum digit. Each digit is assigned a 7-bit binary code, but this is not the usual 
binary number representation; for example, the binary code for 5 is 0110001. The 
7-bit code for any single digit is represented by a pattern of vertical bars and spaces 
(space-bar-space-bar if the digit occurs in the left half of the 12-digit string, bar- 
space-bar-space if it occurs in the right half). The spaces and bars can have varying 
widths from 1 to 4, 1 being the thinnest and 4 the thickest. If a 5 occurs in the left half 
of the 12-digit string, its binary code would be represented by a space of width 1, 
a bar of width 2, a space of width 3, and a bar of width 1. 


EXAMPLE 58 
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If a 5 occurs in the right half, the same binary code would be represented by 
a bar of width 1, a space of width 2, a bar of width 3, and a space of width 4. In 
addition, there are some extra bits inserted at each end and in the middle for a total 
of 95 bits. 

The check digit C (digit 12) is computed from the previous 11 digits as follows: 


5 5 
З + > ay, + C modio = @ 
ї=0 i=1 


—that is, three times the sum of all the odd digits plus the sum of all the even digits 
plus the check digit should give a multiple of 10. This formula is similar to the one 
for the ISBN-13 check digit except that for the ISBN-13 it is the even digits that 
are multiplied by 3 and for the bar code it is the odd digits. ө 


The American Bankers Association (ABA) devised a code to represent a specific 
financial institution. This number was originally designed for paper checks, and 
it does appear on the bottom of paper checks, but today it is also used for such 
transactions as direct deposit or electronic funds transfer. This routing number 
consists of 8 digits plus a check digit. The first four digits represent Federal Re- 
serve information connected with the financial institution and the second four 
digits identify the institution itself. The check digit C is computed by 


2 2 1 
З F TY Cso Ir lS Oyaa ЕЕ (G mod 10 — 0 
i=0 i=0 i=0 


—that is, digits 1, 4, and 7 are multiplied by 3 and added, digits 2, 5, and 8 are 
multiplied by 7 and added, and digits 3 and 6 are multiplied by 1 and added. The 
total plus the check digit should be a multiple of 10. ө 


Generating and Decomposing Integers 

The modulo function provides an easy way to generate integer values within some 
range 0 through n — 1 for some positive integer n. Take any positive integer m and 
compute m mod п. If you have a function to generate a random (or pseudorandom) 
integer т, this process generates a random (or pseudorandom) integer within the 
desired range. You may also want to cycle through the integers in this range in a 
controlled fashion. 


You want to show a sequence of five images on your Web page. The images are 
stored in an arraylike structure as /mage(1), ..., Image(5), and the actual image 
to be displayed is stored in the variable DisplayImage, which is initialized to 
Image(1): 


i=1 
DisplayImage = Image(i) 
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Image(1) is the first image displayed. After that, at every clock tick (or at pre- 
defined intervals), the statements 


i=imod5 +1 
DisplayImage = Image(i) 


are executed. The effect 15 to cycle the index i through the values 2, 3, 4, 5, 1, and 
so on, and to cycle the displayed images as /mage(2), Image(3), Image(4), Im- 
age(5), Image(1), and so on. e 


The modulo function can also be used to decompose a multidigit integer into 
its component digits. 


To decompose a three-digit integer into the ones, tens, and hundreds digits, one can 
use the following algorithm: 


temp — number 
ones = temp mod 10 
temp — (temp — ones)/10 


tens — temp mod 10 
temp — (temp — tens)/10 
hundreds — temp ө 


PRACTICE 49 | Given the integer 375, walk through the algorithm of Example 59 to separate the digits. „ 


EXAMPLE 60 


Modular Arithmetic Designs 

The mod function can be used to make interesting quiltlike patterns by arranging 
“tile” images (small square images) in tables based on addition modulo л and then 
repeating these tables to form the completed quilt image.* Addition modulo п is 
defined on the set of integers (0, 1, 2, ..., n — 1} by 


x +„у = (x t y) mod 


In this example we will use addition modulo 6. Addition modulo 6 applies to the 
integers (0, 1,2,3,4, 5) using the rule x +, y = (x + y) mod 6. The addition ta- 
ble follows. It shows, for example, 


3 +64 = (3 + 4) mod 6 = 7mod6 = 1 


‘This section on modular arithmetic designs was adapted with permission from http://britton.disted.camosun. 
bc.ca/modart/jbmodart2.htm; the software used to generate the design is Cayley Quilter, available to down- 
load at http://www.wou.edu/-burtonl/cquilter.html 
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No values of 6 appear in the table, and each row is a circular left shift of the previ- 
ous row. 


213 14 

22- | S Ida 

2 || 3 | 4]5 
2 |219 | ЛЕ a | O] t 
эол 2 
4|4|\5|0|1|2([3 
Sia ү» |“ 


We will need six Ше images of various geometric designs: 


re P" Lay Ve 


Tile 0 Tile 1 Tile? Tile 3 Tile 4 Tile 5 


The tiles are then arranged in a 6 X 6 table according to the addition modulo 6 
pattern, that is, the interior of the addition modulo 6 table. The top row shows im- 
ages left to right of 0, 1, 2, 3, 4, 5, the next row shows images left to right of 1, 2, 
3, 4, 5, 0, and so forth. 


The final image is composed of four copies of the table arranged in a 2 X 2 grid 
where the original table goes in the upper-left quadrant, its horizontal reflection 
goes in the upper-right quadrant, and the bottom two quadrants are vertical reflec- 
tions of the top half. 
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SECTION 5.6 REVIEW 


TECHNIQUES (2) Perform encryption and decryption using the RSA 


* Perform calculations using the mod function. ВО Ои 


К А i i e Compute the check digit for the ISBN-10, 
Build or search a hash table that uses linear probing ISBN-13, UPC-A, and ABA codex: 
for collision resolution. 


D Build or search a hash table that uses chaining for MAIN IDEAS 


collision resolution. — А 
* Encode and decode using a Caesar cipher. * The humble modulo function is useful in many im- 


* Apply the algorithm to compute a circular left shift portant applications. 


of a binary string. 


EXERCISES 5.6 
Exercises 1—8 concern modular arithmetic mod и. 


1. 25 = 11 + 14. Show by computing each expression that 
25 mod 6 = (11 mod 6 + 14 mod 6) mod 6 
2. 395 = 129 + 266. Show by computing each expression that 
395 mod 4 = (129 mod 4 + 266 mod 4) mod 4 
3. 262 = 74 + 188. Show by computing each expression that 


262 mod 13 = (74 mod 13 + 188 mod 13) mod 13 


10. 


11. 


12. 


13. 
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. Prove that for any integers x and y, 


(x + y) mod n = (x mod n + y mod n) mod n 


. 486 = 18 - 27. Show by computing each expression that 


486 mod 5 = (18 mod 5 - 27 mod 5) mod 5 
7067 = 191 - 37. Show by computing each expression that 


7067 mod 8 = (191 mod 8 - 37 mod 8) mod 8 


. Prove that for any integers x and y, 


(x - y) mod n = (x mod n · y mod n) mod n 


. Prove or disprove: For any positive integer x, (—x) mod n = —(x mod n). 


. Using the hash function of Example 51, which of the following Social Security numbers would cause a 


collision with 328356770, the Social Security number of that example? 
a. 060357896 
b. 896137243 
c. 712478993 
d. 659027781 


Find a set of five numbers in the range [0, 200] that cause 100% collision using the hash function 
h(x) = x mod 13 


Using a hash table of size 11 and the hash function x mod 11, show the results of hashing the following 
values into a hash table using linear probing for collision resolution: 


1, 13, 12, 34, 38, 33, 27, 22 


Using the completed hash table from Exercise 11, compute the average number of comparisons needed to 
perform a successful search for a value in the table. 


When a computer program is compiled, the compiler builds a symbol table for storing information about 
the identifiers used in the program. A scheme is needed to quickly decide whether a given identifier has 
already been stored in the table and, if not, to store the new identifier. A hash function is often used to 
locate a position in the table at which to store information about an item. 

For simplicity, assume that the items to be stored are integers, that the hash table can hold 17 items in 
positions 0—16, and that the hash function h(x) is given by h(x) = x mod 17. 

Linear probing is used for collision resolution. 


a. Using the hash function and collision resolution scheme described, store the sequence of values 23, 14, 
52, 40, 24, 18, 33, 58, 50. Give the location in the table at which each is stored. 


b. After the table of part (a) has been filled, describe the process to search for 58 in the table. Describe the 
process to search (unsuccessfully) for 41 in the table. 


14. 
15; 


16. 


17. 


18. 


19, 


20. 


2 


— 


22. 


23. 
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Explain what problem can arise if an item stored in a hash table is later deleted. 


A disadvantage of hashing with linear probing for collision resolution is that elements begin to cluster 
together in groups of adjacent array locations. Assume that you have a very good hashing function (it dis- 
tributes elements evenly throughout the hash table). Start with an empty hash table of size ¢ that will store 
data using linear probing for collision resolution. 


a. What is the probability of hashing the first element to location p (and storing it there, since it is the first 
item and there will be no collisions)? 


b. Once location p is occupied, what is the probability of storing the second item in location p + 1 
(modulo the table size)? 


с. Once locations р and р + 1 are occupied, what is the probability of storing the third item in location 
p + 2 (modulo the table size)? 


Generalize the answers to Exercise 15 to explain why using linear probing for collision resolution causes 
clustering. 


Decode the following ciphertext messages that were encoded using a Caesar cipher with the given key. 
a. JUUBFNUUCQJCNWMBFNUU, k = 9 

b. XAEWFVMPPMKERHXLIWPMXLCXSZIWHMHKCVIERHKMQFPIMRXLIAEFI, k = 4 

c. IURSAYZGXJCOZNZNKQTOLKOTZNKROHXGXE, k = 6 

Using a Caesar cipher, encode the following plaintext messages. 

a. ATTACK FROM BEYOND THE RIVER, k = 5 

b. WE ARE SHORT OF SUPPLIES, k = 10 

c. BADLY NEED AMMUNITION, & = 8 


The following ciphertext message is intercepted; you suspect it is a Caesar cipher. Find a value of k that 
decodes the message, and give the corresponding plaintext. 


BUNNY WXFFNVJALQXWAXVNCXVXAAXF 


The following ciphertext message is intercepted; you suspect it is a Caesar cipher. Find a value of k that 
decodes the message, and give the corresponding plaintext. 


EQOKAGAZRMOQNAAWPGPQ 


. Use the algorithm of Example 53 to compute the 1-bit circular left shift of the following binary strings 


(write the bit strings for x, p, q, and so on). 
a. 10011 
b. 0011 


Use the algorithm of Example 53 to compute the 1-bit circular left shift of the following binary strings 
(write the bit strings for x, p, q, etc.). 


a. 10110 
b. 1110 
Consider a "short form" of DES that uses 16-bit keys. Given the 16-bit key 


1101000101110101 


as input to a DES round that uses a circular left shift of 2 bits, what would be the key for the next round? 


24. 


25; 


26. 
23. 


28. 


29. 


30. 


3 


- 


32. 


39. 
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Describe how to perform a 1-bit right circular shift on a 4-bit binary string x using the left circular shift 
operation. 

Using RSA encryption/decryption, let p = 5 and q = 3. Then n = 15 and q(n) = 4 · 2 = 8. Picke = 3. 
a. Use the Euclidean algorithm to find the value of d. 

b. Encode Т = 8 using the public key (и, е). 

c. Decode your answer to part (b) to retrieve the 8. 

Why is the RSA encryption of Exercise 25 a poor choice? 

Using RSA encryption/decryption, let p = 5 and q = 11. Then л = 55 and g(n) = 4 · 10 = 40. 
Pick e = 7. 

a. Use the Euclidean algorithm to find the value of d. (Hint: If the Euclidean algorithm produces an equa- 


tion 1 = x - e + f: e(n)where the value of x is negative, add and subtract the product e y(n) to the 
right side of the equation to get a positive value for d.) 


b. Encode Т = 12 using the public key (л, e). 

c. Decode your answer to part (b) to retrieve the 12. 

Using RSA encryption/decryption, let p = 23 and q = 31. Then n = 713 and y(n) = 22 · 30 = 660. 

Pick e = 17. 

a. Use the Euclidean algorithm to find the value of d. 

b. Encode T = 52 using the public key (7, e). 

c. Decode your answer to part (b) to retrieve the 52. 

a. All п people in a group wish to communicate with each other using messages encrypted with DES or 
AES. A different secret key must be shared between each pair of users. How many keys are required? 

b. АП n people in a group wish to communicate with each other using messages encrypted with a public 
key encryption system. How many keys are required? 


Computer users are notoriously lax about choosing passwords; left to their own devices, they tend to pick 
short or really obvious passwords. At Simpleton University, passwords must contain only lowercase let- 
ters, and the campus computer system uses an (unrealistically simple) cryptographic hash function given 
by the following algorithm: 


1. Letters in the password are converted to an integer equivalent (a — 1, b — 2, and so on). 


2. In the result of step 2, all individual digits are added (for example, 17 becomes 1 + 7) to give an 
integer value x. 


3. h(x) = x mod 2? 
Joe Hack has managed to steal the password table, and he notices that there is a password entry of 20 for 


bsmith. Joe decides to try to hack into the system as bsmith by guessing bsmith s password. Can you guess 
bsmith 5 password? 


‚а. The ISBN-10 of the sixth edition of this book is 0-7167-6864-C where C is the check digit. What is the 


check digit? 
b. What is the ISBN-13 of the sixth edition? 


A bookstore placed an order for 2000 copies of Harry Potter and the Deathly Hallows, the seventh and 
final volume in the hugely popular Harry Potter series by J. K. Rowling. When placing the order with the 
publisher, the ISBN-13 978-0-545-01022-5 was used. Is this correct? 


Given the 11 digits 02724911637, compute the check digit for the UPC-A code. 
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34. A taxpayer wants his tax refund to be deposited directly to his bank. He enters the bank’s routing number, 
including the check digit, as 


025107036 


Is this routing number correct? 
35.a. Write an algorithm to decompose a four-digit integer into the ones, tens, hundreds, and thousands 
digits. 
b. Apply this algorithm to decompose the integer 7426. 


36. The following quilt image is based on addition modulo n for what value of n? 


OOOX 


Exercises 37—41 involve a proof of step 7 of the RSA method. 


37. Prove that if x = y (mod n) and c is a constant integer, then xc = yc (mod n). 


38. This exercise explores the converse of Exercise 37, which is the issue of cancellation under congru- 
ence modulo n. In other words, if xc = yc (mod n) for some constant integer c, is it then true that 
x = y (mod n)? Not always, as it turns out. 


a. Prove that if gcd(c,n) = 1, then xc = yc (mod n) implies x = y (mod n). 


b. Prove that хс = yc (mod n) implies x = y (mod n) only if ged (c,n) = 1. To do this, it is easier to 
prove the contrapositive: 


If gcd (c, n) = 1, then there exist integer values x and y for which xc = yc (mod n) but x # y mod n. 


(Hint: Suppose c = mk and n = mk with k > 1. Consider x = mk and y = т».) 
c. Find values for x, у, c, and n where xc = yc (mod n) but x # y (mod n). 
39. If p is a prime number and a is a positive integer not divisible by p, then 


а?! = 1(mod p) 


40. 


4 


— 
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This result is known as Fermat5 little theorem (as opposed to the very famous Fermat’s last theorem men- 
tioned in Section 2.4). 
Let S = {0, а, 2a, ...,(p — Dal T = (0, 1,2, ..., (p — 1)}. Let f be given by f(ka) = (ka) mod p; 
that is, f computes the residue modulo p. 
. Prove that fis a one-to-one function from S to T. 
. Prove that f'is an onto function. 
. Prove that [а · 2a : (р — 1)a] mod p = (p — 1)! mod p 
. Prove that a? ^! = 1 (mod p) 
. Let a = 4 and p = 7. Compute the set of residues modulo p of (4, 8, 12, ..., 24}. 


c о с gp 


m 0 


Let a = 4 and p = 7. Show by direct computation that 4° = 1(mod 7). 


Let m, m, ...,M, be pairwise relatively prime positive integers (that is, gcd(m,,m,) = 1 for 
|!-iik-zni-zk),letm = mm;:::m,, and let a), а, ..., a, be any integers. Then there is an integer 
x such that 


a, (mod m,) 


а, (mod m;) 


à, (mod m,) 


x 


and any other integer y that satisfies these relations is congruent to x modulo m. This result is known as 
the Chinese remainder theorem (based on work done by the Chinese mathematician Sun-Tsu in the first 
century AD). The following steps will prove the Chinese remainder theorem. 


а. Let s and t be positive integers with gcd(s, f) = 1. Prove that there exists an integer w such that 
sw = 1 (mod ?). 


b. For each i,1 = i < n, let M; = m/m;. Prove that gcd(M;, m) = 1. 

c. Prove that there is an integer x; such that Mx; = 1 (mod m,) and a;My; = а, (той т. 
d. Prove that a,M,x, = 0 (mod m,) for all К = i. 

. Let x = a,Myx, + М + +++ + a, M,x,. Prove that for 1 <i S n, x = а, (тоа т). 


Let y be such that y = а, (той m), 1 = i = n. Prove that x = y (mod m). (Hint: Use the fundamental 
theorem of arithmetic and write m as a product of distinct primes, m = php ... p^). 


m 0 


. The remaining step in the proof of the RSA algorithm is to show that if d: e = 1 mod g(n), then 


Т тоа п = Т. 
a. Prove that T^" can be written as 7(T’~'/4— or as T(T4- J^ for some integer k. 


b. Prove that if T is not divisible by p, then T“ = T (mod p), and if T is not divisible by q, then 
Т“ = T (mod д). 


c. Prove that if p| T, then Т“ = T (mod p) and T is not divisible by q, so T°’ = T (mod q) by part (b). 
d. Prove that if 4| T, then T” = T (mod q) and T is not divisible by p, so Т“ = T (mod p) by part (b). 
e. From parts (b)-(d) T*^ = T (mod p) and Т“ = T (mod q) in all cases. Prove that T” mod n = T. 


| 446 Relations, Functions, and Matrices 


SECTION 5.7 | MATRICES 


Terminology 


Data about many kinds of problems can often be represented using a rectangular 
arrangement of values; such an arrangement is called a matrix. Thus 


1 0 4 
з —6 8 
is a matrix with two rows and three columns. The dimensions of the matrix are 
the number of rows and columns; here A is a 2 X 3 matrix. 
Elements of a matrix А are denoted by ау, where i is the row number of the el- 


ement in the matrix and is the column number. In the example matrix А, a»; = 8 
because 8 is the element in row 2, column 3, of A. 


Average temperatures in three different cities for each month can be neatly sum- 
marized in a 3 X 12 matrix. Here we interpret the 3 rows as the 3 cities and the 
12 columns as the 12 months January-December. The average temperature in the 
third city in April, a34, is 67. 


259 2 39 dy So U Ж A G9 5 39 3$ 
А = |14 21 33 38 44 57 61 59 49 38 25 21 
35 46 54 67 78 86 91 94 89 75 62 51 e 


In Practice 2(c) of Section 5.1, the binary relation {(7, 9), (2, 5), (9, 9), (2, 7)} was 
defined on the set S = (2, 5, 7, 9}. Although a set is unordered, we can impose 
an ordering on the elements of S so that 2 is element 1 in S, 5 is element 2, and so 
forth. The matrix R below represents this binary relation by a 1 entry in position 
i, j if element i in set S is related to element j. Because (7, 9) says that element 3 is 
related to element 4, element ғ; 4 of R equals 1. 


rs 
Il 
D © em) © 
SS (m к 
S em) OG iS 
_ =. oO о 


Solutions to many problems can be obtained by solving systems of linear equa- 
tions. Suppose, for example, that you are placing an order for coffee beans for your 
sidewalk café. You want to order 70 pounds of beans, a mixture of Kona coffee 
beans and Colombian coffee beans. You are willing to spend $1180; Kona cof- 
fee costs $24 per pound, and Colombian coffee costs $14 per pound. How many 
pounds of each should you order? 
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The constraints in this problem are represented by the system of linear 
equations 


x+ y= 70 
24x + 14y = 1180 


These are linear equations because each of the two unknowns, x and y, appears 
only to the first power. (Also an equation such as ax + by = c represents a straight 
line when graphed on an х—у coordinate system.) The solution to this system of 
equations is x = 20, у = 50 (you can easily check that this is a solution). The 


matrix 
а u 
А == 
24 14 
is the matrix of coefficients for this system of linear equations. As we will see, the 
matrix of coefficients can be used to solve a system of linear equations. ө 
PRACTICE 50 | In the matrix: 
a 1 4 —6 8 
SO =y 
What is a3? What is 424? What is a3? a 


In a matrix, the arrangement of the entries is significant. Therefore, for two 
matrices to be equal, they must have the same dimensions and the same entries 
in each location. 


| EXAMPLE 64 | 64 Let 


x 4 
Х=|1 у 
g (qq 
3 4 
Y-|1 6 
2 W 
If X = Y, then x = 3,y = 6,z = 2, and w = 0. e 


We will often be interested in square matrices, in which the number of rows 
equals the number of columns. If A is an n X n square matrix, then the elements 
411, 025, ... » Any form the main diagonal of the matrix. If the corresponding ele- 
ments match when we think of folding the matrix along the main diagonal, then 
the matrix is symmetric about the main diagonal. In a symmetric matrix, а; = аң. 
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| EXAMPLE 65 | The square 3 X 3 matrix 


э m 
@ 2 
2 6 


> 
ll 
- Nn =l 


is symmetric. The upper triangular part (the portion above the main diagonal) is a 
reflection of the lower triangular part. Note that a,, = a = 5. e 


A more general way to represent arrangements of data is the array. Arrays are 
n-dimensional arrangements of data, where n can be any positive integer. If n = 1, 
then the data are arranged in a single line, which is therefore a list or finite se- 
quence of data items. This one-dimensional version of an array is called a vector. 
If n = 2, the array is a matrix. If n = 3, we can picture layers of two-dimensional 
matrices. For n > 3, we can formally deal with the array elements, but we can't 
really visualize the arrangement. The array data structure is available in many 
high-level programming languages because it is such a useful way to represent 
data in list form or tabular form. Generally, the number of elements expected in 
each dimension of the array must be declared in the program. The array X of Ex- 
ample 64, for instance, would be declared as a 3 X 2 array—a two-dimensional 
array (matrix) with three elements in one dimension and two in the other (that 15, 
three rows and two columns). 


Matrix Operations 


Although matrices are particular arrangements of individual elements, we can 
treat the matrices themselves as objects, just as we can treat sets of elements as 
objects. In each case we are abstracting up one level and looking at the collection 
as an entity, rather than looking at the individual elements that make up the col- 
lection. We defined operations on sets (union, intersection, and so forth) that made 
sets useful for solving counting problems. We can define arithmetic operations on 
matrices whose entries are numerical. These operations make matrices interesting 
objects to study in their own right, but they also make matrices more useful for 
certain tasks such as solving systems of equations. 

The first operation, called scalar multiplication, calls for multiplying each 
entry of a matrix by a fixed single number called a scalar. The result is a matrix 
with the same dimensions as the original matrix. 


| EXAMPLE 66 | GA The result of multiplying matrix 


by the scalar r — 3 is 
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Addition of two matrices A and B is defined only when A and B have the 
same dimensions; then it is simply a matter of adding the corresponding elements. 
Formally, if A and B are both n X m matrices, then C= A + B is an n X m 
matrix with entries 


Cy = ay + by 
For 
il 3 © ф =2 8 
A= 2 M A В = |1 5 2 
—4 5 il 2 ONES 
the matrix A + B is 
1 14 
А +В = 3 5 6 
— & 4 ө 
PRACTICE 51 | For r = 2, 
IX 4 0 
A=|-3 4 B= 9 2 
oy == 4 
find гА + В. Г 


Subtraction of matrices is defined by A — В = A + (—1)В. 

In a zero matrix, all entries are 0. If we add ann X m zero matrix, denoted 
by 0, to any n X m matrix A, the result is matrix A. We can symbolize this by the 
matrix equation 


0+A=A 
This equation is true because of a similar equation that holds for all the individual 


numerical entries, 0 + a; = a;. Other matrix equations are also true because of 
similar equations that hold for the individual entries. 
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| EXAMPLE 68 | БЕШ If A and B are n X m matrices and r and s are scalars, the following matrix 


equations are true: 


0+А=А 
А+В=В+А 

(A+ B) + C - A + (B + C) 
r(A + B) = rA + rB 

(r + s)A =rA + sA 
r(sA) = (rs)A 


To prove that А + B = B + A, for instance, it is sufficient to note that 
а + bj = by + aj for each entry in matrices A and B. e 


One might expect that multiplication of matrices would simply involve mul- 
tiplying corresponding elements in the two matrices, but the definition is more 
complicated than that. The definition of matrix multiplication is based on the use 
of matrices in mathematics to represent functions called linear transformations, 
which map points in the real-number plane to points in the real-number plane. 
Although we won't use matrices in this way, we will use the standard definition 
for matrix multiplication. 

To compute A times B, А · B, the number of columns in A must equal the 
number of rows in B. (This requirement means that the number of elements in a 
single row of A equals the number of elements in a single column of B.) Thus we 
can compute А · B if A is an n X m matrix and B is an m X p matrix. The result 
isan n X p matrix. An entry in row i, column j of A : В is obtained by multiplying 
all the elements in row i of A by the corresponding elements in column j of B and 
adding the results. Formally, А. B = C, where 


m 


Cy = >, agby 
k=1 
EXAMPLE 69 IE; 
S 3 
А = 2 к, В= |2 2 
м =i 2 К 
@ 5 


А 15а2 Х 3 matrix and B 15а3 X 2 matrix, so ће product А · В exists and is a 
2 X 2 matrix C. To find element су, we multiply corresponding elements of row 1 
of A and column 1 of B and add the results. 


2(5) + 4(2) + 3(6) = 10 + 8 + 18 = 36 
3 

E 4 J 

A =i 2 


3 = 

PE 
6| 5 

Element с» is obtained by multiplying corresponding elements of row 1 of A and 

column 2 of B and adding the results. 


Nn 


N 
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E j J à T j 
а к= m = = 
6 15 
The complete product is 
[ 4 JE а a 
A cub» 30 20 
6 e 


PRACTICE 52 | Compute А. B and B- A for 


From Practice 52 we see that even if A and B have dimensions so that both 
А. B and B - A are defined, А · B need not equal В. A. There are, however, sev- 
eral matrix equations involving multiplication that are true. 


Where A, B, and C are matrices of appropriate dimensions and r and s are scalars, 
the following matrix equations are true: 


A-(B:C) = (A B): C 
АВ + С)=А-В+ А.С 
(A+ ВС=А-С+В.С 
rA : sB = (75)(А · B) 


Verifying these equations for matrices of particular dimensions is simple, if 
tedious. ө 


The n X n matrix with 15 along the main diagonal and 05 elsewhere is called the 
identity matrix, denoted by I. If we multiply I times any n X n matrix A, we get 
A as the result. The equation 

ГА = А: IISA 


holds. 
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PRACTICE 53 | Let 


Verify that I-A = A-T = A. 


Ann X n matrix A is invertible if there exists an n X n matrix B such that 
A-B=B:-A=I 


In this case B is called the inverse of A, denoted by A’. 


Let 
=| 2. 3 =5 4 = 3} 
A= 2 [| 0 В= | 10 -7 6 
4 —2 5 8 —6 S 


Then, following the rules of matrix multiplication, it can be shown (Practice 54) 
that A-B = B-A = 1 50В = A`'. e 


PRACTICE 54 | For the matrices А and B of Example 70, 


a. Compute A : B. 
b. Compute В. A. н 


It is easy to write an algorithm for matrix multiplication by simply follow- 
ing the definition. A pseudocode version of the algorithm follows, where bracket 
notation A[i, j] replaces the subscript notation aj. 


ALGORITHM MATRIXMULTIPLICATION 


//computes n X p matrix А.В for n X m matrix A, m X p matrix B 
//stores result in C 
for i = 1 to do 
for j — 1 top do 
(pner t 
for k = 1 to m do 
Cyl = Ci] ROSE BUE] 
end for 
end for 
end for 
write out product matrix C 


| Section 5.7 Matrices 453 


The computational steps done in this algorithm are multiplications and ad- 
ditions, one multiplication and one addition each time the statement Cli, j] = 
Cli, j] + Ali, К] * B[k, j] is executed. This statement occurs within a triply nested 
loop and will be executed npm times. (Although this is quite obvious, it can also 
be justified by the multiplication principle as the number of possible outcomes of 
choosing indices i, j, and k.) If A and B are both n X n matrices, then there are 
O(m) multiplications and O(n") additions required. The total amount of work is 
therefore O(m) + Ө(т) = O(n’). 

Given the definition of matrix multiplication, it is hard to see how one could 
avoid O(n’) steps in computing the product of two n X n matrices, but a suffi- 
ciently clever approach does yield an improvement under certain conditions (see 
Exercise 54). 


Gaussian Elimination 


In Example 63 we encountered the following system of two linear equations in 
two unknowns: 


xt y= 70 
24x + 14у = 1180 


The general form for a system of n linear equations in n unknowns is 


OX, 15X? chen dua = Dy 


OX 5»X? +++ ida, = By 


йһХ\ T An2X2 pess AnnXn = b 


n 


with a matrix of coefficients 


а dj) -. Ain 
а 05) ... An 
йа Am +++ Ann 


To solve this system of equations, we first form the augmented л X (n + 1) 
matrix by adding the column of b’s to the matrix of coefficients: 


dj dij .. аһ dy 
аз dy .. d» b, 
ап Ang +++ Ann b, 


(The augmented matrix is simply a convenience to avoid having to write all the 
unknowns.) The next step is to “transform” the augmented matrix into one where 
the matrix-of-coefficients part is an upper triangular matrix, that is, all values 
of this n X n matrix below the main diagonal are 0’s. The result will be a matrix 
of the form 
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Су C15 m Cin di 
0 C25 vig Con d, 
0 0 .. Ca-Da-) Can Ф-1 
0 0 n 0 Cyn d, 


Now we can turn this back into a system of equations of the form 


CX СХ... CiXn а, 
0 Соух)... СХ, а, 
0 0 + Сп= 00и = Хат C(n- Dr*n d, i 
0 0 n 0 Cex: d, 


and solve the equations from the bottom up. We solve 
СппХп = d, 


for x,. Knowing the value of x,, we can then solve the next-to-last equation 


C(n — (n = Хп 1 + Ср = dai 


for х„- and so forth, back up to the top row. 

But how do we do the transformation? The allowable operations to carry out 
this transformation are called elementary row operations, none of which change 
the solution set of the underlying equations. These operations (performed on the 
augmented matrix) are 


1. Switch any two rows of the matrix. 
ii. Multiply all the elements in any one row of the matrix by a non-zero 
scalar. 
iii. Add a scalar multiple of any one row to another row. 


This process for solving systems of linear equations is known as Gaussian 
elimination, named for the famous German mathematician Karl Friedrich Gauss. 
Gauss did not actually invent this process, however; it was demonstrated in a Chi- 
nese mathematics treatise of the second century Ap, and it was probably known in 
China even earlier. 


| EXANPLE?2 | Solving the system of equations 


x+ pS 70 
24x + 14y = 1180 


using Gaussian elimination, we first form the augmented matrix 


Ё 1 | 
24 14 1180 
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We multiply row 1 by the scalar —24 and add the result to row 2 (the third elemen- 
tary row operation), giving 


1 1 70 
0 —10 —500 


The last row represents the equation 

—10у = —500 
from which y = 50. The first row represents the equation 

x+y=70 

and, because y = 50, this is 

x + 50 = 70 
so x = 20. The solution, as noted in Example 63, is x = 20, y = 50. ө 

Ey Lens apply Gaussian elimination to the system of 3 equations in 3 unknowns — 

shown here: 


She ec SM ap eZ = = 
7x + 9y — 3z = 14 
6x + 7y + 22 = 91 


The augmented matrix is 


A series of elementary row operations, as shown here, will convert the 3 X 3 
matrix of coefficients to upper triangular form. First, multiply row 1 by 1/2 to 
produce a 1 in the 1,1 position. 


223 11022 
1/2| 7 Oe —3 14 
6 7 2 9 


Then multiply row 1 by —7 and add it to row 2; also multiply row 1 by —6 and 
add it to row 3. 


d ЕИ lo =3/2 1 =li 
e 7 © =3 M giving P =B Qil 
m 6 7 2 gil 16 =1 157 
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Now multiply row 2 by 2/39 to produce a 1 in the 2,2 position. 


113 12 ЕШ 
2/39 DA =B о 
16 =i 157 


Multiply row 2 by —16 and add it to row 3. 


1 =3 i =M 


E 1 —1/3 14/3 
16 = 157 


resulting in 


1m tl 
ї ae ake 
13/3 247/3 


We are almost done. The bottom row represents the equation 
(13/3)z = 247/3 
from which z = 19. The second row represents the equation 
y — (1/3): = 14/3 or y — (1/3)(19) = 14/3 
from which y — 11. Finally, from the top row, 
x-—(3/2)y--(0/2).8-—-—1I ог x= G21) + (1/219 = = 


and x = —4. So the solution to this system of equations is x = —4,y = 11, 
z= 19. e 


PRACTICE 55 | Solve the following system of equations using Gaussian elimination. 


3x$—315p = © 
7х+ qu 3H и 


Not every system of n linear equations in n unknowns has a solution. As a 
simple case, consider 


2x + 4y = 10 
4x + 8y = 12 
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If we assume that there is a solution and proceed with Gaussian elimination, we 
perform the following elementary row operations: 


12[2 4 10 Е n | 
48 12 iia 4 8 1 


Then 


-4( |, 2 5 itsi | 2 1 
48102 results in 0 = 


in which row 2 says that 0 — —8. This contradiction says that our assumption 
about the existence of a solution is incorrect. Rewriting the two equations as 


y = (—1/2)ух + 5/2 
y = (—1/2)ух + 3/2 


shows that these are two parallel lines that never intersect, so there is no (x, у) pair 
that satisfies both equations. 

More generally, systems of linear equations can involve m equations in n un- 
knowns where n and m are not necessarily equal. Usually, if there are more equa- 
tions than unknowns, the system is overconstrained and there will be no solution. 
And usually, if there are more unknowns than equations, there will be an infinite 
number of solutions. 


|. EXAMPLETA | Consider the system of equations 


sb css E 9) 
x + 2y — 4z = 10 


In the augmented matrix, switch rows 1 and 2, then multiply row 1 by —3 and add 
it to row 2: 


=C]; 2 =4 H me i 2 -4 10 
3 =1 s 9) P= =7т 3 = 


The last row represents the equation —7у + 72 = —21 огу = 2 + 3. Substituting 
into equation 1, we get 


х+ 2(2 + 3) – 42 = 10 or x=2z+4 


Both х and у have values in terms of a parameter 2, which can have any value. The 
number of solutions is therefore infinite. ө 


Despite the simple examples we have seen, neither the solutions to a system of 
linear equations nor the coefficients are always integers. 
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Boolean Matrices 


In Chapter 6 we will be interested in matrices with only 05 and 15 as entries, 
called Boolean matrices (after George Boole, a nineteenth-century English math- 
ematician; Boole also lent his name to Boolean algebra, which we will consider 
later in this book). Matrix R of Example 62 is a Boolean matrix. We can define 
an operation of Boolean matrix multiplication A Х В on Boolean matrices using 
Boolean multiplication and Boolean addition instead of regular multiplication and 
addition. These are defined as follows: 


Boolean multiplication: x ^ y — min(x, y) 
Boolean addition: x V y = max(x, у) 


PRACTICE 56 | Fill-in the following operation tables for Boolean multiplication and Boolean addition. 


x y X^y x y х\/ у 
1 1 1 1 
1 0 1 0 
0 1 0 Й 
0 0 0 0 
и 


Now take the tables from Practice 56 and substitute 7 for 1 апа F for 0. They 
become the truth tables for conjunction and disjunction, respectively; for this rea- 
son, these operations are often called Boolean and (or logical and) and Boolean 
or (or logical or), which also explains the notation used for these operations. The 
operation of Boolean matrix multiplication A X B (on Boolean matrices of ap- 
propriate dimensions) is then defined by 

m 
Cij = V (ав Л by) 
k=1 

We can also define two analogues of ordinary matrix addition (on Boolean 
matrices of the same dimensions): A /\ B, where corresponding elements are 
combined using Boolean multiplication, and A V B, where corresponding ele- 
ments are combined using Boolean addition. 


| EXAMPLETS | Let A and B be Boolean matrices, 


1 
А = |0 
0 


S к= 
= = © 
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Then 


PRACTICE 57 | In Example 75, does A X B = А.В? Е. 


PRACTICE 58 | In Example 75, compute B X A. E 


SPECIAL INTEREST PAGE 


ө Chapter 


Solve Millions of Equations, Faster than Gauss 


Systems of linear equations are used in many areas of 
application, including telecommunications, materials 
analysis, transportation, economics, and medical im- 
aging. Instead of systems of 3 or 4 linear equations 
with 3 or 4 unknowns, such as we saw in our Example 
problems, the value of л in such applications can go 
into the millions or billions. In Exercise 55 of Section 
5.7, we see that a worst-case analysis of Gaussian elim- 
ination results in @(n*) computations (multiplications 
and additions). Such computations on real numbers are 
called floating point operations. The overall implica- 
tion of an algorithm of @(n’) is that increasing the size 
of n by a factor of 10 increases the work by a factor of 
10? — 1000. 

Ifnhasavalue of | billion, п = 10°, ћепи? = 107". 
Suppose we find a way to parallelize the Gaussian 
elimination algorithm and run it on a fast parallel- 
processing supercomputer. The Chinese Tianhe-2, 
the world’s fastest supercomputer as of June 2013 can 
crank out about 33.86 petaflops (33.86 х 10'° floating- 
point operations per second). The worst-case solution 
would be bounded above by 


10°’ operations 
(33.86) * 10P operations/second 


— over 936 years! 


Now @(n*) was an upper bound, so maybe the closer 
value is something like (2/3)@(n*), but at any rate it’s 
clear that these are time-consuming problems, and 
faster solution techniques are of interest. 

Gaussian elimination produces exact solutions to 
systems of linear equations. Other solution methods, 
called iterative solvers, produce a series of approxi- 
mate solutions that approach the exact solution. This 
method sounds as if it would take even longer, but most 
iterative solvers start out with a sparse matrix (a matrix 
with many 0 entries) for which computations will be 


faster but is nonetheless a good representative of the 
information in the original augmented matrix. Decid- 
ing which values can be zeroed out while modifying 
the values of the remaining coefficients so as to pro- 
vide a good "preconditioner" for the eventual solution 
is a difficult problem in itself. 

A team of researchers at Carnegie Mellon Univer- 
sity announced in 2010 that they had found a new al- 
gorithm for creating a good preconditioner for a large 
system of linear equations of a certain type, called 
SDD (symmetric and diagonally dominant) systems. 
In a diagonally dominant matrix, the absolute value 
of the diagonal element (aj) of each row of a matrix 
is bigger than the sum of the absolute values of all 
other elements in that row. SDD systems turn out to 
have many important applications, such as maximiz- 
ing flow through a network (a computer network, a 
water pipeline system, a transportation system) and 
recommendation systems such as Netflix that suggest 
movies you might like based on your past preferences 
and other user data. The new algorithm, based on so- 
phisticated mathematical techniques, is approximately 
@(n(log n)?)), nearly linear, promising much faster so- 
lutions to these massive problems. 


“A Breakthrough in Algorithm Design," Kroeker, K., Сот- 
munications of the ACM, September 2011. 

"A Fast Solver for a Class of Linear Systems," Koutis, I., 
Miller, G. L., and Peng, R., Communications of the 
ACM, October 2012. 

"Approaching Optimality for Solving SDD Linear Sys- 
tems," Koutis, I., Miller, G. L., and Peng, R., Proceed- 
ings of the 2010 IEEE 51st Annual Symposium on 
Foundations of Computer Science, Las Vegas, NV, 
October 23—26, 2010. 

Linear Equation Breakthrough, Dr. Dobbs, http://drdobbs. 
com/architecture-and-design/227900457 
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SECTION 5.7 REVIEW 


TECHNIQUES MAIN IDEAS 
W) Add, subtract, multiply, and perform scalar multi- • Matrices are rectangular arrangements of data that 
plication on matrices. are used to represent information in tabular form. 
* Solve systems of linear equations using Gaussian * Matrices have their own arithmetic, with opera- 
elimination. tions of addition, subtraction, multiplication, and 
(W) Perform Boolean and, or, and matrix multiplication scalar multiplication. 
on Boolean matrices. * Systems of linear equations can be solved by per- 


forming elementary row operations on an augment- 
ed matrix (Gaussian elimination). 

* Boolean matrices can be manipulated using Boolean 
operations of and, or, and Boolean multiplication. 


EXERCISES 5.7 


1. For the matrix: 


What is a2? What is a3,? 
2. Find x and y if 


3. Find x, y, z, and w if 


Z—w zt+2w 


pool ee A 


4. If Ais asymmetric matrix, find и, у, and w: 


2 w и 

А = |7 0 у 

1—3 4 

For Exercises 5—7, assume the following: 
r=3,s = —2, 
2 1 4 1 2 4 —6 
2 4 

А={|—1 0 В=|6 —1 5 ер x ”р = |1 3 
3 4 1 3 2 2 —1 


5. Compute (if possible) 
a. A+D b. A-D c. rB d. 5С e. A t rD 


10. 


12. 


13. 
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. Compute (if possible) 
a. В – “С с. r(sC) e. Р.С 
А +D а В.А + р 
. Compute (if possible) 
a. A-C c. B. D 
b. C-A а. C =C:C 
. For 
2 0 
А = |1 3 =] 
3 -2 1 
сотрше А? = А-А-А. 
Еог 
NE 3 =l 
[2 5] 
Pi i 
B = 
i2 —1] 
fe a 
С = 
|2 —2] 


compute (if possible) 

a. A-BandB-A 

b. A- (B: CO) and (А · B): C 

c. A- (B + C)andA-B- A: C 
а. (А + В): CandA:-.C + B: C 
If 


find x and y if A- B = B-A. 


. Prove that matrix multiplication is associative; that is, prove that if A is an n X p matrix, Bisa p Xr 


matrix and C is an r X m matrix, then A: (B: C) = (A: B): C. 

a. Prove that P. = I for any identity matrix I. 

b. Prove that I" = I for any identity matrix I and any positive integer л. 
Let A and B be n X n matrices. 

a. Prove that if A has one row consisting of all 0's, then so does А · B. 

b. Prove that if B has one column consisting of all 0’s, then so does А · B. 


| Section 5.7 Matrices 463 


14. An n X n matrix A is diagonal if all elements a, with i = j are 0. For example, A below is a 3 X 3 diago- 
nal matrix. 


a. Prove that if A and B are n X n diagonal matrices, then A + B is diagonal. 
b. Prove that if A is ann X n diagonal matrix and r is a scalar, then rA is diagonal. 
с Prove that if A and B are n X n diagonal matrices, then A · B is diagonal. 
15. The transpose of a matrix А, A’, is obtained by interchanging its rows and columns. Thus, if we denote 
the element іп row i, column j of A by A(i, j), then A'(i,j) = AQ, i). 
a. Find A’ for 


b. Prove that if A is a square matrix, then A is symmetric if and only if A’ = A. 
c. Prove that (A’)’ = A. 
d. Prove that (A + B)? = AT + В”. 
e. Prove that (A · B)! = B! - А”. 
16. Prove that A · A” is symmetric for any matrix A (see Exercise 15). 
17. Find two 2 X 2 matrices A and B such that A: B = 0 but A # 0 and B # 0. 
18. Find three 2 X 2 matrices A, B, and С such that A: C = B: C, С z 0, but A # B. 


19. If A and B are n X n matrices, is it always true that (A + B)? = A? + 2(А · В) + B’? Will it ever be 
true? 


20. The vector of real numbers U = [u, и] can be visualized on the real-number plane as an arrow from 
the origin to the point (u, u2). The length of the arrow, also called the magnitude of the vector, is given 


by ||U|| = Vui + uż. The dot product of two such vectors, U * V, is defined to be the real number 
U,V, + изу). Show that if Ө is the angle between U and V, 0 = 0 = т, then 


U*V 


со50 = = 
ПОПУ | 


(Hint: Use the law of cosines.) 
21. Prove that if a square matrix A is symmetric, then so is A’, where A? = А-А. 


22. Prove that if a square matrix A is symmetric, then so is A? for any integer n = 1. 
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23. Let 


For п = 1, let F(n) equal the nth value in the Fibonacci sequence (see Example 2 in Chapter 3); let 
F(0) = 0. Prove that for any л = 1, A" is given by 


p +1) F(n) | 
F(n) F(n — 1) 


24. a. Show that for 


ELE 


А`-В= В:А = І, ѕ0В = А!. 
b. Show that 


is not invertible. 
c. Show that 


is invertible with inverse 


1 а а 
B= | 22 3| 
411055 — 015051| — a, ayy 
if and only if аца — ард # 0. 
25. Prove that if A is invertible and r is a non-zero scalar, then rA is invertible with 


(rA) ! = (1/)A7!. 
26. Prove that if A is invertible and A- B. = A-C, then В = C. 


For Exercises 27—34, use Gaussian elimination to solve the systems of equations, if possible. 
27. x +5y= 1 
2х — 3y = 15 
28. x + 5y = 38.7 
4x — 2y = -1.4 


ЭЛ. de 


32. 


33. 


34. 


35. 


36. 


3d. 


38. 


39. 
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a 
| 
hs 
| 
ко 
N 
+ 
ve) 
= 
II 
со 


5х + 3у = z= у = 10 


x+2y- zt w= —3 
2x yt4z+2w= 33 
x— yt 32 – w= 6 
3x + 3y z+ 4w = -12 


x—2yct 3z- w= 7 
2х + 5у — 72 + 2w= 12 
4x — 3y + 122 w= 8 


Find an example of a system of 3 linear equations with 2 unknowns that has a solution. Explain what hap- 
pens when you use Gaussian elimination on this system. 


Find an example of a system of 4 linear equations with 3 unknowns that has a solution. Explain what hap- 
pens when you use Gaussian elimination on this system. 


You purchase an ancient Egyptian medallion at the State Fair from a vendor who swears it is pure gold. 
The medallion weighs 859.4 grams and its volume in cubic centimeters is 52. You suspect that the medal- 
lion is actually a mixture of copper and gold. You know that copper weighs 9 grams per cubic centimeter 
and that gold weighs 19.3 grams per cubic centimeter. Set up and solve a system of equations to find the 
percentage of copper by volume in the medallion. 


Cell phone Plan A charges a flat monthly fee of $30.00 for the first 400 minutes, plus $0.07 for each 
minute > 400. Plan B charges a flat monthly fee of $45.00 for the first 600 minutes, plus $0.19 for each 
minute > 600. You know you will use more than 600 minutes per month. 


a. At what number of minutes do the plans cost the same amount per month, and what is that amount? 
b. Above this number of minutes, which plan is more expensive? 
If A is ann X n invertible matrix, the following method can be used to find A’. 


1. Operate on A using any combination of the two following elementary row operations until the resulting 
matrix is the n X n identity matrix I. 


1. Multiply all the elements in any one row of A by a non-zero scalar. 
п. Add a scalar multiple of any row to any other row. 


2. At the same time, perform exactly the same sequence of operations on the n X n identity matrix I. 
3. The matrix that results from I after step 2 is А !. 
Use this method to find the inverse of matrix A in Exercise 24(a). 


| 466 Relations, Functions, and Matrices 


40. Use the method of Exercise 39 to find the inverse of matrix A in Example 71. 


41. Consider a system of п linear equations in n unknowns, such as the one from Example 63: 


x+ y= 70 
24x + 14y = 1180 


eu] BH xh 


If 


If A, the matrix of coefficients, 15 invertible, then we can multiply both sides of the above equation by 


A |, giving 
A -(A:X) -A B 
(A -A):.X-A - B (matrix multiplication is associative) 
I:X-A '- B (definition of A~') 
X-A -B (definition of I) 


Therefore the solution to the system of equations is given by 
X-A B 
Make use of Exercise 39 to find A ^ !, and use this approach to solve the system of equations. 
In Exercises 42—46, solve the systems of equations using the method of Exercise 41. 


42.x + 2y = —4 
x+ y= 5 


43. The system of Exercise 27 
44. The system of Exercise 28 
45. The system of Exercise 29 
46. The system of Exercise 30 


47. For Boolean matrices 


1 0 0 1 0 1 
A=]1 1 0 B-|0 1 1 
0 1 1 1 1 1 


find A ^ B, A V B, A X B, and B X A. 


48. 


49. 


50. 


5L. 
52. 
53. 
54. 
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For Boolean matrices 


0 0 1 0 1 
А= 1 1 0 B=/0 0 0 
1 0 0 1 0 0 
find A ^ B, A V B, A X В, and B X A. 
For Boolean matrices 
0 1 0 0 1 
А= |1 0 1 B=/0 0 1 
0 0 1 1 0 0 
find A ^ B, A V B, A х B,andB X A. 
For Boolean matrices 
1 1 0 1 0 1 
А=|0 1 B=/0 1 1 
0 0 1 1 1 1 


find A /\ B, A V B, A X B,and B X A. 

For Boolean matrices A and B, can it ever be the case that A V В = A ^ B? If so, when? 
For Boolean matrices A and B, prove that A V В = BV А and that A AB = B^ A. 
How many distinct symmetric п X n Boolean matrices are there? 


Strassen 5 algorithm reduces the amount of work to compute the product of two n X n matrices of suffi- 
cient size. For simplicity, assume that n = 2” for some m = 0. First consider a simple case of multiplying 
two 2 X 2 matrices. 


= p | B = D 2 


а an b b» 
The product 
ауруу + арр ауру + арр 
С= | 11911 12921 11912 12 | (1) 
ару + абор anbi + anba 
сап also be written (check the calculations) as 
+ ра = ps + + 
C= ia Pa — Ps Р] P3 T Ps | (2) 
рә + P4 Pi + P3 — р + po 
where 
Pi = (ау, + а›))(®\, + Б) р» = (Aa, + a»)bii 
рз = а1(р — bay) P4 = dyXb3i — bi) 
Ds = (ау + ар) Po = (а — AyD, + Б) 


рт = (ар — an)(bzn + bay) 


59: 
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Computing the various р; quantities requires 7 multiplications and 10 additions (counting subtractions as 
additions). Computing the product С once the р; quantities exist takes 0 multiplications and an additional 
8 additions. In total, computing C by this method requires 7 multiplications and 18 additions. Now take 
two n X n matrices A and В and partition each of them into four (n/2 X n/2) matrices: 


А = bs “| В = bs | 
Аз, A» В, By 


The product С = A · B is still given by Equation (1) using A; and В, instead of a; and b,, respectively. 
Therefore Equation (2) still holds, and the product C requires 7 (n/2 X n/2) matrix multiplications. This 
is an example of a divide and conquer algorithm, where the work has been reduced to several instances 
of the same problem on a significantly reduced input size (although there is an additional overhead of 18 
(n/2 X n/2) matrix additions). 

Let M(n) represent the number of multiplications required for a product of two n X n matrices. Using 
Strassen's algorithm, we can write 


Md) = 1 (one multiplication in the product of two 1 X 1 matrices) 
М(п) = (2) 


a. Solve this recurrence relation for M(). 


b. If A(n) represents the number of additions required for a product of two n X n matrices, justify the 
following recurrence relation: 


A(1) = 0 


A(n) = Ө | (7) 


c. Solve this recurrence relation to obtain an order-of-magnitude expression for A(n). 


d. Find an approximate order-of-magnitude value for the total work (additions and multiplications) for 
matrix multiplication using Strassen's algorithm, and compare it with the @(n*) work (additions and 
multiplications) for traditional matrix multiplication. (Although Strassen's algorithm is theoretically an 
improvement for all values of n, the constants that are ignored in an order-of-magnitude argument mean 
that in actual implementation the traditional algorithm may run faster for values of n less than about 
2° = 64). 

The Gaussian elimination algorithm is labor-intensive. A worst-case analysis can be done simply by count- 

ing, where multiplications and additions are the units of work (divisions are counted as multiplications 

and subtractions are counted as additions). Consider a system of л linear equations in n unknowns. 


a. In the worst-case, the first non-zero element of each row of the augmented matrix as it is being trans- 
formed into upper triangular form is not 1 and that row must be multiplied by a non-zero scalar to create 
a 1. Show that this requires a total of 


(n + (п + 2) 
2 


multiplications. (Hint: Consider Practice 7 in Chapter 2.) 


2m + Зи? — 5n 
6 


additions are required to transform the augmented matrix into upper triangular form. 


b. Aside from the multiplications required in part (a), show that 
2n? + Зи? — 5n 

6 
(Hint: Consider Exercise 11 in Section 2.2.) 


multiplications and 
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1) 


n(n + 
c. After the matrix has been reduced to upper triangular form, show that there are 20 multiplica- 


(n — l)n 


tions and - additions to solve for the » unknowns. 


d. Explain why Gaussian elimination is Ө(т?) in the worst case. 
56. DES, discussed in Section 5.6, is an encryption algorithm that is an example of a block cipher, where a 


block of bits is encoded into a block of bits. Matrices can be used to create a simple block cipher. Consider 
a2 X 2 matrix with integer entries, for example, 


ке 


А is an invertible matrix with 


because 


2 7| | 4 —7|_| 4 —7| |2 7|_|1 0 

1 4) [-1 2] [-1 2] [1 4] |0 1 
Break up the message to be encrypted into blocks of two characters, and apply a function mapping the 
letters of the alphabet into the integers 0—25 as follows: 


ГЈ К MN OP JQ В S Т JU |V WX |Y |27 


8 |9 |10 |11 12 |13 |14 |15 |16 |17 |18 |19 | 20 | 21 | 22 | 23 | 24 |25 


Thus [B R]— [1 17]. The heart of the encryption algorithm consists of multiplying the resulting 1 X 2 
matrix by A using arithmetic modulo 26. Thus 


2 7 


[1 m 1 


|=1» 75] [19 23] 


and [19 23] — [T X]. Therefore [В R] is encrypted as [T X]. To decrypt, convert [T X] back to 
[19 23 | апа multiply the resulting 1 X 2 matrix by A ', again using modulo 26 arithmetic. 


4 -7 


[19 2]. i Р 


| = [53 -87]o[1 17] 
to be converted back to the original message [8 А]. 

a. Using the encryption matrix А above, encrypt the block [V I]. 

b. Decrypt the result from part (a) to recover [V I]. 


c. Explain why the decoding process recovers the original (numerical) block. 
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equivalence class (p. 339) 

equivalence relation (p. 339) 

equivalent sets (p. 401) 

floor function (p. 386) 

foreign key (p. 368) 
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intractable problem (p. 417) 
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International Standard Book 
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many-to-many relation (p. 330) 
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mapping (p. 383) 
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modify a database (p. 375) 
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(p. 336) 
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plaintext (p. 428) 
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(p. 336) 
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referential integrity (p. 375) 
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relational calculus (p. 372) 

relational database (p. 366) 

relational model (p. 366) 

residue of x modulo л (р. 423) 

restrict (p. 369) 

restriction of a partial ordering 
(p. 336) 

RSA public key encryption 
algorithm (p. 431) 

scalar (p. 448) 

scalar multiplication (p. 448) 


SELECT (p. 372) 


SQL (p. 372) 


(p. 336) 


SELF-TEST 
Answer the following true-false questions. 


Section 5.1 


1. In a one-to-many binary relation, at least one first 
component must appear in two different ordered 
pairs. 

2. If an antisymmetric binary relation contains (x, y), 
then (y, x) will not belong to the relation. 

3. A least element of a partially ordered set precedes 
all elements except itself. 

4. An equivalence relation cannot also be a partial 
ordering. 

5. A partial ordering on a set determines a partition of 
that set. 


Section 5.2 


1. If a task is not on the critical path in a PERT chart, 
then that task is optional. 

2. A topological sort turns a partially ordered set into 
a totally ordered set. 

3. If x precedes y after a topological sort on a finite 
partially ordered set, then x preceded y in the origi- 
nal partial ordering. 

4. The times to complete parallel tasks are added to- 
gether in determining a critical path in a PERT chart. 

5. A given set of data results in a unique topological 
sort. 


Section 5.3 


1. Arelation in a relational database is a set of л tuples 
of attribute values. 

2. A primary key in a relation is a minimum subset of 
attribute values that will uniquely identify each tuple. 

3. The restrict operation can be achieved by doing a 
union followed by an intersection. 

4. The join operation can be achieved by doing a 
Cartesian product followed by a restrict. 


simple substitution cipher (p. 429) 


subtraction of matrices (p. 449) 
successor in a partial ordering 


surrogate key (p. 368) 
symmetric closure (p. 334) 
symmetric encryption (p. 431) 
symmetric matrix (p. 447) 
symmetric relation (p. 332) 


topological sorting (p. 359) 

total ordering (p. 338) 

transitive closure (p. 334) 

transitive relation (p. 332) 

tuple (p. 366) 

UPC-A (Universal Product Code) 
(p. 436) 

upper triangular matrix (p. 453) 

vector (p. 448) 

vertex (p. 337) 

zero matrix (p. 449) 


5. Deleting a tuple from a relation may result in addi- 
tional deletions being done in order to satisfy data 
integrity. 


Section 5.4 

1. A binary relation on 5 X T that is not one-to-many 
or many-to-many is a function from S to T. 

2. To prove that a function is onto, begin with an ar- 
bitrary element of the range and show that it has a 
preimage. 

3. To prove that a function is one-to-one, assume 
Қ) = f(s2) for some s, and s, in the domain and 
show that s, = s». 

4. The composition of two permutation functions on a 
set is a permutation function on the set. 

5. Any one-to-one function has an inverse function. 


Section 5.5 

1. If fis O(g), then beyond some point the values for 
f(x) must fall between 1 о(х) and 2g(x). 

2. If f = O(g), then either f = O(g) or g = o( f). 

3.1f f(x) = Зх? + 15x — 2 and g(x) = 5000x°/ 
(x = 1) then f = O(g). 

4. An intractable problem is one that only has solution 
algorithms of the form O(n^) where с = 5. 

5. ш a recurrence relation of the form 


n 


jer fornz2 


S(n) — as( 


the order of magnitude of the solution, as deter- 
mined by the master theorem, depends on the ratio 
of a to b°. 
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Section 5.6 


1. 37 = 15 (mod 11) 

2. When searching a hash table for a particular target 
value, first apply the hash function to the value to 
obtain the hash table index, then examine the table 
entry at that index. If the table entry stored there 
matches the target value, the search was successful; 
if not, the search fails. 

3. A Caesar cipher with a shift k = 5 will encode ^W" 
as "B." 

4. In the RSA algorithm, the public key is (n, e). Se- 
curity derives from the difficulty of computing 
en). 

5. The check digit in the ISBN-10 0-321-18059-3 is 
correct. 


ON THE COMPUTER 


For Exercises 1—17, write a computer program that 
produces the desired output from the given input. 


1. Input: The elements in a finite set S and a list of 
ordered pairs representing a binary relation on S 
Output: Statement indicating whether the relation 
is one-to-one, one-to-many, many-to-one, or many- 
to-many 


2. Input: The elements in a finite set S and two lists of 
ordered pairs representing two binary relations on 5 
Output: The ordered pairs in the union and in the 
intersection of the two relations, and the ordered 
pairs in the complements of each relation 


3. Input: The elements in a finite set S and a list of 
ordered pairs representing a binary relation on 5 
Output: Statement of which properties—reflexive, 
symmetric, transitive, and/or antisymmetric—the 
relation has 


4. Input: The elements in a finite set S and a list of 
ordered pairs representing a binary relation on S 
Output: Reflexive, symmetric, and transitive clo- 
sures of the relation 


5. Input: The elements in a finite set S and a list of or- 
dered pairs representing a partial ordering on S 
Output: A list of all minimal and maximal elements 


Section 5.7 


1. 


2. 
3. 


Two matrices that do not have the same dimensions 
cannot be added. 

If A and B are square matrices, then A: B = B^ A. 
The usual algorithm for matrix multiplication is 
O(n’). 


. The process of Gaussian elimination reduces the 


augmented matrix of a system of n linear equations 
in п unknowns to the n X n identity matrix. 


. If A and B are square Boolean matrices, then 


A X B = B X A, where A X B denotes the Bool- 
ean product. 


6. Input: The elements in a finite set S and a list of 


р | 


оо 


2 


10. 


ordered pairs representing a partial ordering on S 
Output: А list of any least or greatest elements. 
Note that this task is more difficult than that in Ex- 
ercise 5. 


. Input: The elements in a finite set S, a list of or- 


dered pairs representing an equivalence relation 
on S, and an element x of S 
Output: The members of [x] 


. Input: Array representations of relation tables and 


appropriate input data for restrict, project, and join 
operations 
Output: Array representations of the resulting rela- 
tion tables 


. Input: The elements in a finite set S and a list of 


ordered pairs representing a partial ordering on S 

Output: Sequence representing the total ordering 
that results from doing a topological sort (Hint: 
Reuse some of your code from Exercise 5.) 


Input: The elements in a finite set S and in a finite 
set T, and a list of ordered pairs representing a bi- 
nary relation on S X T 

Output: An indication of whether the relation is a 
function from S to T and if so, whether it is onto or 
one-to-one or both 
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11. Input: The number of elements in two finite sets 


S and T 

Output: The number of functions from S to T, the 
number of one-to-one functions from S to T (or 
an indication that none exist), and the number of 
onto functions from SS to T (or an indication that 
none exist) 


12. Input: Two lists of ordered pairs representing func- 


tions f and g from S to S 
Output: List of ordered pairs representing the com- 
position function g ° f 


13. Input: The elements in a finite set S and two lists 


that represent (in cycle form) permutations f and 
gons 

Output: One or more lists that represent the com- 
position function g ° fin cycle or product-of-cycle 
form 


14. Input: The number of elements in a finite set 5 


Output: The number of derangements on S 


15. Input: n and the entries in two n X n matrices A 


and B 
Output: Sum A + B and products А · Band B: A 


16. Input: Dimensions of a matrix A and the entries 


in A 
Output: А? (see Exercise 15 in Section 5.7) 


17. Input: The augmented matrix of a system of n lin- 


ear equations in п unknowns with a unique set of 
n solutions 

Output: The n solutions, determined by using 
Gaussian elimination 


18. The determinant of an л X n matrix can be used 


in solving systems of linear equations, as well 
as for other purposes. The determinant of A can 
be defined in terms of minors and cofactors. The 
minor of element a; is the determinant of the 
(n — 1) X (n — 1) matrix obtained from A by 
crossing out the elements in row ; and column j; 
denote this minor by M;. The cofactor of element 
a,, denoted by C;, is defined by 


C, = (- "M; 


The determinant of A is computed by multiplying 
all the elements in some fixed row of A by their 
respective cofactors and summing the results. For 
example, if the first row is used, then the determi- 
nant of A is given by 


>, (а (Ci) 
k=1 


Write a program that, when given п and the entries 
in an л X n array A as input, computes the deter- 
minant of A. Use a recursive algorithm. 
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Graphs and Trees 


CHAPTER OBJECTIVES ө 


After studying this chapter, you will be able to: 


* Understand and use the many terms associated with graphs, directed graphs, 
and trees. 

e Appreciate the use of graphs, directed graphs, and trees as representation tools 
in a wide variety of contexts. 

* Prove that two given graphs are isomorphic or give a reason why they are not. 

se Euler's formula for a simple, connected, planar graph. 

Jnderstand the role of the two specific graphs К; апа К; з in graph planarity. 

rove elementary properties about graphs and trees. 

Jse adjacency matrix and adjacency list representations for graphs and directed 

raphs. 

* Do preorder, inorder, and postorder tree traversal. 

Use array and pointer representations for binary trees. 

Use decision trees to represent the steps a searching or sorting algorithm car- 
ries out. 

* Build a binary search tree and conduct a binary tree search. 

* Express lower bounds on the worst-case number of comparisons for searching 
or sorting on a list with n elements. 

* Find Huffman codes for characters whose frequency of occurrence is given. 


e e 
асус с 


You work іп the Information Systems Department at World Wide Widgets (WWW), the 
leading widget manufacturer. Widgets are extremely complex devices made up of an 
enormous number of very simple parts. Each part is one of the following types: Bolt 
(B), Component (C), Gear (G), Rod (R), or Screw (S). There are many different varia- 
tions of each basic type. Part numbers consist of a leading character B, С, G, R, or S 
to identify the part type, followed by an 8-digit number. Thus 


C00347289 
B11872432 
545003781 


are all legitimate part numbers. Using the multiplication principle, there are 5 х 10? 
different potential part numbers! WWW maintains a data file of the part numbers 
it uses, which, as it turns out, is most of the potential numbers. Most computers, 
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including those at WWW, use the ASCII encoding scheme for converting characters 
into binary form, under which each character requires 1 byte (8 bits) of storage. Be- 
cause each different part number consists of 9 characters, the WWW parts data file 
is approximately 9 x 5 x 10? bytes, or 4.5 Gb. 


Question: How can you compress this data file so that it takes less storage space? 


One answer to this question involves working with binary tree structures. A tree 
is a visual representation of data items and the connections between some of these 
items. It is a special case of a more general structure called a graph. Graphs or 
trees can be used to represent a surprising number of real-world situations— 
organization charts, road maps, transportation and communications networks, 
and so forth. Later we will see other uses of graphs and trees to represent logic 
networks, finite-state machines, and formal-language derivations. 

Graph theory is an extensive topic. Sections 6.1 and 6.2 present some of the 
considerable terminology connected with graphs and trees and some elementary 
results about these structures. To represent a graph or a tree in computer memory, 
data must be arranged in a way that preserves all the information contained in the 
visual representation. Several approaches to representing graphs and trees within 
a computer are discussed. 

Decision trees are graphical representations of the activities of certain types 
of algorithms. In Section 6.3, decision trees are presented and used to find lower 
bounds on the worst-case behavior of searching and sorting algorithms. In Sec- 
tion 6.4, an algorithm is given for constructing binary trees that allow for data 
compression of large files. 


SECTION 6.1 | GRAPHS AND THEIR REPRESENTATIONS 


Definitions of a Graph 


One way to while away the hours on an airplane trip is to look at the literature in 
the seat pockets. This material almost always includes a map showing the routes 
of the airline you are flying, such as the one in Figure 6.1. АП this route informa- 
tion could be expressed in paragraph form; for example, there is a direct route be- 
tween Chicago and Nashville but not between St. Louis and Nashville. However, 
the paragraph would be rather long and involved, and we would not be able to 
assimilate the information as quickly and clearly as we can from the map. There 
are many cases where “а picture is worth a thousand words." 


Nashville 


Albuquerque 
-9— 


Miami 


Los Angeles 
Figure 6.1 
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Pie Chart 


Bar Graph Picture Graph Profits 


18% 


Each figure represents 

5 100,000 
B 
Б New York 
a 
E 
m Philadelphia At zt 

2012 2013 2014 2015 2014 expenditures 
Figure 6.2 


The term “graph” is often used informally for any visual representation of 
data, such as that in Figure 6.1; other forms include the bar graph, picture graph, 
and pie chart, which are shown in Figure 6.2. We have also talked about graphs 
of functions on rectangular coordinate systems. We will use two definitions of a 
graph; one relies on a visual representation like that of Figure 6.1, and the other is 
a more formal definition that actually says nothing about a visual representation. 


* DEFINITION GRAPH 


(Informal) A graph is a nonempty set of nodes (vertices) and a set of arcs (edges) such that 
each arc connects two nodes. 


Our graphs will always have a finite number of nodes and arcs. 


The set of nodes in the airline map of Figure 6.1 is {Chicago, Nashville, Miami, 
Dallas, St. Louis, Albuquerque, Phoenix, Denver, San Francisco, Los Angeles}. 
There are 16 arcs; Phoenix-Albuquerque is an arc (here we are naming an arc by 
the nodes it connects), Albuquerque-Dallas is an arc, and so on. ө 


In the graph of Figure 6.3, there are five nodes and six arcs. Arc a, connects nodes 
1 and 2, arc а; connects node 2 and 2, and so forth. 


аз 


a» 2 


Figure 6.3 e 


The informal definition of a graph works quite well if we have the visual 
representation of the graph before us to show which arcs connect which nodes. 
Without the picture, however, we need a concise way to convey this information. 
Hence our second definition of a graph. 
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Ф DEFINITION GRAPH 
(Formal) A graph is an ordered triple (N, A, g) where 


N — a nonempty set of nodes (vertices) 

A — a set of arcs (edges) 

g = a function associating with each arc a an unordered pair х-у of nodes 
called the endpoints of a 


For the graph of Figure 6.3, the function g associating arcs with endpoints performs 
the following mapping: g(a,) = 1-2, g(a.) = 1-2, g(a3) = 2-2, g(a4) = 2—3, 
g(as) = 1-3, and g(ag) = 3-4. e 


PRACTICE 1 | Sketch a graph having nodes {1, 2, 3,4, 5}, arcs (a,,a5,a5, A4, As, ag), and function 
glai) = 1-2, g(a;) = 1-3, g(a) — 3—4, g(a) — 3-47 2 (05) с=з апо ы " 
We might want the arcs of a graph to begin at one node and end at another, in 
which case we would use a directed graph. 
e 


DEFINITION DIRECTED GRAPH 
A directed graph (digraph) is an ordered triple (N, А, е) where 


N — a nonempty set of nodes 

A — a set of arcs 

g — a function associating with each arc a an ordered pair (x, y) of nodes 
where x is the initial point and y is the terminal point of a. 


In a directed graph, then, there is a direction associated with each arc. 


Figure 6.4 shows a directed graph. There are 4 nodes and 5 arcs. The function g 
associating arcs with endpoints performs the mapping g(a,) = (1,2), meaning that 
arc a, begins at node 1 and ends at node 2. Also, g(a3) = (1,3), but g(a4) = (3,1). 


ay 2 4 a 
| с 
3 


a4 


Figure 6.4 e 


Besides imposing direction on the arcs of a graph, we may want to modify the 
basic definition of a graph in other ways. We often want the nodes of a graph to car- 
ry identifying information, like the names of the cities in the map of airline routes. 
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This map would be a labeled graph. We may want to use a weighted graph, where 
each arc has some numerical value, or weight, associated with it. For example, we 
might want to indicate the distances of the various routes in the airline map. 

In this book, the term “graph” will mean an undirected graph. To refer to a 
directed graph, we will always say “directed graph.” 


Applications of Graphs 


Although the idea of a graph is very simple, an amazing number of situations 
have relationships between items that lend themselves to graphical representa- 
tion. Not surprisingly, there are many graphs in this book. Graphical representa- 
tions of partially ordered sets (Hasse diagrams) were introduced in Chapter 5. A 
PERT chart (for example, Figure 5.7) is a directed graph. The E-R diagram (for 
example, Figure 5.10) is a graph. The commutative diagram illustrating composi- 
tion of functions (Figure 5.23) is a directed graph. Chapter 8 will introduce logic 
networks and represent them as directed graphs. Directed graphs will also be used 
to describe finite-state machines in Chapter 9. 

We saw that the airline route map was a graph. A representation of any net- 
work of transportation routes (a road map, for example), communications lines (as 
in a computer network), or product or service distribution routes such as natural 
gas pipelines or water mains is a graph. The chemical structure of a molecule is 
represented graphically. 


PRACTICE 2 | Draw the underlying graph in each of the following cases. 


a. Figure 6.5 is a road map for part of Arizona. 
b. Figure 6.6 is a representation of an ozone molecule with three oxygen atoms. 


Flagstaff 


Winslow 
Sedona 


Payson 


Carefree 


Phoenix (@) Scottsdale Apache 


Junction 
Tempe Mesa 


О 
Chandler f Ne 
О О 


Casa Grande 
Figure 6.5 Figure 6.6 
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A high-level view of the information flow in a state automobile licensing office 
is prepared as the first step in developing a new computerized licensing system. 
Figure 6.7 shows the resulting directed graph, often called a data flow diagram. 


Customer request 


Process 
application 


Customer 


Invoice 


Validation 


Payment 


Request 
payment 


Receipt 
Plate 


Plate 
authorization 


Process 
payment 


Registration copy 
Credit notification 


State Bureau of 
Revenue Division Motor Vehicles 


Figure 6.7 ө 


Figure 6.8 shows a graph representation of a local area network of computers in an 
office complex. In this “star topology,” all machines communicate through a central 
server. The graph representation highlights one of the weaknesses of such a network 
design, namely its reliance on continued, dependable operation of the central server. 


Figure 6.8 ө 


Neural networks, tools used in artificial intelligence for such tasks as pattern гес- 
ognition, are represented by weighted directed graphs. Figure 6.9 shows a multi- 
layer network consisting of input units, output units, and a “hidden layer” of units. 
Weights on the arcs of the graph are adjusted as the neural network “learns” how 
to recognize certain trial patterns. 
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Input units Hidden layer Output units 


Figure 6.9 ө 


Graph Terminology 


Before proceeding, we need some terminology about graphs. Surprisingly, al- 
though there is a large body of literature in graph theory, the terminology is not 
completely standard. Therefore other books may give slightly different variations 
of some of these terms. 

Two nodes in a graph are adjacent if they are the endpoints associated with an 
arc. For example in the graph of Figure 6.3 (reproduced here), 1 and 3 are adjacent 
nodes but | and 4 are not. Node 2 is adjacent to itself. A loop in a graph is an arc 
with endpoints n—n for some node л; in Figure 6.3, arc а; 1 a loop with endpoints 
2—2. A graph with no loops is loop-free. Two arcs with the same endpoints are 
parallel arcs; arcs a,, and a, in Figure 6.3 are parallel. A simple graph is one 
with no loops or parallel arcs. An isolated node is adjacent to no other node; in 
Figure 6.3, 5 is an isolated node. The degree of a node is the number of arc ends 
at that node. In Figure 6.3, nodes | and 3 have degree 3, node 2 has degree 5, node 
4 has degree 1, and node 5 has degree 0. 


a3 


. e 
1 45 3 d6 4 5 


Because the function g that relates arcs to endpoints in the formal definition of 
a graph is indeed a function, each arc has a unique pair of endpoints. If g is a one-to- 
one function, then there is at most one arc associated with a pair of endpoints; such 


PRACTICE 3 
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graphs have no parallel arcs. A complete graph is one in which any two distinct 
nodes are adjacent. In this case, g is almost an onto function—every pair х—у of 
distinct nodes is the image under g of an arc—but there does not have to be a loop at 
every node. Consequently, pairs of the form x—x need not have a preimage. 

A subgraph of a graph consists of a set of nodes and a set of arcs that are sub- 
sets of the original node set and arc set, respectively, in which the endpoints of an 
arc must be the same nodes as in the original graph. In other words, it is a graph 
obtained by erasing part of the original graph and leaving the rest unchanged. 
Figure 6.10 shows two subgraphs of the graph in Figure 6.3. Note that the graph in 
Figure 6.10a is simple and also complete. 


a3 
2 a» 2 
а ад 
а 

ә 

1 a5 3 1 a5 3 
(a) (b) 

Figure 6.10 


A path from node n, to node л, is a sequence 
По, Ag, Пу, Ay, ... , Hp 1 Gg 1 Ng 


of nodes and arcs where for each i, the endpoints of arc a; are n;-n;,,. In the 
graph of Figure 6.3, one path from node 2 to node 4 consists of the sequence 
2, Ap 1, 45,2, a4, 3, а, 4. The length of a path is the number of arcs it contains; if 
an arc is used more than once, it is counted each time it is used. The length of the 
path just described from node 2 to node 4 is 4. 

A graph is connected if there is a path from any node to any other node. The 
graphs in Figure 6.10 are each connected, but the graph of Figure 6.3 is not con- 
nected. A cycle in a graph is a path from some node л back to ny where no arc 
appears more than once in the path sequence, my is the only node appearing more 
than once, and n occurs only at the ends. (Nodes and arcs may be repeated in a 
path but not, except for node no, in a cycle.) In the graph of Figure 6.3, 


1), а, 2, ал, 3, as, 1 


is a cycle. A graph with no cycles is acyclic. 


Refer to the graph created in Practice 1. 


с Бо њоро сю 


. Find two nodes that аге not adjacent. 
. Find a node adjacent to itself. 

. Find a loop. 
. Find two parallel arcs. 

. Find the degree of node 3. 


Find a path of length 5. 


. Find a cycle. 
. Is this graph complete? 


Is this graph connected? п 
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ЭСЇТ 8 Figure 6.11 illustrates the simple, complete graphs with 1, 2, 3, and 4 vertices. The 
simple, complete graph with n vertices is denoted by K,. 


e 9€— ——9 А Х| 
Kj К» Кз K4 
Figure 6.11 ө 


PRACTICE 4 | Draw K;. н 


Now consider the simple graph in Figure 6.12. It is not a complete graph 
because it is not true that every node is adjacent to every other node. However, 
the nodes can be divided into two disjoint sets, (1, 2) and (3, 4, 5}, such that 
any two nodes chosen from the same set are not adjacent but any two nodes 
chosen one from each set are adjacent. Such a graph is a bipartite complete 


graph. 


3 4 5 
Figure 6.12 


DEFINITION BIPARTITE COMPLETE GRAPH 
A graph is a bipartite complete graph if its nodes can be partitioned into two dis- 
joint nonempty sets №; and N, such that two nodes x and y are adjacent if and only 
if x € N, and y € №. If |N,| = m and |N,| = n, such a graph is denoted by К, 


n,n* 


Figure 6.12 therefore illustrates K, з. 


PRACTICE 5 | Draw Кз. 


The concept of a path extends to a directed graph, as we might expect: A path 
from node n, to node n, in a directed graph is a sequence 


По, Q9, П], Ay, ... s Tp 1, 2-1, k 
where for each i, n; is the initial point and л, is the terminal point of a;. If a path 


exists from node n, to node n,, then n, is reachable from ло. The definition of a 
cycle also carries over to directed graphs. 


РВАСТ!СЕ 6 
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In the directed graph of Figure 6.13, there are many paths from node 1 to node 3: 
1, a4,3 and 1, а), 2, 2, 2, a2, 2, аз, 3 are two possibilities. Node 3 is certainly 
reachable from 1. Node 1, however, is not reachable from any other node. The 
cycles in this graph are the loop a, and the path 3, as, 4, dg, 3. 


2 
2 
il 3 
a7 


4 
Figure 6.13 ө 


We can prove some (fairly trivial) statements about graphs that follow directly 
from the definitions. 


Prove that an acyclic graph is simple. 

We'll use a proof by contraposition. If a graph is not simple, it has either 
parallel arcs or a loop. The parallel arcs and their endpoints, or the loop and its 
endpoints, then constitute a cycle, and the graph is not acyclic. ө 


Note that the converse to the statement in Example 10 is not true: Figure 6.10a 
is a simple graph, but it contains a cycle. 


a. Prove that every complete graph is connected. 
b. Find a connected graph that is not complete. E 


Isomorphic Graphs 


Two graphs may appear quite different in their visual representation but still be the 
same graph according to our formal definition. We want to distinguish between 
two graphs that have cosmetic visual differences and those that have fundamen- 
tally different structures. The graphs in Figures 6.14 and 6.15 are the same—they 
have the same nodes, the same arcs, and the same arc-to-endpoint function. (In 
a representation of a graph, arcs can intersect at points that are not nodes of the 
graph.) The graph in Figure 6.16 is essentially the same graph as well. If we rela- 
beled the nodes and arcs of the graph of Figure 6.14 by the following mappings, 
the graphs would be the same: 


fila ha e» 
2c d, — е 
3b 


4—4 


* DEFINITION 
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е 
3 2 
2 3 
a 
a a ае Pb 
ai 

1 4 1 4 c d 
Figure 6.14 Figure 6.15 Figure 6.16 


Structures that are the same except for relabeling are called isomorphic struc- 
tures. To show that two structures are isomorphic, we must produce a relabeling 
(one-to-one, onto mappings between the elements of the two structures) and then 
show that the important properties of the structures are “preserved” (maintained) 
under this relabeling. In the case of graphs, the elements are nodes and arcs. The 
"important property" in a graph is which arcs connect which nodes. 

The given mappings f, and № are one-to-one, onto functions from the nodes 
and arcs, respectively, of the graph in Figure 6.14 to the nodes and arcs of the graph 
in Figure 6.16. Furthermore, if an arc a in the graph of Figure 6.14 has endpoints x-y, 
then the arc /(а) in the graph of Figure 6.16 has endpoints /(х)—Л ОУ), and vice versa. 
For example, arc a, in Figure 6.14 has endpoints 1—3, while its corresponding arc e; in 
Figure 6.16 has endpoints a—b, which are the nodes in Figure 6.16 that correspond to 
nodes 1 and 3 in Figure 6.14. We can formalize this idea. 


ISOMORPHIC GRAPHS 

Two graphs (№, A,, g1,) and (N>, A>, g;) are isomorphic if there are bijections 
fi: Ni >N, and р: А, — A, such that for each arc a € А, g(a) = x-y if and 
only if gj 5(а)] = AOAO). 


The graphs shown in Figure 6.17 are isomorphic. The bijections that establish the 
isomorphism are partially given here: 


Jele Ља е 
1 @ d, — ед 
I=] аз е 
4 Б 
3 — 


Figure 6.17 
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Using these bijections, g,(a;) = 1-3 and g,[f5(a3)] = ge) = с-а = AODA). 
We can see that the arc-to-endpoint relationship is preserved under the relabeling 
for the case of arc аз. To prove that the graphs are isomorphic, we would have to 
complete the definition of the f) function and then demonstrate that the arc-to- 
endpoint relationship is preserved under these mappings by examining all possible 
cases. e 


PRACTICE 7 | Complete the definition of the function f; in Example 11. е 


® THEOREM 


Graph isomorphism is easier to establish if we restrict our attention to simple 
graphs. If we can find an appropriate function fı mapping nodes to nodes, then a 
function f; mapping arcs to arcs is trivial because there is at most one arc between 
any pair of endpoints. Hence the following theorem is true. 


ON SIMPLE GRAPH ISOMORPHISM 

Two simple graphs (N,, Aj, g,) and (№, А», g;) are isomorphic if there is a bijec- 
tion f: №, > N, such that for any nodes n; and n; of №, n; and n; are adjacent if 
and only if f(n;) and f(n;) are adjacent. (The function fis called an isomorphism 
from graph 1 to graph 2.) 


PRACTICE 8 | Find an isomorphism from the graph of Figure 6.18a to that of Figure 6.18b. 


a 
1 
b 
4 5 c 
2 3 
e J 
6 
(a) (b) 
Figure 6.18 L| 


Proving that two graphs are isomorphic requires finding the bijection (or, for non- 
simple graphs, bijections) and then showing that the adjacency property (or arc-to-end- 
point relationship) is preserved. To prove that two graphs are not isomorphic, we must 
prove that the necessary bijection(s) do not exist. We could try all possible bijections 
(because there is a finite number of nodes and arcs, there is a finite number of bijec- 
tions). However, this method would quickly get out of hand in graphs of any size at 
all. Instead, we can try to find some other reason that such bijections could not exist. 
Although this task is not always easy, there are certain conditions under which it is clear 
that two graphs are not isomorphic (see Exercise 21). These include the following: 


1. One graph has more nodes than the other. 
2. One graph has more arcs than the other. 
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. One graph has parallel arcs and the other does not. 

. One graph has a loop and the other does not. 

. One graph has a node of degree k and the other does not. 
. One graph is connected and the other is not. 

. One graph has a cycle and the other does not. 


MON tA > шо 


PRACTICE 9 | Prove that the two graphs in Figure 6.19 are not isomorphic. 


N X 


Figure 6.19 L| 


The two graphs of Figure 6.20 are not isomorphic. Note that each graph has six 
nodes and seven arcs. Neither has parallel arcs or loops. Both are connected. Both 
have three cycles, four nodes of degree 2, and two nodes of degree 3. Therefore 
none of the obvious nonisomorphism tests apply. However, the graph in Figure 
6.20b has a node of degree 2 that is adjacent to two nodes of degree 3; this is not 
the case in Figure 6.20a, so the graphs are not isomorphic. 


(a) (b) 


Figure 6.20 ө 


Again, graphs that are isomorphic are considered to be “the same” regardless of 
cosmetic differences in how they are drawn or labeled, whereas nonisomorphic 
graphs have fundamental structural differences. 


Planar Graphs 


A planar graph is one that can be represented (on a sheet of paper, that is, in 
the plane) so that its arcs intersect only at nodes. Designers of integrated circuits 
want all components in one layer of a chip to form a planar graph so that no con- 
nections cross. The graph of Figure 6.14 is clearly planar. However, we know that 
it is isomorphic to the graph of Figure 6.15, so the graph of Figure 6.15 is also 
planar. The key word in the definition of a planar graph is that it can be drawn in 
a certain way. 


PRACTICE 10 | Prove that K, is a planar graph. 
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Consider K;, the simple, complete graph with five vertices. We will try to construct 
Ks with no intersecting arcs by starting with some of the arcs and then adding as 
many new arcs as possible without crossing existing arcs. We'll first lay out five 
vertices and connect them as shown in Figure 6.21a. (Because all the vertices in K, 
are symmetric, it doesn't matter how we label them.) 


(a) (b) (c) 
Figure 6.21 


Next we connect | to 3 and 1 to 4, as shown in Figure 6.216. Now 2 must be connect- 
ed to both 4 and 5. This can be accomplished while still preserving the planarity of the 
graph by putting these new arcs on the outside, as in Figure 6.21c. The final connec- 
tion is between nodes 3 and 5. But there is no way to draw an arc from node 3 to node 
5 without crossing either the 2-4 arc or one or more of the interior arcs, such as 1-4. 
We did have a choice of how to place arcs 1-3 and 1-4; we made them interior 
arcs. We could explore whether making these arcs exterior would change anything, 
but it turns out that it does not (see Practice 11). Thus it appears that К; is not a pla- 
nar graph. However, we'd still like a proof of this with a firmer foundation—this 
sounds too much like an “I can't do it so it can't be done" argument. Such a proof 


will be given shortly. e 
PRACTICE 11 | Show that adding arcs 1-3 and 1-4 as exterior arcs when constructing K; still leads to a 
situation where arcs must intersect. = 
PRACTICE 12 | Present а construction-type argument that К; з is not a planar graph. = 


One fact about planar graphs was discovered by the eighteenth-century Swiss 
mathematician Leonhard Euler (pronounced *oiler"). A simple, connected, planar 
graph (when drawn in its planar representation, with no arcs crossing) divides the 
plane into a number of regions, including totally enclosed regions and one infinite 
exterior region. Euler observed a relationship between the number n of nodes, the 
number a of arcs, and the number r of regions in such a graph. This relationship is 
known as Euler’s formula: 


n—actr-2 (1) 


PRACTICE 13 | Verify Euler's formula for the simple, connected, planar graph in Figure 6.18b. : 


To prove Euler's formula, we will do a proof by induction on a, the number of arcs. 
In the base case, a — 0 and the graph consists of a single node; the only region is the 
exterior region (Figure 6.22а). Here = 1, а = 0, andr = 1, and Equation (1) holds. 


| Section 6.1 Graphs and Their Representations 489 


Now assume that the formula holds for the planar representation of any simple, con- 
nected, planar graph with К arcs, and consider such a graph with k + 1 arcs. As usual, 
we must somehow relate the “A + 1 instance" to a ^k instance" so that we can make use 
of the inductive hypothesis. Here we consider two cases for the graph with k + 1 arcs. 


(a) (b) 
Figure 6.22 
Case I. The graph has a node of degree 1. Temporarily erasing this node and its 
connecting arc (Figure 6.22b) leaves a simple, connected, planar graph with k 


arcs, some number п of nodes, and some number r of regions for which (by the 
inductive hypothesis) 


n-k+r=2 


In the original graph, there was one more arc and one more node but the same 
number of regions, so the appropriate formula is 


(n+1)-—(kK+1)+r=2 
which, by the inductive hypothesis, is true. 


Case 2. The graph has no nodes of degree 1. Then temporarily erase one arc that 
helps define an enclosed region (Figure 6.22c). (If no arcs help define an enclosed 
region, the graph is a chain and there is a node of degree 1.) This leaves a simple, 
connected, planar graph with k arcs, some number n of nodes, and some number r 
of regions for which (by the inductive hypothesis) 


n-k+r=2 


n the original graph, there was one more arc and one more region, but the same 
number of nodes, so the appropriate formula is 


n—-(k-1)-*(r-1)-2 


which, by the inductive hypothesis, 15 true. 


PRACTICE 14 | In the proof of Euler's formula, explain why in case 2 the arc to be erased must help define 
an enclosed region. Give two reasons. 


There are two consequences of Euler's formula if we place further restrictions on 
the graph. Suppose we require that the graph not only be simple, connected, and pla- 
nar but also have at least three nodes. In a planar representation of such a graph, we 


Ф THEOREM 


490 Graphs and Trees 


can count the number of edges that are adjacent to (form the boundaries of) each re- 
gion, including the exterior region. Arcs that are wholly interior to a region contribute 
two edges to that region; for example, if we trace the boundary of the interior region 
shown in Figure 6.22b, we travel six edges, including the arc out to the node of degree 
1 and then back again. Arcs that separate two regions contribute one edge to each 
region. Therefore, if there are a arcs in the graph, the number of region edges is 2a. 

There are no regions with exactly one adjacent edge, because there are no 
loops in the graph. There are no regions with exactly two adjacent edges, because 
there are no parallel arcs and the graph consisting entirely of one arc joining two 
nodes (which would have two edges adjacent to the exterior region) is excluded. 
Therefore each region has at least three adjacent edges, so 3r is the minimum 
number of region edges. Thus 


2а = 3r 
or, from Equation (1), 
2а = 3(2 — n + a) = 6 – Зп + За 
and finally 
a=3n-6 (2) 


If a final restriction that there are no cycles of length 3 is placed on the graph, 
then each region has at least four adjacent edges, so 4r is the minimum number of 
region edges. This leads to the inequality 


2a = 4r 
which becomes 
ax2n-4 (3) 


These results are summarized in the following theorem. 


ON THE NUMBER OF NODES AND ARCS 
For a simple, connected, planar graph with n nodes and a arcs: 


1. If the planar representation divides the plane into r regions, then 
n-atr=2 (1) 
2, It: 3, then 
а= Зп — 6 (2) 
3. If n > 3 and there are no cycles of length 3, then 


а= 2n—4 (3) 


Note that inequality (3) places a tighter bound on the number of arcs than inequality 
(2), but an additional condition has been imposed on the graph. 
We can use this theorem to prove that certain graphs are not planar. 
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K; is a simple, connected graph with 5 nodes (and 10 arcs). If it were a planar graph, 
inequality (2) of our theorem would hold, but 10 > 3(5) — 6. Therefore, just as our 
construction argument showed, K; is not planar. К; is a simple, connected graph 
with 6 nodes (and 9 arcs). It has no cycles of length 3, because this would require two 
nodes in one of the two subsets to be adjacent. If it were a planar graph, inequality 
(3) would hold, but 9 > 2(6) — 4. Therefore К; з is not planar. e 


PRACTICE 15 | Show that inequality (2) does hold for А; 3, which shows that this inequality is a necessary 


but not sufficient condition for planarity in graphs with n — 3. 


The nonplanar graphs K; and K; play a central role in all nonplanar graphs. 
To state what this role is, we need one more definition. 


* DEFINITION HOMEOMORPHIC GRAPHS 
Two graphs are homeomorphic if both can be obtained from the same graph by 
a sequence of elementary subdivisions, in which a single arc x-y is replaced by 
two new arcs x—v and у-у connecting to a new node v. 


The graphs in parts (b) and (c) of Figure 6.23 are homeomorphic because each can 
be obtained from the graph of Figure 6.23a by a sequence of elementary subdivi- 
sions. (However, neither can be obtained from the other by a sequence of elemen- 
tary subdivisions.) 


PA 2 


(a) (b) (c) 
Figure 6.23 ө 


A graph that is planar cannot be turned into a nonplanar graph by elementary 
subdivisions, and a graph that is nonplanar cannot be turned into a planar graph 
by elementary subdivisions (see Exercise 32). As a result, homeomorphic graphs 
are either both planar or both nonplanar. The following theorem, due to the Polish 
mathematician Kazimierz Kuratowski, characterizes nonplanar graphs. 


THEOREM KURATOWSKI THEOREM 
A graph is nonplanar if and only if it contains a subgraph that is homeomorphic 
to К; or K33. 


We won't prove this theorem, although one direction is easy to see. Ifa graph has a 
subgraph homeomorphic to the nonplanar graphs К; or Ку, then the subgraph— 
and hence the entire graph—is nonplanar. 
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Figure 6.24a shows the “Petersen graph.” We will prove that this graph is not 
planar by finding a subgraph homeomorphic to Кз з. By looking at the top of the 
graph, we can see that node a is adjacent to nodes e, f, and b, none of which are 
adjacent to each other. Also, node e is adjacent to nodes d and j as well as a, and 
nodes a, d, and j are not adjacent to each other. This information is incorporated 
in the graph of Figure 6.24b, which is also a subgraph of К; з. The arcs needed 
to complete Аз are shown as dotted lines in Figure 6.24c. These arcs are not 
in the Petersen graph; for example, no j—f arc is present. However, there is a 
path in the Petersen graph from j to fusing the intermediate node A, that is, j-h 
and h—f. Similarly, there are paths j-g and g—b, d-i апа i-f, and d-c and c—b. 
Adding these paths to Figure 6.24b results in Figure 6.24d, which is a subgraph 
of the Petersen graph and is also obtainable from Figure 6.24c by a sequence of 
elementary subdivisions. e 


(b) 


Figure 6.24 


Computer Representation of Graphs 


We have said that the major advantage of a graph is its visual representation of 
information. What if we want to store a graph in digital form? Although it is pos- 
sible to store a digital image of a graph, it takes a lot of space. Furthermore, such 
an image remains but a picture—the data it represents can't be manipulated in any 
way. What we need to store are these essential data that are part of the definition 
of a graph—what the nodes are and which nodes have connecting arcs. From this 
information a visual representation could be reconstructed if desired. The usual 
computer representations of a graph involve one of two data structures, either an 
adjacency matrix or an adjacency list. 


Adjacency Matrix 

Suppose a graph has n nodes, numbered л, 75, ... , Np. This numbering imposes 
an arbitrary ordering on the set of nodes; recall that a set is an unordered collec- 
tion. However, this is done merely as a means to identify the nodes—no signifi- 
cance is attached to one node appearing before another in this ordering. Having 
ordered the nodes, we can form an n X n matrix where entry i, / 1s the number of 
arcs between nodes n; and n;. This matrix is called the adjacency matrix A of the 
graph with respect to this ordering. Thus, 


а; = p where there are p arcs between л; and n; 
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The adjacency matrix for the graph in Figure 6.25 with respect to the ordering 
1, 2, 3,4 is a4 X 4 matrix. Entry 1,1 is a 1 due to the loop at node 1. All other 
elements on the main diagonal are 0. Entry 2,1 (second row, first column) is a 1 
because there is one arc between node 2 and node 1, which also means that entry 
IL, 2 а JL 


2 3 
4 

Figure 6.25 

So far we have 
= = 
| «Йй к= 5 
А = 
== 0 == 
= E 0 ө 
PRACTICE 16 | Complete the adjacency matrix for Figure 6.25. = 


The adjacency matrix in Practice 16 is symmetric, which will be true for the 
adjacency matrix of any undirected graph—if there are p arcs between n; and n,, 
there are certainly p arcs between л; and n;. The symmetry of the matrix means 
that only elements on or below the main diagonal need to be stored. Therefore, all 
the information contained in the graph in Figure 6.25 is contained in the “lower 
triangular” array shown, and the graph could be reconstructed from this array. 
(The “upper triangular" version could also be used.) 


1 

1 0 

0 1 0 

1 0 2 0 


In а directed graph, the adjacency matrix A reflects the direction of the arcs. 
For a directed matrix, 


а; = p where there are p arcs from л; to n; 


An adjacency matrix for a directed graph will not necessarily be symmetric, 
because an arc from n; to л; does not imply an arc from n; to л. 


EXAMPLE 18 
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Consider the directed graph of Figure 6.26. 


2 3 
4 
Figure 6.26 
The adjacency matrix is 

i i @ @ 

re o @ i i 

Fie 11 @ g 
0 OG Y € e 


In a simple weighted graph, the entries in the adjacency matrix can indicate 
the weight of an arc by the appropriate number rather than just indicating the 
presence of an arc by the number 1. 


Adjacency List 

Many graphs, far from being complete graphs, have relatively few arcs. Such 
graphs have sparse adjacency matrices; that is, the adjacency matrices contain 
many zeros. Yet if the graph has n nodes, it still requires n? data items to represent 
the adjacency matrix (ог more than л?/2 if a triangular matrix is used), even if 
many of these items are zero. Any algorithm or procedure in which every arc in 
the graph must be examined requires looking at all n? items in the matrix, since 
there is no way of knowing which entries are nonzero without examining them. To 
find all the nodes adjacent to a given node n; requires scanning the entire ith row 
of the adjacency matrix, a total of п items. 

A graph with relatively few arcs can be represented more efficiently by stor- 
ing only the nonzero entries of the adjacency matrix. This representation consists 
of a list for each node of all the nodes adjacent to it. Pointers are used to get us 
from one item in the list to the next. Such an arrangement is called a linked list. 
There is an array of n pointers, one for each node, to get each list started. This 
adjacency list representation, although it requires extra storage for the pointers, 
may still be more efficient than an adjacency matrix. To find all the nodes adjacent 
to n; requires traversing the linked list for n, which may have far fewer than the 
n elements we had to examine in the adjacency matrix. However, there are trade- 
offs; if we want to determine whether one particular node n, is adjacent to n;, we 
may have to traverse all of n;'s linked list, whereas in the adjacency matrix we 
could access element i, j directly. 
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The adjacency list for the graph of Figure 6.25 contains a four-element array of 
pointers, one for each node. The pointer for each node points to an adjacent node, 
which points to another adjacent node, and so forth. The adjacency list structure is 
shown in Figure 6.27. 


1 ——— 1 | 2 ——— 4 ө 
2 ——— 1 —— 3 е 

3) | 2 =г | 4 ——— 4 ө 
4 = 1 —— 3 =- 3 e 
Figure 6.27 


In the figure the dot indicates a null pointer, meaning that there is nothing more to be 
pointed to or that the end of the list has been reached. We have dealt with parallel arcs 
by listing a given node more than once on the adjacency list for n; if there is more than 
one arc between л; and that node. Note that the arrow in list 1 from the 2 node to the 4 
node does not mean that there is an arc from node 2 to node 4; all the elements in the 
node | list are adjacent to node 1, not necessarily to each other. ө 


PRACTICE 17 | Draw the adjacency list representation for the graph shown in Figure 6.28. 


1 


Э 
Figure 6.28 L| 


In an undirected graph, each arc is represented twice. If n, is on the adjacency 
list of n;, then л, is also on the adjacency list of n;. The adjacency list representation 
for a directed graph puts n, on the list for n; if there is an arc from л, to n; n; would 
not necessarily be on the adjacency list for n;. For a labeled graph or a weighted 
graph, additional data items can be stored with the node name in the adjacency list. 


Figure 6.29a shows a weighted directed graph. The adjacency list representation 
for this graph is shown in Figure 6.29b. For each record in the list, the first data 
item is the node, the second is the weight of the arc to that node, and the third is the 
pointer. Note that entry 4 in the array of startup pointers is null because there are 
no arcs that begin at node 4. 
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1 —— 3 2) ө 
2 —+— 1 4 ——— 4 2) e 
3 =| 2 1 е 
4 e 
3 E 1 2) 3 е 
(а) (b) 
Figure 6.29 e 


In a programming language that does not support pointers, we can still achieve 
the effect of an adjacency list by using a multicolumn array (or an array of records), 
where one column contains the nodes and another column contains the array index 
of the next node on the adjacency list—a “pseudopointer.” The disadvantage of this 
approach is that the maximum amount of storage space that might be needed for an 
n-node graph must be set aside for the array; once we start to fill the array, new space 
cannot be dynamically created if we learn that there are still more adjacent nodes. 


The array-pointer representation of the graph of Figure 6.29a is shown in 
Figure 6.30. A null pointer is indicated by an array index of 0. 


Node Weight Pointer 


1 6 
2 m 
3 9 
4 0 
3 10 
6 3 2 0 
7 1 4 8 
8 4 2) 0 
9) 2 1 0 
10 4 [| 11 
11 2 В 0 
Figure 6.30 e 


In this array, row 2, representing node 2, has a pointer to index 7. At index 7 
of the array, we find node 1 with weight 4, representing the arc of weight 4 from 
node 2 to node 1. The pointer to index 8 says that the adjacency list for node 2 has 
more entries. At index 8, we learn that there is an arc from 2 to 4 of weight 2, and 
that this completes the adjacency list for node 2. 
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Isomorphic Protein Graphs 


The question of whether two graphs are isomorphic 
may seem to be of only academic interest, but in fact it 
is an important question in modern biological research. 
We are all familiar with the concept that the DNA in 
our cells carries our genetic information. But each cell 
also contains thousands of proteins, long chains of var- 
ious kinds of amino acids. There are up to 20 different 
kinds of amino acids, and the sequence of amino ac- 
ids determines the three-dimensional shape of a given 
protein as well as its function within the cell. Common 
categories of proteins according to their functions in- 
clude the following: 


Enzymes: Initiate chemical processes within the 
cell and form new molecules 

Antibodies: Recognize and defend against foreign 
particles such as bacteria 

Transport: Carry molecules such as oxygen 
throughout the cell and the body 

Structural: Give the cell its shape and structure 

Hormones: Transmit signals throughout the body 
to coordinate biological processes 


Proteins within a cell do not act in isolation; 
rather, they interact with other proteins in the cell to 
carry out complex tasks such as DNA replication or 
to transport oxygen throughout the cell. Protein-to- 
protein interactions (PPI) are therefore what make a 
cell function, and any change, however minor, in a 
given protein could affect the PPIs for that protein and 
thus affect the entire cell. Knowledge of cell biology 
at the PPI level is of great interest for identifying the 
underlying cause of disease and development of thera- 
peutics. 

Now consider the various proteins within a 
given cell as nodes, and two proteins that interact as 
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connected by an arc. Voila—we have a graph! And 
given the number of proteins within a cell and the num- 
ber and complexity of their interactions, it is a huge 
graph. А number of experiments have been done to 
determine what this graph looks like in various organ- 
isms, but the data gained seem to have low reliabil- 
ity. This problem has promoted interest in comparing 
PPIs across species to find commonalities that would 
reinforce information about the functionality of those 
interactions. Basically, researchers are looking for iso- 
morphic graphs (or subgraphs) between cells from two 
different species. 

The general problem of determining whether two 
graphs are isomorphic has no known polynomial-time 
solution—there is no known efficient algorithm. The 
performance of an inefficient algorithm might improve 
if certain conditions on the graphs are met: Are they 
both planar? Are they both trees? Such simplifications 
are unlikely to occur in the biological world of cell pro- 
teins. Consequently, various “heuristic” approaches 
(read “educated guesses”) are used. One of these ap- 
proaches ranks the similarities between the sequences 
of amino acids forming the proteins, one from each 
species, that might be considered as matching pairs 
for a graph isomorphism. This ranking algorithm is 
based on the iterative PageRank algorithm (invented 
by Google cofounder Larry Page) that ranks Web page 
x based on the number of pages that link to x and the 
PageRank value of those pages. 


“Comparative Analysis of Protein Networks: Hard Prob- 
lems, Practical Solutions,” Atias. N., and Sharan, R., 
Communications of the ACM, May 2012. 

http://www.ncbi.nlm.nih.gov/About/primer/genetics__ 
genome.html 

http://ghr.nlm.nih.gov/handbook/howgeneswork/protein 
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SECTION 6.1 REVIEW 


TECHNIQUES MAIN IDEAS 


e Use graph terminology. 
QJ Prove or disprove that two graphs are isomorphic. 
* Find a planar representation of a simple graph or 


prove that none exists. 
QJ Construct adjacency matrices and adjacency lists 
for graphs and directed graphs. 


EXERCISES 6.1 


1. Give the function g that is part of the formal definition of the directed graph shown. 


2. Use the graph in the figure to answer the questions that follow. 


a. 


њо 00 c 


Is the graph simple? 


. Is the graph complete? 

. Is the graph connected? 

. Can you find two paths from 3 to 6? 

. Can you find a cycle? 

. Can you find an arc whose removal will make the graph acyclic? 


g. Can you find an arc whose removal will make the graph not connected? 


3. Sketch a picture of each of the following graphs. 


a. 


b. 


Simple graph with three nodes, each of degree 2 
Graph with four nodes, with cycles of length 1, 2, 3, and 4 


c. Noncomplete graph with four nodes, each of degree 4 


4. Use the directed graph in the figure to answer the questions that follow. 


a. 
b. 
c. 


Which nodes are reachable from node 3? 
What is the length of the shortest path from node 3 to node 6? 
What is a path from node 1 to node 6 of length 8? 


5. Draw Kg. 
6. Draw К. 


* Diverse situations can be modeled by graphs. 
* Graphs can be represented in a computer by matri- 
ces or by linked lists. 


7. For each of the following characteristics, draw a graph or explain why such a graph does not exist. 


a. Four nodes of degree 1, 2, 3, and 4, respectively 


b. 


Simple, four nodes of degree 1, 2, 3, and 4, respectively 


с. Four nodes of degree 2, 3, 3, and 4, respectively 


d. Four nodes of degree 2, 3, 3, and 3, respectively 
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8. For each of the following characteristics, draw a graph or explain why such a graph does not exist. 
a. Simple graph with seven nodes, each of degree 3 
b. Four nodes, two of degree 2 and two of degree 3 
c. Three nodes of degree 0, 1, and 3, respectively 
d. Complete graph with 4 nodes each of degree 2 


9. An acquaintanceship graph is an undirected graph in which the nodes represent people and nodes a and b 
are adjacent if a and b are acquainted. 


a. The acquaintanceship graph for the IT department and the marketing department of a major corporation 
is an unconnected graph. What does this imply? 


b. The following figure represents an acquaintanceship graph for residents of an apartment building. Are 
Carl and Fletcher acquainted? How many people is SiuYin acquainted with? 


Car] ——— ————————————— Y vonne 


AO 


Moku 


BN Fletcher 
ae 


Yuri 


c. The length of the shortest path between node a and node b in an acquaintanceship graph is sometimes 
called the degree of separation between a and b. What is the degree of separation between Carl and Yuri? 


10. The “small world effect” states that the average degree of separation (see Exercise 9) in an acquaintance- 
ship graph of the whole world is 6. In other words, a path of acquaintance relationships from you to any 
other person on earth exists with, on the average, a path length of 6 (5 intermediate persons). Experiments 
in delivering hard-copy letters and e-mail messages have empirically confirmed this theory.! 


a. What are the potential implications for e-mail traffic if the small world effect holds for computer networks? 


b. What are the potential implications for epidemiology if the small world effect holds for physical con- 
tact between humans? 


11. The small world effect (see Exercise 10) has been found to be true between root words (that is, basic 
words found in a thesaurus) in the English language, with an average degree of separation equal to 3. Here 
“adjacent words” are those that are listed as synonyms in an English thesaurus. For example, “gate” and 
“commotion” are related by 3 degrees of separation, as follows: 


gate — door — flap — commotion 


Can you think of 3 degrees of separation between the following pairs of words? 
a. “star” and “sculpture” 

b. “burden” and “influence” 

c. “piano” and “significance” 


‘But more recent analyses of 721 million Facebook users, a much larger community than was available to earlier studies, suggests that the average 
number of intermediaries between persons A and B is 3.74. It’s a small world indeed, at least for Facebook users. 
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12. An idea closely related to the average degree of separation in a graph is that of clustering. The global 
clustering coefficient for a given graph is given by 


_3*Т 
t 


where Т = the number of triangles in the graph and = the number of connected node triples. 


A connected node triple is a “center” node adjacent to an unordered pair of other nodes. For example, in the 
graph of Exercise 2, 3—4—5 (or 5—4—3) and 4—5-6 (6-5-4) are two such triples. Nodes that make up a triangle 
demonstrate transitivity; ifa is adjacent to b and b is adjacent to c, then a is adjacent to c. Therefore c is a ratio of 
nodes in a transitive threesome to all nodes in a threesome. (One might think of this in terms of a social network 
as the probability that if you are a "friend" of mine and x is a “friend” of yours, then x is also a “friend” of mine.) 


a. Consider the graph in Figure 6.28 and the graph for Exercise 2. Which do you think has the higher 
clustering coefficient? 


b. Compute the clustering coefficient for the graph in Figure 6.28 
c. Compute the clustering coefficient for the graph for Exercise 2. 
13. Which of the following graphs is not isomorphic to the others, and why? 


(a) (b) (с) 


14. Which of the following graphs is not isomorphic to the others, and why? 


(a) (b) (c) (d) (e) 


For Exercises 15—20, decide if the two graphs are isomorphic. If so, give the function or functions that establish 
the isomorphism; if not, explain why. 


15; 1 


ад а е? 


(а) (b) 
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16. 2 
d» a6 
1 3 
ay a7 
5 ав. 4 а 
(а) (b) 
17. 1 
a 
5 2 , b 
4 3 d с 
(а) (b) 
18. 1 
а Ь [s 
6 2 
5 3 
f e d 
4 
(a) (b) 
19. 
1 а 
2 3 
е b 
4 
f 
5 6 d c 
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(a) (b) 


21. Prove that two graphs are not isomorphic if one of them 


22. 
23: 
24. 
25. 
26. 


27: 
28. 


29. 


а. 


has more nodes than the other. 


b. has more arcs than the other. 

c. has parallel arcs and the other does not. 
d. 
е 
f. 
5 


has а loop and the other does not. 


. has a node of degree k and the other does not. 


is connected and the other is not. 


. has a cycle and the other does not. 


Draw all the nonisomorphic, simple graphs with two nodes. 


Draw all the nonisomorphic, simple graphs with three nodes. 


Draw all the nonisomorphic, simple graphs with four nodes. 


Find an expression for the number of arcs in K, and prove that your expression is correct. 


Verify Euler's formula for the following simple, connected, planar graph. 


Prove that К, ; is a planar graph. 


Prove that the following graph is a planar graph. 


<> 


If a simple, connected, planar graph has six nodes, all of degree 3, into how many regions does it divide 
the plane? 
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30. If all the nodes of a simple, connected, planar graph have degree 4 and the number of arcs 15 12, into how 
many regions does it divide the plane? 


3 


[um 


. Does Euler's formula (Equation (1) of the theorem on the number of nodes and arcs) hold for nonsimple 
graphs? What about inequalities (2) and (3) of the theorem? 


32. What is wrong with the following argument that claims to use elementary subdivisions to turn a nonplanar 
graph into a planar graph? 

In a nonplanar graph there must be two arcs a; and a; that intersect at a point v that is not a node. Do 

an elementary subdivision on a; with an inserted node at v and an elementary subdivision on a; with an 


inserted node at v. In the resulting graph, the point of intersection is a node. Repeat this process with any 
non-node intersections; the result is a planar graph. 


For Exercises 33—36, determine whether the graph is planar (by finding a representation where arcs intersect 
only at nodes) or nonplanar (by finding a subgraph homeomorphic to К; or К; з. 


34. 36. 


For Exercises 37-42, write the adjacency matrix for the given graph. 


37. 5 39. 1 


38. 40. 1 4 5 


N 
чө 
a 
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41. 


= 


42. 1 
e 3 
28 4 
2 4 
Y 
3 


For Exercises 43—46, draw the graph represented by the adjacency matrix. 


43.0 2 0 45. 


0 1 1 1 0 

ду 9 10001 

0 2 0 100 0 1 

100 0 1 

D 1 1 1 0 

44.70 то о о о 46. [0 1 0 0 1 

101000 10 1 0 0 

0 1 1 1 0 0 0 1.0 1 0 

00 1000 00 1 0 1 

000002 1001 0 
000020 


47. The adjacency matrix for an undirected graph is given in lower triangular form by 


2 

1 0 

0 1 1 

0 1 2 0 


Draw the graph. 
48. The adjacency matrix for a directed graph is given by 


— © © © © 
© © © © - 
о-о 
© © о © 
© ю © © © 


Draw the graph. 
49. Describe the graph whose adjacency matrix is /,, the n X n identity matrix. 


50. Describe the graph whose adjacency matrix is 0,, the n X n matrix of all 0’s. 


no 


51. Describe the adjacency matrix for K,, the simple, complete graph with n nodes. 


no 
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52. Given the adjacency matrix A for a directed graph G, describe the graph represented by the adjacency 
matrix A’ (see Exercise 15 in Section 5.7). 


For Exercises 53—58, draw the adjacency list representation for the indicated graph. 


53. Exercise 37 
54. Exercise 38 
55. Exercise 39 
56. Exercise 40 
57. Exercise 41 
58. Exercise 42 


59. Refer to the accompanying graph. 91 
a. Draw the adjacency list representation. 


b. How many storage locations are required for the adjacency list? (A pointer takes i2 
one storage location.) 


c. Howmany storage locations would berequired in an adjacency matrix for this graph? 


60. Draw the adjacency list representation for the following weighted directed graph. у / d 
1 3 2 56 y 6 
1 1 [| 
3 2 4 


61. For the directed graph of Exercise 42, construct the array-pointer representation. 
62. For the weighted directed graph of Exercise 60, construct the array-pointer representation. 
63. Draw the undirected graph represented by the following adjacency list. 


1 >| 2 ——3 | e 

2 —— 1 —— 3 = 4 —— 4 | ө 
3 = 1 —— 2 —— 3 —— 4 | o 
4 -—— 2 + 2 ——— 3 | o 


1 ө 

2 —— 1 ° 

3 = 2 4—4 4 е 
4 —— 1 4—4 2 е 
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Exercises 65—72 refer to the complement of a graph. If G is a simple graph, the complement of С, denoted С', 
is the simple graph with the same set of nodes as G, where nodes х—у are adjacent in G’ if and only if they are 
not adjacent in G. 


65. 
66. 
67. 
68. 


69. 


70. 


7 


= 


35. 


76. 


Draw С’ for the graph of Figure 6.18a. 

Draw К. 

Show that if two simple graphs С; and G, are isomorphic, so are their complements С and С». 
Asimple graph is se/f-complementary if it is isomorphic to its complement. Prove that in a self-complementary 
graph with п nodes (n > 1), n = 4k orn = 4k + 1 for some integer К. (Hint: Use the result of Exercise 25.) 
Prove that in any simple graph G with at least two nodes, if G is not connected, then G' is connected. 
(Hint: If G is not connected, then G consists of a collection of “disjoint” connected subgraphs.) 

Find a simple graph G with at least two nodes where both G and G’ are connected, thus showing that the 
converse of Exercise 69 is false. 


. Given an adjacency matrix A for a simple graph С, describe the adjacency matrix for G”. 
72. 
73. 
74. 


Prove that if |N| = 11 in a simple, connected graph С, then not both G and G” can be planar. 
Prove that in any simple graph С with n nodes and a arcs, 2a = п? — n. 


Prove that a simple, connected graph with n nodes has at least n — 1 arcs. (Hint: Show that this can be 
restated as “А simple, connected graph with т arcs has at most m + 1 nodes." Then use the second prin- 
ciple of induction on m.) 


Prove that a simple graph with n nodes (n > 2) and more than C(n — 1, 2) arcs is connected. (Hint: Use 
Exercises 69 and 74.) 


Euler's formula is stated for simple, connected planar graphs, but in fact the word 
“simple” could be omitted. 


a. Anon-simple graph is a simple graph with loops or parallel arcs added. Prove that 
any simple connected planar graph remains a connected planar graph if parallel 
arcs or loops are added. (Hint: Temporarily erase the parallel arcs and loops.) 


b. Prove that the graph in the figure is a planar graph. 


c. Prove that Euler's formula holds for the graph of part (b) when drawn in its 
planar form. 

d. Prove that a connected planar graph with parallel arcs or loops obeys Euler's formula when drawn in 
its planar form. 


Exercises 77—86 refer to the problem of graph colorability. The origin of graph-coloring problems is a map- 
coloring problem: Suppose that a map of various countries, drawn on a sheet of paper, is to be colored so that 
no two countries with a common border have the same color. (We need not worry about countries that meet 
only at a point, and we will assume that each country is *connected.") What is the minimum number of colors 
required to carry out this task for any map? 


TT. 


78. 


Show that a coloring of the accompanying map requires three colors and no more than three colors. 


Draw a map that requires four colors. 
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79. Associated with any map is a graph, called the dual graph for the map, formed as follows: Put one node 
in each region of the map and an arc between two nodes representing adjacent countries. 


a. Draw the dual graph for the map of Exercise 77. 
b. Draw the dual graph for the following map. 


c. Draw a map for which the following graph would serve as the dual. 


80. A coloring of a graph is an assignment of a color to each node of the graph in such a way that no two adja- 
cent nodes have the same color. The chromatic number of a graph is the smallest number of colors needed 
to achieve a coloring. Find the chromatic number of the following graphs. 


a. b. 


81. At least four colors are required to solve the general map-coloring problem (see Exercise 78). Because no 
one could produce a map requiring more than four colors, the conjecture was formulated that four colors 
are indeed sufficient. This conjecture became known as the four-color problem. It was first proposed to 
the mathematician Augustus De Morgan by one of his students in 1852, and it subsequently received 
much attention. It remained unproved, however, for over a hundred years. In 1976 two mathematicians at 
the University of Illinois, Wolfgang Haken and Kenneth Appel, used a computer to work through a large 
number of cases in a proof by contradiction, thus verifying the four-color conjecture. 

The dual graph for a map (see Exercise 79), by the way it is constructed, will always be simple, con- 
nected, and planar. In addition, any simple, connected, planar graph can be viewed as the dual graph of 
a map. Restate the four-color conjecture in terms of the chromatic number (see Exercise 80) of a graph. 


82. Prove that in a simple, connected, planar graph with three or more nodes, there is at least one node with 
degree less than or equal to 5. (Hint: Use a proof by contradiction.) 


83. (Challenging problem) The five-color theorem states that the chromatic number for any simple, con- 
nected, planar graph is at most 5. While the four-color theorem (Exercise 81) is very difficult to prove, the 
five-color theorem can be proved by induction on the number of nodes in the graph. Prove the five-color 
theorem, making use of the result in Exercise 82. 
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84. The six-color theorem can be proved as a map-coloring problem without using the dual graph. Instead 


85. 


86. 


of creating the dual graph, put nodes at the intersections of boundaries and straighten the boundaries 
of regions so that the problem of coloring the map shown in figure (a) is represented by the problem of 
coloring the enclosed regions of the graph in figure (b). First assume that no country has a hole in 
it. Then the graph will be loop-free, planar, and connected. Also, every node will have degree at 
least 3. 


(a) (b) 


a. Show that the graph can be assumed to be simple by proving that if six colors are sufficient for coloring 
a simple graph, they are sufficient for a graph with parallel arcs as well. (Hint: Use temporary small 
countries at nodes.) 


b. Prove that in a simple, connected, planar graph with R enclosed regions, n — a + R = 1. 


c. Consider a simple, connected, planar graph and assume that every enclosed region has at least six edges 
adjacent to it. Show that 2a = Зи — 3. 


d. Now consider a simple, connected, planar graph where every node has degree at least 3. Show that such 
a graph has at least one enclosed region with no more than five adjacent edges. 


e. Prove that six colors are sufficient to color any planar map where no country has a hole in it. 

f. Prove that six colors are sufficient to color any planar map. (Hint: Cut some temporary slits in the 
map.) 

Five political lobbyists are visiting seven members of Congress (labeled A through G) on the same day. 

The members of Congress the five lobbyists must see are 


1. A,B, D 
2. B,C, F 
3. A, B, D, G 
4. E,G 

5. D,E, F 


Each member of Congress will be available to meet with lobbyists for one hour. What is the minimum 
number of time slots that must be used to set up the one-hour meetings so that no lobbyist has a conflict? 
(Hint: Treat this as a graph-coloring problem.) What if lobbyist 3 discovers that she does not need to see 
B and lobbyist 5 discovers that he does not need to see D? 


In a multiprocessor machine, six processors labeled A through F share blocks in a common data store. Two 
processors cannot simultaneously write to the same block. The following table shows which processors 
will write to the data store at the same time. How many distinct blocks are needed? (Hint: Treat this as a 
graph-coloring problem.) 

A, F,C 

B,D 

F,D,A 

B,E 

Е, С, Е 
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SECTION 6.2 | TREES AND THEIR REPRESENTATIONS 


Tree Terminology 


A special type of graph called a tree turns out to be a very useful representation 
of data. 


Ф DEFINITION TREE 
A tree is an acyclic, connected graph with one node designated as the root of the 
tree. 


Figure 6.31 pictures two trees. Perversely, computer scientists like to draw 
trees with the root at the top. An acyclic connected graph with no designated root 
node is called a nonrooted tree or a free tree. (Again, terminology is nonstan- 
dard. Some books define trees as acyclic, connected graphs and then call them 
"rooted trees" when there is a designated root node.) 


(a) (b) 
Figure 6.31 


A tree can also be defined recursively. A single node is a tree (with that node 
as its root). If Т, D, ... , T, are disjoint trees with roots 7), Р, ..., Fp the graph 
formed by attaching a new node r by a single arc to each of ri, 7%, ...,r, 18 a 
tree with root r. The nodes r4, r; ... , r, are children of r, and r is a parent of 
Fi, V, ... , F} Figure 6.32 shows the final step in the recursive construction of the 
tree in Figure 6.31b. It is often helpful to process a tree structure by working with 
it recursively, treating the subtrees as smaller tree objects. 


Figure 6.32 
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Because a tree is a connected graph, there is a path from the root to any 
other node in the tree; because the tree is acyclic, that path is unique. The depth 
of a node in a tree is the length of the path from the root to the node; the root 
itself has depth 0. The depth (height) of the tree is the maximum depth of any 
node in the tree; in other words, it is the length of the longest path from the root 
to any node. A node with no children is called a leaf of the tree; all non-leaves 
are internal nodes. A forest is an acyclic graph (not necessarily connected); 
thus a forest is a disjoint collection of trees. Figures 6.31a and 6.31b together 
form a forest. 

Binary trees, where each node has at most two children, are of particular 
interest. In a binary tree, each child of a node is designated as either the left child 
or the right child. A full binary tree occurs when all internal nodes have two 
children and all leaves are at the same depth. Figure 6.33 shows a binary tree of 
height 4, and Figure 6.34 shows a full binary tree of height 3. A complete binary 
tree is an almost full binary tree; the bottom level of the tree is filling from left to 
right but may not have its full complement of leaves. Figure 6.35 shows a complete 
binary tree of height 3. (Note that while a tree is a graph, a complete tree is not a 
complete graph!) 


Figure 6.33 Figure 6.34 Figure 6.35 


PRACTICE 18 | Answer the following questions about the binary tree shown in Figure 6.36. (Assume that 
node 1 is the root.) 


4 к) 
Figure 6.36 


a. What is the height? 
b. What is the left child of node 2? 
c. What is the depth of node 5? m 
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Applications of Trees 


Decision trees were used to solve counting problems in Chapter 4 and will be used 
in Section 6.3 to help establish lower bounds on the work for certain algorithms. 
Exercise 43 of Section 5.1 describes the organization of data into a binary tree 
structure. By using these trees, a collection of records can be efficiently searched 
to locate a particular record or to determine that a record is not in the collection. 
Examples of such a search would be checking for a volume in a library, for a 
patient's medical record in a hospital, or for an individual's credit record at the 
bank. We will also look at binary tree search in Section 6.3. The derivations of 
words in certain formal languages will be shown as trees in Chapter 9 (these are 
the parse trees generated by a compiler while analyzing a computer program). 

A family tree is usually, indeed, a tree, although if there were intermarriages, 
it would be a graph but not a tree in the technical sense. (Information obtained 
from a family tree is not only interesting but also useful for research in medical 
genetics.) The organization chart indicating who reports to whom in a large com- 
pany or other enterprise is usually a tree (Figure 6.37). 


Board of 
Regents 


University 
President 


C 


Academic Vice Administrative 
President Vice President 


=== л —À 


Dean of Dean of Arts Dean of Dean of 
Business and Sciences Engineering Agriculture 


m 


Computer 
Science Chair 


Computer Science 
Faculty Member 


History Chair 


Figure 6.37 


Folders and files on your computer are organized in a hierarchical (treelike) 
structure. In Figure 6.38, the CSCI 34000 tree (folder) has been expanded to show 
two subtrees; the Web Materials subtree has been expanded to show additional 
folders, the Activities subtree has not been expanded. 
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File Edit View Tools Help 
4 CSCI 34000 
i 4| _Web Maierials 
|. Assignments 
|. Course Info 
|. Course Management 
|. Slides 
| Activities 


Figure 6.38 


A computer virus is spread via e-mail. Each second, 4 new machines are infected. 
А 4-ary tree structure (Figure 6.39) represents the spread of the virus. By the 
multiplication principle, 4" machines have been infected after n seconds. 


Figure 6.39 


Algebraic expressions involving binary operations can be represented by la- 
beled binary trees. The leaves are labeled as operands, and the internal nodes 
are labeled as binary operations. For any internal node, the binary operation of 
its label is performed on the expressions associated with its left and right sub- 


trees. Thus the binary tree in Figure 6.40 represents the algebraic expression 
(2. 9-23) = (3o 3). 


+ * 
2 3$ ye 3 
Figure 6.40 e 


PRACTICE 19 | What is the expression tree for (2 + 3) * 5? 
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Binary Tree Representation 


Because a tree is also a graph, the representations discussed in Section 6.1 for graphs 
in general can also be used for trees. Binary trees, however, have special character- 
istics that we want to capture in the representation, namely, the identity of the left 
and right child. The equivalent of an adjacency matrix is a two-column array (or 
an array of records) where the data values for each node are the left and right child 
of that node. The equivalent of the adjacency list representation is a collection of 
records with three fields containing, respectively, the current node, a pointer to the 
record for the left-child node, and a pointer to the record for the right-child node. 


For the binary tree shown in Figure 6.41, the left child-right 1 
child array representation is given in Figure 6.42a. Zeros again 

indicate null pointers. The pointer representation is given in 

Figure 6.42b. 2 3 


4 5 6 


Figure 6.41 
Left child Right child 


1 | 2 3 
[1 
2 4 5 РА iy 
3 0 6 
2 5 o 
4 0 0 7 X by 
3 0 0 
4 ө е 5 е ө 6 ө е 
6 0 0 
(а) (b) 
Figure 6.42 ө 
PRACTICE 20 | For the binary tree in Figure 6.43 
1 
a. Give the left child-right child array representation. 
b. Give the pointer representation. 
2 
3 4 
5 


Figure 6.43 п 
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Tree Traversal Algorithms 


Ifa tree structure is being used to store data, it is often helpful to have a systematic 
mechanism for writing out the data values stored at all the nodes, which can be 
accomplished by traversing the tree, that is, visiting each of the nodes in the tree 
structure, The three common tree traversal algorithms are preorder, inorder, and 
postorder traversal. 

In the three traversal methods, it is helpful to use the recursive view of a 
tree, where the root of a tree has branches down to roots of subtrees. We will 
therefore assume that a tree T has a root ғ; any subtrees are labeled left to right as 
Ti, To, ... , T, (Figure 6.44). Because we are using a recursive definition of a tree, 
it will be easy to state the tree traversal algorithms in recursive form. 


Figure 6.44 


The terms preorder, inorder, and postorder refer to the order in which the root 
of a tree is visited compared to the subtree nodes. In preorder traversal, the root 
of the tree is visited first and then the subtrees are processed left to right, each in 
preorder. 


ALGORITHM 


PREORDER 


Preorder (tree T) 
//Writes the nodes of a tree with root r in preorder 


write(r) 
for i = 1 to do 
Preorder(T,) 
end for 
end Preorder 
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In inorder traversal, the left subtree is processed by an inorder traversal, 
then the root is visited, and then the remaining subtrees are processed from left to 
right, each in inorder. If the tree is a binary tree, the result is that the root is visited 
between processing of the two subtrees. In a nonbinary tree, if there is a single 
subtree joined to its parent by a vertical arc, that is considered the left subtree and 
there are no additional subtrees. 


ALGORITHM 


INORDER 


Inorder (tree T) 
//Writes the nodes of a tree with root r in inorder 


Inorder(T;) 
write(r) 
for i = 2 totdo 
Inorder(T,) 
end for 
end /norder 


Finally, in postorder traversal, the root is visited last, after all subtrees have 
been processed from left to right in postorder. 


ALGORITHM 


POSTORDER 


Postorder (tree T) 
//Writes the nodes of a tree with root r in postorder 


for į = 1 їо do 
Postorder(T;) 
end for 
write(r) 
end Postorder 


For the binary tree of Figure 6.45, the preorder traversal algorithm (root, left, 
right) says to write the root first, a, and then process the left subtree. At the left 
subtree, rooted at b, a preorder traversal writes the root, b, and moves again to the 
left subtree, which is the single node d. This single node is the root of a tree, so 
it is written out. Then d's left subtree (empty) and d's right subtree (empty) are 
traversed. Backing up to the tree rooted at b, its left subtree has been traversed, 
so now the right subtree 1s traversed, producing node e. The subtree rooted at 


REMINDER 


For a binary tree: 
Preorder traversal is root, 
left, right. 

Inorder traversal is left, 
root, right. 

Postorder traversal is left, 
right, root. 
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b has now been completely traversed. Backing up to a, it is time to traverse a’s 
right subtree. A preorder traversal of the tree rooted at c causes c to be written, 
then traversal goes to c’s left subtree, which results in f, Л, and i being written. 
Backing up to c, traversing c’s right subtree produces g. The subtree rooted at c 
has now been completely traversed, and the algorithm terminates. The preorder 
traversal produced 


О Gib 18, hh & 


Figure 6.45 ө 


Using the tree of Figure 6.45 again, an inorder traversal (left, root, right), travels 
down to the farthest left subtree, rooted at d. An inorder traversal here traverses the 
left subtree (empty), writes out the root, d, then traverses the right subtree (empty). 
Backing up the tree to Б, b’s left subtree has been traversed, so it is time to write 
out the root, b. Proceeding then to b’s right subtree, e is written. Backing up to a, 
a’s left subtree has been traversed, so the root, a, is written out. Proceeding to a’s 
right subtree, an inorder traversal says to go to the farthest left subtree first, which 
would cause the Л to be written. After that, fand i are written, then the root c, then 
the right subtree of c, which is g. The nodes are therefore written as 


GL I9 G0) 00, 9 
A postorder traversal (left, right, root) would produce 
o e by, б, bj & GE ө 


Consider the tree shown in Figure 6.46, which is not a binary tree. A preorder tra- 
versal first writes the root a and then does a preorder traversal on the left subtree, 
rooted at Б. The preorder traversal of this subtree writes out Б and then proceeds 
to a preorder traversal of the left subtree of b, which is rooted at d. Node d is 
written and then a preorder traversal of the left (the only) subtree of d, which 
is rooted at i, is done. After writing out i, the traversal backs up to consider any 
other subtrees of d; there are none. 

Backing up to 5, there are other subtrees of 5. Processing these left to right, 
nodes e and then f'are written. All the subtrees of b have now been traversed; back- 
ing up to node a to look for subtrees farther to the right reveals a subtree rooted at 
c. The algorithm writes out the root c, then moves to its leftmost subtree rooted at 
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a 


Figure 6.46 


g and writes out g. Processing the subtrees of g, the nodes j and k are written; then 
backing up to c, its remaining subtree is processed, producing Л. Node c has no 
other subtrees; backing up to a, a has no other subtrees, and the algorithm termi- 
nates. The list of nodes in preorder traversal is 


ab,diefcgjikh 


To do an inorder traversal of the tree in Figure 6.46, process left subtrees 
first, leading down to node i, which has no subtrees. Therefore i is written out. 
Backing up to d, the left (only) subtree of d has been traversed, so d is written out. 
Since node d has no further subtrees, the algorithm backs up to b. The left subtree 
of b has been processed, so b is written out and then its remaining subtrees are 
traversed, writing out e and f. Backing up to a, a is written out, and then the right 
subtree of a is processed, leading to nodes j, g, К, c, and Л, in that order, and we are 
done. Thus the inorder list of nodes is 


th Oh 1 f а th & 06 Qle 
The following list of nodes results from a postorder traversal: 


OL Gib bb pis & 18 @ @ ө 


PRACTICE 21 | Do a preorder, inorder, and postorder traversal of the tree in Figure 6.47. 
a 


Figure 6.47 L| 


EXAMPLE 28 
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Example 23 showed how algebraic expressions can be represented as binary trees. 
If we do an inorder traversal of the expression tree, we retrieve the original alge- 
braic expression. For the expression tree of Figure 6.48, for example, an inorder 
traversal gives the expression 


(2+x)*4 


where the parentheses are added as we complete the processing of a subtree. This 
form of an algebraic expression, where the operation symbol appears between the 
two operands, is called infix notation. Parentheses are necessary here to indicate the 
order of operations. Without parentheses, the expression becomes 2 + x * 4, which 
is also an infix expression but, because of the order of precedence of multiplication 
over addition, is not what is intended. 


2 x 
Figure 6.48 
A preorder traversal of Figure 6.48 gives the expression 
ж + 2х4 


Here the operation symbol precedes its operands. This form of an expression is 
called prefix notation, or Polish notation.” The expression can be translated into 
infix form as follows: 


*+2x4>*(2+x47(24+x)*4 
A postorder traversal gives the expression 
De ae Ghee 


where the operation symbol follows its operands. This form of an expression is 
called postfix notation, or reverse Polish notation (or just RPN). The expression 
can be translated into infix form as follows: 


2x 4s (2 + x4s— (2 + x)*4 


Neither prefix nor postfix form requires parentheses to avoid ambiguity. These 
notations therefore provide more efficient, if less familiar, representations of alge- 
braic expressions than infix notation. Such forms can be evaluated sequentially, 
with no need for “look-ahead” to locate parenthesized expressions. Compilers 
often change algebraic expressions in computer programs from infix to postfix 
notation for more efficient processing. ө 


?Named for the Polish logician J. Lukasiewicz, who first used it. 
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PRACTICE 22 | Write the expression tree for 


а (рес) 


and write the expression in prefix and postfix notation. и 


Results about Trees 


Trees are fertile ground (no pun intended) for proofs by induction, either on the 
number of nodes or arcs or on the height. 


After drawing a few trees and doing some counting, it appears that the number 
of arcs in a tree is always one less than the number of nodes. More formally, it 
appears that 


A tree with n nodes has n — | arcs. 


We will prove this statement using induction on n, n = 1. For the base case, n = 1, 
the tree consists of a single node and no arcs (Figure 6.49), so the number of arcs 
is | less than the number of nodes. 


ө iom ll res) 


Figure 6.49 Figure 6.50 


Assume that any tree with К nodes has k — 1 arcs, and consider a tree with 
К + 1 nodes. We want to show that this tree has К arcs. Let x be a leaf of the tree 
(a leaf must exist since the tree is finite). Then x has a unique parent. Remove from 
the tree the node x and the single arc a connecting x and its parent (Figure 6.50). 

The remaining graph is still a tree and has К nodes. Therefore, by the induc- 
tive hypothesis, it has k — 1 arcs, and the original graph, containing arc a, had 
(К — 1) + 1 = k arcs. The proof is complete. e 


Notice that in the inductive proof of Example 29 we had to support the proof 
with many more words than we used in some of our early inductive proofs. In 
Example 15 in Chapter 2, for instance, the inductive proof that 


Ld 24-2 pup eS eg 
consisted mainly of manipulating the mathematical expressions in this equation, 


but now we have to do more verbal reasoning. Words are not only acceptable in a 
proof, they may form the major part of the proof. 
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The inductive proof of Example 29 differs in another way from proofs like that 
of Example 15 in Chapter 2. In those proofs, there was always a single term in the 
series (the last term) whose removal would lead to the “Р(Ё) case,” the inductive hy- 
pothesis. In proofs involving trees with k + 1 nodes, which node should be removed 
to generate the Р(Х) case? Usually the node to remove is not unique, but it is not com- 
pletely arbitrary either. In the proof of Example 29, for instance, removing a non-leaf 
node (and the arcs attached to it) from a tree with A + 1 nodes would result in a graph 
with k nodes, but not a tree with k nodes, so the inductive hypothesis would not apply. 


PRACTICE 23 | Prove that in any tree with п nodes, the total number of arc ends is 2n — 2. Of course this 
result follows directly from Example 29 because n — 1 arcs means 2(n — 1) = 2n — 2 
arc ends, but do a proof by induction on the number of nodes. 


Recall that a tree T can also be constructed recursively by hooking a root node to a 
collection of subtrees T}, ... , T, (see Figure 6.44). This allows us to use structural 
induction, discussed in Chapter 3, to prove certain results about trees. We'll use 
structural induction to prove the result of Example 29, namely 


A tree with п nodes has n — 1 arcs. 
The base case where и = 1 is the same as before. Assume that any subtree Т; with 


n; nodes has n; — 1 arcs. The tree 7T constructed from subtrees Т}, ..., T, has n 
nodes where 


(The extra ¢ counts the arcs from the root of T to the ¢ subtrees.) 
By the inductive hypothesis, a; — n; — 1 for each subtree, so 


& 
Il 


t $ H t ІД 
і= 1 і= 1 i=l i-l і= 1 


t 


Snn | 


ї=1 ө 


PRACTICE 24 | Prove that in any tree with n nodes, the total number of arc ends is 2n — 2. Use structural 
induction. 
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Sometimes a clever observation can take the place of an inductive proof. Every arc 
of a tree connects one node to its parent. Each node of the tree except the root has 
a parent, and there are л — 1 such nodes, therefore n — 1 arcs. Each arc has two 
arc ends, so there are 2(n — 1) arc ends. e 


SECTION 6.2 REVIEW 


TECHNIQUES 


* Construct expression trees. 
* Construct array and pointer representations for 
binary trees. 
Ww) Conduct preorder, inorder, and postorder traversals 
of a tree. 


EXERCISES 6.2 


MAIN IDEAS 


Binary trees can be represented by arrays and by 
linked structures. 

Recursive procedures exist to systematically visit 
every node of a tree. 


1. Which of the following graphs are trees with root r? If a graph is a tree, draw it in a more conventional 


way. If not, say what property fails. 


а. e——e——e——er 


T 


2. Which of the following graphs are binary trees with root r? If the graph is not a binary tree, say what 


property fails. 


a. r b. с. а. r 
OY : 
F 


3. Sketch a picture of each of the following trees. 
a. Tree with five nodes and depth 1 
b. Full binary tree of depth 2 


c. Tree of depth 3 where each node at depth i has i + 1 children 
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4. Answer the following questions about the accompanying graph with node a as the root. 
a. Is it a binary tree? 
b. Is it a full binary tree? 
c. Is ita complete binary tree? 
d. What is the parent of e? 
e. What is the right child of e? 
f. What is the depth of g? 
g. What is the height of the tree? 


In Exercises 5—8, draw the expression tree. 
5. [(Q*x—3*y)+4*z]4+1 
6. [x — 2) *3] + (5 + 4) 
7.1- (2 – [3 – (4 – 5] 
8 
9 


. [66/2) +4] + [(1 + х) * (5 + 3)] 
. Write the left child—right child array representation for the binary tree in the figure. 
1 


7 


10. Write the left child—right child array representation for the binary tree in the figure. 


8 9 10 1112 13 14 15 
11. Draw the binary tree corresponding to the left child—right child representation that follows. (1 is the root.) 
Left child Right child 


1 2 3 


әз с t A W N 
о о О с л Ts 
с © (е ы е о 
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12. Draw the binary tree corresponding to the left child—right child representation that follows. (1 is the root.) 


Left child Right child 
1 2 0 


4 
0 
6 
0 
0 


nH л A U N 
с-|© | |© чо 


13. Write the left child—right child array representation for the binary search tree that is created by processing 
the following list of words: “АП Gaul is divided into three parts” (see Exercise 43 of Section 5.1). Also 
store the name of each node. 


14. Write the left child—right child array representation for the binary search tree that is created by processing 
the following list of words: “We hold these truths to be self-evident, that all men are created equal" (see 
Exercise 43 of Section 5.1). Also store the name of each node. 

15. ш the following binary tree representation, the left child and parent of each node are given. Draw the 
binary tree. (1 is the root.) 


Left child Parent 


1 2 0 
2 4 1 
3 0 1 
4 0 2 
5 0 2 
6 0 3 


16. The following represents a tree (not necessarily binary) where, for each node, the leftmost child and the 
closest right sibling of that node are given. Draw the tree. (1 is the root.) 


Left child Right sibling 


1 2 0 


о м QN tan A U N 
о |о о |о о |o tn 
© |© INIA со рь чо 
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17. a. For the following tree, write the leftmost child-right sibling array representation described in 
Exercise 16. 


b. Now draw the binary tree that results from treating the answer to part (a) as a left child—right 
child binary tree representation. An arbitrary tree can thus be thought of as having a binary tree 
representation. 


18. The following binary tree is the representation of a general tree (as in part (b) of Exercise 17). Draw the 
tree. 


For Exercises 19—24, write the list of nodes resulting from a preorder traversal, an inorder traversal, and a 
postorder traversal of the tree. 


19. a 20. a 


25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 
33. 
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Write in prefix and postfix notation: 3/4 + (2 — y). 


Write in prefix and postfix notation: (x * y + 3/z) * 4. 
Write in infix and postfix notation: — * + 23*6x 7. 
Write in infix and postfix notation: — + — хуту. 
Write in prefix and infix notation: 47 x — *2 +. 
Write in prefix and infix notation: x 2 + yz * — /. 
Evaluate the postfix expression 82/23 * +. 
Evaluate the postfix expression 5 3 + 13 + /7 s. 


Draw a single tree whose preorder traversal is 
a,b,c,d, e 
and whose inorder traversal is 
b,a,d,c,e 


. Draw a single tree whose inorder traversal is 

Ља, е, Б, h, d, i, с, j, e 
and whose postorder traversal is 

ђе, а, h, i, d,j, e, с, b 


. Find an example of a tree whose inorder and postorder traversals yield the same list of nodes. 

. Find two different trees that have the same list of nodes under a preorder traversal. 

. Informally describe a recursive algorithm to compute the height of a binary tree, given the root node. 

. Informally describe a recursive algorithm to compute the number of nodes in a binary tree, given the root node. 
. Prove that a simple graph is a nonrooted tree if and only if there is a unique path between any two nodes. 


40. 


4 


— 


42. 


43. 


44 


46. 


4T. 


48. 
49. 
50. 


51, 
52; 


53. 


54. 
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What is the minimum number of nodes and arcs that need to be deleted to reduce a full binary tree of 
height = 2 to a forest of 4 binary trees? 


. Let G be a simple graph. Prove that G is a nonrooted tree if and only if G is connected and if the removal 


of any single arc from G makes G unconnected. 


Let G be a simple graph. Prove that G is a nonrooted tree if and only if G is connected and the addition of 
one arc to G results in a graph with exactly one cycle. 


Prove that a binary tree has at most 2“ nodes at depth d. 


. Prove that a tree with n nodes, n = 2, has at least two nodes of degree 1. 
45. 


a. Draw a full binary tree of height 2. How many nodes does it have? 

b. Draw a full binary tree of height 3. How many nodes does it have? 

с. Conjecture how many nodes there are in a full binary tree of height Л. 

Prove your conjecture from Exercise 45(c) three different ways. 

a. Use induction on the height / of the full binary tree. (Hint: Use Exercise 43.) 

b. Add up the nodes at each level of the tree (Hint: Use Exercise 43). 

c. Use structural induction. 

a. Prove that a full binary tree with x internal nodes has 2x + 1 total nodes. 

b. Prove that a full binary tree with x internal nodes has x + 1 leaves. 

c. Prove that a full binary tree with n nodes has (n — 1)/2 internal nodes and (n + 1)/2 leaves. 

Prove that the number of leaves in any binary tree is 1 more than the number of nodes with two children. 
Find an expression for the height of a complete binary tree with n nodes. (Hint: Use Exercise 45.) 

Prove that in the pointer representation of a binary tree with n nodes there are и + 1 null pointers. 
(Hint: Use Exercise 48). 

Find the chromatic number of a tree (see Section 6.1, Exercise 80). 

Let E be the external path length of a tree, that is, the sum of the path lengths to all the leaves. Let I be the 
internal path length, that is, the sum of the path lengths to all the internal nodes. Let i be the number of 
internal nodes. Prove that in a binary tree where all internal nodes have two children, E = J + 21. 

Let B(n) represent the number of different binary trees with и nodes. 


a. Define В(0) to have the value | (there is one binary tree with 0 nodes). Prove that B(n) is given by the 
recurrence relation 


B()-1 
n-l1 


B(n) = У В(ЮВ(п — 1 — k) 
k=0 


b. Compare the sequence B(n) to the sequence of Catalan numbers (Exercise 97, Section 4.4). Write the 
closed-form expression for B(n). 


c. Compute the number of different binary trees with 3 nodes. Draw all these trees. 
d. Compute the number of different binary trees with 6 nodes. 


In the data structure known as a B-tree of order 5, each node of the tree can contain multiple data values, 
maintained in sorted order. Between and around the data values at an internal node are arcs that lead to 
children of the node. New data values are inserted into the leaf nodes of the tree, but when a leaf (or in- 
ternal node) gets up to five values, it splits in two and the median value pops up to the next level of the 
tree. The figure shows the tree at various points as the data values 1 through 8 are inserted into an initially 
empty tree. 
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1.2,3,4 1, 


a. The minimum number of data values to insert into a B-tree of order 5 to force it to have two levels is 5. 
Find the minimum number of data values required to force the tree to have three levels. 
b. Prove that when a B-tree of order 5 has the minimum number of data values to force it to have n levels, 
п = 2, the bottom level contains 2(3" *) nodes. 
c. Find (and justify) a general expression for the minimum number of data values required to force a 
3° = 3 
B-tree of order 5 to have л levels. (Hine 39 4-3 4-5-4357 7% = (=>) 
In Exercises 55 and 56, two trees are isomorphic if there is a bijection f: N, — Nz, where f maps the root of 
one tree to the root of the other and where f(y) is a child of f(x) in the second tree when y is a child of x in the 


first tree. Thus the two trees shown are isomorphic graphs but not isomorphic trees (in part (a) the root has two 
children and in part (b) it does not). These are the only two nonisomorphic trees with three nodes. 


r 


(a) (b) 


55. Draw all the nonisomorphic trees with four nodes. 
56. Draw all the nonisomorphic trees with five nodes. 


57. One of the most efficient sorting algorithms is HeapSort, which sorts an array of values into increasing 
order. To understand how the HeapSort algorithm works, it is best to imagine that the array elements are 
stored in level order as the nodes of a binary tree. Thus the values in a 7-element array that is indexed from 
0 through 6 would be stored in a binary tree with element 0 at the root, elements 1 and 2 at depth 1, and 
so on. 


A heap is a binary tree in which the value at every node is greater than the value at its two child nodes. 
HeapSort is a two-phase process. The first phase is to reorganize the tree elements into a heap (more on 
this later), and the second is to sort the heap. The key idea is that in a heap, the largest element is the root of 
the tree; its proper place in the sorted array is at the end of the unsorted section of the array (at the lowest, 
rightmost tree element not yet in its sorted position). The tree root gets thrown to the last unsorted posi- 
tion, and the element that formerly occupied that position must be inserted back into the unsorted section 
in such a way as to preserve the heap property. Consider the following binary tree, which is a heap—each 
node value is larger than the values at the two child nodes. 
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С а if m | 


The root value, s, belongs at the bottom right of the tree (the position of the last array element), and once 
positioned there, s will never be looked at again. The previous value, m, has been temporarily displaced. 


m 


C a | if 5 


To insert m back into the heap, there is a series of “competitions” between the displaced element and 
the child nodes of the empty node so that the largest value is promoted to fill the empty slot. Here m, k, 
and p compete, and p is the winner. 


Now m and f compete (remember that s is no longer considered), the largest value is m, and once m is 
inserted, the heap has been reestablished. 


@ а if s | 


a. Complete phase 2 of the process so that the array is sorted. (Hint: The next step is to throw the root 
value, p, into the position currently occupied by f. The displaced value f must then be reinserted into 
the tree.) 

b. For phase 1, to construct the heap, put the original random elements into the binary tree. Consider the 
leaf nodes to be temporarily in place, since their relative order does not matter. Beginning with the 
rightmost node in the level above the leaves and working up level-by-level to the root, temporarily 
displace that node element and then insert it using the “competition” rules described. Beginning with 
the array r, w, f, g, k, y, d, construct a heap. 
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SECTION 6.3 | DECISION TREES 


We used decision trees in Chapter 4 to solve counting problems. Figure 6.51 shows 
the tree used in Example 39 of Chapter 4 to represent the various possibilities for 
five coin tosses under the constraint that two heads in a row do not occur. Each 
internal node of the tree represents an action (a coin toss), and the arcs to the chil- 
dren of internal nodes represent the outcomes of that action (heads or tails). The 
leaves of the tree represent the final outcomes, that is, the different ways that five 
tosses could occur. 


Toss 1 


H 


Toss 2 


Toss 3 


Toss 4 


Toss 5 


шушу 
dadmam 
Б» ЕЕ е» 
maaan 
HAs 
Jdmama 
THATA 
dddmad 
mamadád 
ddmadd 
Бе ЕЕЕ 
к” ЕЕЕ 
HAHA 


Figure 6.51 


Although we have used decision trees, we haven’t given a formal definition of 
what a decision tree 15. 


* DEFINITION DECISION TREE 
A decision tree is a tree in which the internal nodes represent actions, the arcs 
represent outcomes of an action, and the leaves represent final outcomes. 


Sometimes useful information can be obtained by using a decision tree to rep- 
resent the activities of a real algorithm; actions that the algorithm performs take 
place at internal nodes, the children of an internal node represent the next action 
taken, based on the outcome of the previous action, and the leaves represent some 
sort of circumstance that can be inferred upon algorithm termination. Note that, 
unlike the trees we talked about in Section 6.2, a decision tree is not a data struc- 
ture; that is, the nodes of the tree have no data values associated with them. Nor 
are the algorithms we are representing necessarily acting on a tree structure. In 
fact, we will use decision trees in this section to learn more about algorithms for 
searching and sorting, and these algorithms act on lists of data items. 


Searching 


A search algorithm either finds a target element x within a list of elements or de- 
termines that x is not in the list. Such an algorithm generally works by making 
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successive comparisons of x to the list items. We have already seen two such algo- 
rithms, sequential search and binary search. We can model the activities of these 
algorithms by using decision trees. The nodes represent the actions of comparing x 
to the list items, where the comparison of x to the ith element in the list is denoted 
by x:L|i]. 

Sequential search only distinguishes between two possible outcomes of a com- 
parison of x to L[i]. If x = L[i], the algorithm terminates because x has been found 
in the list. If x # Lli], the next comparison performed is x:L[i + 1], regardless of 
whether x was less than or greater than Li]. The leaves of this decision tree corre- 
spond to the final outcomes, where either x is one of the list items or x is not in the list. 


| EXAMPLE? | Figure 6.52 shows the decision tree for the sequential search algorithm acting оп a 


list of five elements. 


x:L[1] 


X= DS] 
Figure 6.52 ө 


From the decision tree for a given search algorithm, we сап see that the number 
of comparisons required to reach any particular outcome (leaf of the tree) is the 
number of internal nodes from the root to that leaf. This number equals the length of 
the path from the root to that leaf. The worst case, that is, the maximum number of 
comparisons, is the maximum length of any such path, which is the depth of the tree. 

Because every decision tree for sequential search looks like Figure 6.52, it is 
clear that the depth of such a tree, for an n-element list, is n. This agrees with what 
we already know, namely, that the worst case for sequential search on a list of n 
elements is n. 

The decision tree for the binary search algorithm is more interesting. Binary 
search acts on a sorted list and distinguishes between three possible outcomes of 
the comparison: 
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x = L[i]: algorithm terminates, x has been found 
x < L[i]: algorithm proceeds to the left half of the list 
x > L[i]: algorithm proceeds to the right half of the list 


We will follow the usual custom and not write the leaf that corresponds to the 
“middle branch,” x = L[i] (see Exercise 21 for a discussion of the consequences of 
this convention). If x < L[i], the next comparison the algorithm performs is found 
at the left child of this node; if x > L[i], the algorithm’s next comparison is found 
at the right child. If no child exists, the algorithm terminates because x is not in 
the list. The tree we've described is a binary tree whose leaves represent all the 
possible outcomes where x is not in the list. There are many more failure leaves 
in binary search than in sequential search, because binary search indicates how x 
fails to be in the list (e.g., x < L[1] or L[1] < x < L[2]). 


| EXAMPLE3S | Figure 6.53 shows the decision tree for the binary search algorithm acting on a 


sorted list of eight elements. 

The worst case, that is, the maximum number of comparisons, will again be 
the depth of the tree, which is 4 in Figure 6.53. In Chapter 3 we solved a recurrence 
relation to get the worst-case behavior for binary search where л is a power of 2 
and found this to be 1 + log n (remember that we are using base 2 logarithms). 
Note that 1 + log 8 = 4, so the decision tree agrees with our previous result. The 
restriction of n to a power of 2 made the arithmetic of solving the recurrence re- 
lation simpler. If n is not a power of 2, then the depth of the tree is given by the 
expression 1 + |log n]. 


x:L[4] 


x«L|1] L[2] «x < L[3] L[4] « x « L[5] L[6] « x « L[7] 


TETTE | ese 021] 


L[3] « x « L[4] L[5] « x « L[6] 


« 


L[7]«x«L[8] x»L[S] 
Figure 6.53 ө 
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PRACTICE 25 


a. Draw the decision tree for the binary search algorithm on a sorted list of five elements. 
b. Find the depth of the tree and compare to 1 + |log 5]. a 


Lower Bounds on Searching 

We have used decision trees to represent the actions of two particular search al- 
gorithms. Such a tree could be used to represent the actions of any algorithm that 
solves the search problem by comparing the target element to the list elements. 
The internal nodes of such a tree would represent the comparisons done, and the 
depth of the tree would be the worst-case number of comparisons over all possible 
cases. What can be said about such a tree when we don’t know the particulars of 
the algorithm involved? We can say that x must be compared to every element 
in the list at least once (perhaps more than once if the algorithm is quite stupid). 
For if there is some list element that escapes being compared to x, the algorithm 
cannot say whether that element equals x and thus cannot decide with certainty 
whether x belongs to the list. Comparisons are internal nodes in the decision tree. 
Therefore, if m is the number of internal nodes in the decision tree Т, for any 
search algorithm acting on an n-element list, then m 2 и. 

Before proceeding further with decision trees, we need some additional facts 
about binary trees in general. The number of nodes at each level in a full binary tree 
follows a geometric progression: 1 node at level 0, 2! nodes at level 1, 2? nodes at level 
2, and so on. In a full binary tree of depth d, the total number of nodes is therefore 


1+42+2 +2 ++ ОРЕ 


(see Example 15 of Chapter 2). A full binary tree has the maximum number of 
nodes for a given depth of any binary tree. This gives us fact 1: 


1. Any binary tree of depth d has at most 2^*! — 1 nodes. 
Fact 2, which we’ll prove momentarily, is 
2. Any binary tree with m nodes has depth = |log m]. 


To prove fact 2, we'll use a proof by contradiction. Suppose a binary tree has m 
nodes and depth d < |log m|. Then d = |log m| — 1. From fact 1, 


m < 24+1 ==] = 2(йов т|—1) +1 __ 1 
25 21108 т] = 1 = 2log m —]2m-1 
or 
т= т – 1 


—a contradiction. Therefore d = |log m]. 

Now back to decision trees representing search algorithms on n-element 
lists. Temporarily strip the leaves from tree Т; (with m internal nodes) to create a 
new tree T, with m nodes, т = n. By fact 2, T, has depth d > [Пор m] = |log n]. 
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Therefore tree T; has depth > [log n| + 1. Because the depth of the decision 
tree gives the worst-case number of comparisons, we can state the following 


theorem. 


? THEOREM ON THE LOWER BOUND FOR SEARCHING 
Any algorithm that solves the search problem for an n-element list by comparing 
the target element х to the list items must do at least |logn| + 1 comparisons in 


the worst case. 


This theorem gives us a lower bound on the number of comparisons required 
in the worst case for any algorithm that uses comparisons to solve the search prob- 
lem. Since binary search does no more work than this required minimum amount, 
binary search is an optimal algorithm in its worst-case behavior. 


Binary Tree Search 


The binary search algorithm requires that data already be sorted. Arbitrary data 
can be organized into a structure called a binary search tree, which can then be 
searched using a different algorithm called binary tree search. To build a binary 
search tree, the first item of data is made the root of the tree. Successive items are 
inserted by comparing them to existing nodes, beginning with the root. If the item 
is less than a node, the next node tested is the left child; otherwise it is the right 
child. When no child node exists, the new item becomes the child. 


The data items 


S, ® 2 12, Щй, 149 


are to be organized into a binary search tree. Figure 6.54 shows the successive 
stages of constructing the tree. 


Figure 6.54 


10 


12 


5 
5} 
PM p 8 
2 8 
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5 
2 8 
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10 14 
9 
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A binary search tree, by the way it is constructed, has the property that 
the value at each node is greater than all values in its left subtree (the subtree 
rooted at its left child) and less than all values in its right subtree. A binary 
tree search compares item x with a succession of nodes beginning with the 
root. If x equals the node item, the algorithm terminates; if x is less than the 
item, the left child is checked next; if x is greater than the item, the right 
child is checked next. If no child exists, the algorithm terminates because x 
is not in the list. Thus the binary search tree, except for the leaves, becomes 
the decision tree for the binary tree search algorithm. (Here is a case where 
the algorithm itself is described in terms of a tree.) The worst-case number of 
comparisons equals the depth of the tree plus 1 (for the missing leaves). How- 
ever, a binary search tree for a given set of data is not unique; the tree (and 
hence the depth of the tree) depends on the order in which the data items are 
inserted into the tree. 


The data in Example 34 entered in the order 


©), 12, 10), Si, S 22, Jd 
produce the binary search tree of Figure 6.55. 


9 


2 8 10 14 
Figure 6.55 ө 


The actions performed in a binary tree search certainly resemble those in 
the “regular” binary search algorithm; in both cases the procedure is to make a 
comparison and, if unsuccessful, to look left or right (in the tree if it is a binary 
tree search or in the list if it is a binary search). It is possible to order the data for a 
binary tree search such that the search tree built from these data matches the deci- 
sion tree (minus the leaves) for a binary search of the same data in sorted order. 
This match is illustrated in Example 35 (note that the tree was not built from data 
items in sorted order). Here the binary search tree has the minimum depth and 
requires the least amount of work in the worst case. 

The depth of a binary search tree for a given set of data items can vary. The 
depth of the tree in Figure 6.54 1s 4, while that of Figure 6.55 is 2. Thus the worst- 
case number of comparisons to search for an item can also vary. The tree-building 
process can be modified to keep the tree more “balanced,” that is, short and wide 
rather than tall and skinny; such a modification reduces the depth of the tree and 
therefore the search time. Of course, we know from the theorem on the lower 
bound for searching that a certain minimum amount of work is required no matter 
how clever we are in building the tree. 
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PRACTICE 26 


a. Construct the binary search tree for the data of Example 34 entered in the order 
12; 91А шә 10.8727 


b. What is the depth of the tree? m 


Sorting 


Decision trees can also model algorithms that sort a list of items by a sequence of com- 
parisons between two items from the list. The internal nodes of such a decision tree 
are labeled L[i]:L[ j] to indicate a comparison of list item i to list item j. To simplify 
our discussion, let’s assume that the list does not contain duplicate items. Then the 
outcome of such a comparison is either L[i] < L| j] or Lli] > LL j] . If L[i] < LUZ], 
the algorithm proceeds to the comparison indicated at the left child of this node; if 
L[i] > 113], the algorithm proceeds to the right child. If no child exists, the algo- 
rithm terminates because the sorted order has been determined. The tree is a binary 
tree, and the leaves represent the final outcomes, that is, the various sorted orders. 


Figure 6.56 shows the decision tree for a sorting algorithm acting on a list of 
three elements. This algorithm is not particularly astute because it ignores the 
transitive property of < and therefore performs some unnecessary compari- 
sons. The leaves of the tree indicate the various final outcomes, including two 
cases (marked with an X) that result from contradictory information. For ex- 
ample, one X results from the following inconsistent sequence of outcomes: 
ae ALs ABLA] S> 


LL] 


АЕО ор EDI S EDU S EDI о Д др 


X L[3] < L[1] « L[2] X TET << EET << TET 
Figure 6.56 С) 


PRACTICE 27 | Draw the decision tree that would result if the algorithm of Example 36 were modified to 
eliminate unnecessary comparisons. 


Ф THEOREM 
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A decision tree argument can also be used to establish a lower bound on the 
worst-case number of comparisons required to sort a list of n elements. As we did 
for the search problem, let us see what we can say about a decision tree for sort- 
ing based on comparisons, regardless of the algorithm it represents. The leaves of 
such a tree represent the final outcomes, that is, the various ordered arrangements 
of the n items. There are n! such arrangements, so if p is the number of leaves in 
the decision tree, then p = n!. The worst case will equal the depth of the tree. But 
it is also true that if the tree has depth d, then p = 2° (Exercise 43 of Section 6.2). 
Taking the base 2 logarithm of both sides of this inequality, we get log p = d or, 
because d is an integer, d — [log p]. Finally, we obtain 


d = [logp] = [logz!| 


This result proves the following theorem. 


ON THE LOWER BOUND FOR SORTING 
Any algorithm that sorts an n-element list by comparing pairs of items from the 
list must do at least [log л! | comparisons in the worst case. 


It can be shown (Exercise 23) that logn! = Ө(п Іов п). Therefore we have 
proved that sorting п elements by comparing pairs of list items is bounded below 
by O(nlog п), whereas searching by comparing the target element to the list items 
is bounded below by O(logn). As expected, it takes more work to sort than to 
search. 


SECTION 6.3 REVIEW 


TECHNIQUES 


Ù Draw decision trees for sequential search and bi- 
nary search on n-element lists. 
QJ Create a binary search tree. 


bounds on the minimum amount of work needed to 
solve the problem in the worst case. 

* The task of searching an n-element list for a target 
value x, if done by comparing x to elements in the 
list, requires at least |logn| + 1 comparisons in the 
worst case. 


MAIN IDEAS | 0. 
2. | * The task of sorting an n-element list, if done by 
К Decision trees represent the sequences of possible comparing pairs of list elements, requires at least 
actions for certain algorithms. [log л! | comparisons in the worst case. 


* Analysis of a generic decision tree for algorithms 
that solve a certain problem may lead to lower 


EXERCISES 6.3 


1. Draw the decision tree for sequential search on a list of three elements. 


2. Draw the decision tree for sequential search on a list of six elements. 


3. Draw the decision tree for binary search on a sorted list of seven elements. What is the depth of the tree? 


4. Draw the decision tree for binary search on a sorted list of four elements. What is the depth of the tree? 


10. 


11. 


12. 


13. 


14. 
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. Consider a search algorithm that compares an item with the last element in a list, then the first element, 


then the next-to-last element, then the second element, and so on. Draw the decision tree for searching a 
six-element sorted list. What is the depth of the tree? Does it appear that this is an optimal algorithm in the 
worst case? 


. Consider a search algorithm that compares an item with an element one-third of the way through the list; 


based on that comparison, it then searches either the first one-third or the second two-thirds of the list. 
Draw the decision tree for searching a nine-element sorted list. What is the depth of the tree? Does it ap- 
pear that this is an optimal algorithm in the worst case? 


. a. Given the data 


9,5, 6, 2,4, 7 


construct the binary search tree. What is the depth of the tree? 


b. Find the average number of comparisons done to search for an item that is known to be in the list 
using binary tree search on the tree of part (a). (Hint: Find the number of comparisons for each of the 
items.) 


. a. Given the data 


g,d,r,s,b,q,c,m 


construct the binary search tree. What is the depth of the tree? 

b. Find the average number of comparisons done to search for an item that is known to be in the list 
using binary tree search on the tree of part (a). (Hint: Find the number of comparisons for each of the 
items.) 


. а. Fora set of six data items, what is the minimum worst-case number of comparisons a search algorithm 


must perform? 

b. Given the set of data items 1a, d, g, i, k, s}, find an order in which to enter the data so that the corre- 
sponding binary search tree has the minimum depth. 

a. Fora set of nine data items, what is the minimum worst-case number of comparisons a search algorithm 
must perform? 

b. Given the set of data items {4, 7, 8, 10, 12, 15, 18, 19, 21}, find an order in which to enter the data so 
that the corresponding binary search tree has the minimum depth. 


An inorder tree traversal of a binary search tree produces a listing of the tree nodes in alphabetical or 
numerical order. Construct a binary search tree for “To be or not to be, that is the question,” and then do 
an inorder traversal. 


Construct a binary search tree for “In the high and far-off times the Elephant, O Best Beloved, had no 
trunk,” and then do an inorder traversal. (See Exercise 11.) 

Use the theorem on the lower bound for sorting to find lower bounds on the number of comparisons 
required in the worst case to sort lists of the following sizes: 

a. 4 b. 8 c. 16 


Contrast the number of comparisons required for selection sort and merge sort in the worst case with the 
lower bounds found in Exercise 13 (see Exercise 23 in Section 3.3). What are your conclusions? 


Exercises 15—20 concern the problem of identifying a counterfeit coin (one that is two heavy or too light) from 
a set of n coins. A balance scale is used to weigh a group of any number of coins from the set against a like 
number of coins from the set. The outcome of such a comparison is that group A weighs less than, the same as, 
or more than group B. A decision tree representing the sequence of comparisons done will thus be a ternary tree, 
where an internal node can have three children. 


15. 


16. 


L7. 


18. 


19; 
20. 


21, 
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One of five coins is counterfeit and is lighter than the other four. The problem is to identify the counterfeit coin. 
a. What is the number of final outcomes (the number of leaves in the decision tree)? 

b. Find a lower bound on the number of comparisons required to solve this problem in the worst case. 

c. Devise an algorithm that meets this lower bound (draw its decision tree). 


One of five coins is counterfeit and is either too heavy or too light. The problem is to identify the counter- 
feit coin and determine whether it is heavy or light. 


a. What is the number of final outcomes (the number of leaves in the decision tree)? 
b. Find a lower bound on the number of comparisons required to solve this problem in the worst case. 
c. Devise an algorithm that meets this lower bound (draw its decision tree). 


One of four coins is counterfeit and is either too heavy or too light. The problem is to identify the counter- 
feit coin but not to determine whether it is heavy or light. 


a. What is the number of final outcomes (the number of leaves in the decision tree)? 
b. Find a lower bound on the number of comparisons required to solve this problem in the worst case. 
c. Devise an algorithm that meets this lower bound (draw its decision tree). 


One of four coins is counterfeit and is either too heavy or too light. The problem is to identify the counter- 
feit coin and determine whether it is heavy or light. 


a. What is the number of final outcomes (the number of leaves in the decision tree)? 
b. Find a lower bound on the number of comparisons required to solve this problem in the worst case. 


c. Prove that no algorithm exists that can meet this lower bound. (Hint: The first comparison can be made 
with either two coins or four coins. Consider each case.) 


Devise an algorithm to solve the problem of Exercise 18 using three comparisons in the worst case. 


One of eight coins is counterfeit and is either too heavy or too light. The problem is to identify the counter- 
feit coin and determine whether it is heavy or light. 


a. What is the number of final outcomes (the number of leaves in the decision tree)? 
b. Find a lower bound on the number of comparisons required to solve this problem in the worst case. 
c. Devise an algorithm that meets this lower bound (draw its decision tree). 


In the decision tree for the binary search algorithm (and the binary tree search algorithm), we have counted 
each internal node as one comparison. For example, the top of Figure 6.53 looks like this: 


x:L[4] 


To get to either of the child nodes of the root, we have assumed that one comparison has been done. 
However, the outcome of the comparison at each internal node is really a three-way branch: 


x — node element 
x « node element 


x > node element 


Think about how this three-way branch would be implemented in most programming languages, and write 
a more accurate expression than 1 + |logn| for the number of comparisons in the worst case. 


| Section 6.4 Huffman Codes 539 


22. Our existing binary search algorithm (Chapter 3, Example 13) contains the pseudocode instruction 
find the index k midway between i and j 


after which the target x is compared to the list item at index k, the “midpoint item." Suppose that this in- 
struction is replaced by 


if i = j then 
k=] 
else 
k=it+l 
end if 


a. Draw the decision tree that results from using the modified algorithm on a sorted list with n = 8. 
b. Give the exact number of comparisons required (see Exercise 21) in the worst case for n = 8. 


c. Give a worst-case order-of-magnitude expression for the number of comparisons required as a func- 
tion of n, and justify your expression. Comment on the use of this algorithm as opposed to the original 
binary search algorithm, which is O(log л). 


23.To prove that logn! = O(mlogn), we can use the definition of order of magnitude (see 
Section 5 of Chapter 5) and show that there exist positive constants ,c,, and c) such that for 
п = по, cy(n log n) = log n! = c,(n log n). 
a. Show that for n = 1,log n! = n log n. (Hint: Use the definition of n! and properties of logarithms.) 


b. Show that for n = 4, log n! = (1/4)(n logn). (Hint: Use the definition of n! and properties of 
logarithms, but stop at log[n/2].) 


SECTION 6.4 | HUFFMAN CODES 


Problem and Trial Solution 


Character data consist of letters of the alphabet (both uppercase and lowercase), 
punctuation symbols, and other keyboard symbols such as @ and %. Computers 
store character data in binary form, as a sequence of 0s and 15, usually by fixing 
some length n so that 2” is at least as large as the number of distinct characters 
and then encoding each distinct character as a particular sequence of n bits. Each 
character must be encoded into its fixed binary sequence for electronic storage, and 
then the binary sequence must be decoded when the character is to be displayed. 
The most common encoding scheme for many years was ASCII (American Stan- 
dard Code for Information Interchange), which uses п = 8, so that each character 
requires 8 bits to store. However, 28 = 256, so a maximum of 256 characters could 
be encoded. This was enough for the English alphabet, punctuation, and special 
characters, but as electronic data storage spread around the world, it was not enough 
to include characters found in other languages such as Russian, Japanese, Arabic, 
Greek, and many others. Unicode (in general) uses 16 bits to encode a single char- 
acter, so that 2'° = 65536 character encodings are now available. But whatever 
value is chosen for n, each character requires the same amount of storage space. 
Suppose a collection of character data to be stored in a file in binary form is 
large enough that the amount of storage required is a consideration. Suppose also 
that the file is archival in nature, and its contents will not often be changed. Then 
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it may be worthwhile to invest some extra effort in the encoding process if the 
amount of storage space required for the file could be reduced. 

Rather than using a fixed number of bits per character, an encoding scheme could 
use a variable number of bits and store frequently occurring characters as sequences 
with fewer bits. To store all the distinct characters, some sequences will still have to be 
long, but if the longer sequences are used for characters that occur less frequently, the 
overall storage required should be reduced. This approach requires knowledge of the 
particular file contents, which is why it is best suited for a file whose contents will not 
be frequently changed. We will study such a data compression or data compaction 
scheme here, because it is best described as a series of actions taken on binary trees. 


As a trivial example, suppose that a collection of data contains 50,000 instances 
of the six characters a, с, g, К, p, and ?, which occur with the following percent 


frequencies: 
Character а @ g k р D 
Frequency 48 9 12 4 17 10 


Because six distinct characters must be stored, the fixed-length scheme would 
require at a minimum three bits for each character (2? = 8 > 6). The total storage 
required would then be 50,000 * 3 = 150,000 bits. Suppose instead that the fol- 
lowing encoding scheme is used: 


Character а @ g k р y 
Encoding scheme 0 1101 101 1100 Ш 100 


Then the storage requirement (number of bits) is 
50,000 (0.48 * 1 + 0.09 * 4-- 0.12 *3+0.04*4+0.17 * 34- 0.10 * 3) = 108,500 


which is roughly two-thirds of the previous requirement. ө 


In the fixed-length storage scheme with n bits for each character, the long 
string of bits within the encoded file can be broken up into the code for successive 
characters by simply looking at n bits at a time. This makes it easy to decode the 
file. In the variable-length code, there must be a way to tell when the sequence for 
one character ends and the sequence for another character begins. 


PRACTICE 28 | Using the variable-length code of Example 37, decode each of the following strings: 


a. 11111111010100 
b. 1101010101100 
c. 100110001101100 m 


In Practice 28 the strings can be broken into the representation of characters 
in only one way. As each new digit is considered, the possibilities are narrowed 
as to which character is being represented until the character is uniquely identi- 
fied by the end of that character’s representation. There is never any need to guess 
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at what the character might be and then backtrack if our guess proves wrong. 
This ability to decode uniquely without false starts and backtracking comes about 
because the code is an example of a prefix code. In a prefix code, the code for any 
character is never the prefix of the code for any other character. (A prefix code is 
therefore an “antiprefix” code!) 


| EXAMPLE 38 | КЕИ Consider the code 


Character a b @ 
Encoding scheme 01 101 011 


which is not a prefix code. Given the string 01101, it could represent either ab 
(01—101) or ca (011-01). Furthermore, in processing the string 011011 digit by 
digit as a computer would do, the decoding could begin with ab (01—101) and only 
encounter a mismatch at the last digit. Then the process would have to go all the 
way back to the first digit in order to recognize cc (011—011). ө 


As an aside, Morse code is a variable-length code. Morse’s encoding scheme 
for telegraphic communication, invented in 1838, uses strings of dots and dashes 
to represent letters of the alphabet. The most frequently occurring letter of the 


alphabet in English text is the letter “е,” which is assigned the shortest code, a 
single dot. The Morse code for 


“hello world” 


is 


Here you can see that “e” is a single dot, that “7” is dot-dash-dot-dot, and that “r” 
is dot-dash-dot. Morse code is not a prefix code, however; note that the “7” code is 
the first part of the “/” code. To avoid this kind of ambiguity, Morse code inserts 
a pause between the code for each letter. To decode, you wait for the pause and at 
that point you have the code for exactly one letter. 

In our approach to prefix codes, we will build binary trees with the characters 
as leaves. Once the tree is built, a binary code can be assigned to each character 
by simply tracing the path from the root to that leaf, using 0 for a left branch and 1 
for a right branch. Because no leaf precedes any other leaf on some path from the 
root, the code will be a prefix code. The binary tree for the code of Example 37 is 
shown in Figure 6.57. 


Figure 6.57 
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Suppose a code tree T exists, with leaves representing characters. For any 
leaf i, its depth d(i) in T equals the number of bits in the code for the corre- 
sponding character. Let fi) denote the percentage frequency of that character 
in the data to be stored, and let S be the total number of characters to be stored. 
Then, just as in Example 37, the total number of bits required is given by the 
expression 


Sx | > (oo| 


all leaves і 


We seek to build an optimal tree T, one for which the expression 


Е(Т) =| 2, О (1) 


all leaves і 


is a minimum and hence the file size is a minimum. 

This process could be done by trial and error, because there is only a finite 
number of characters and thus only a finite number of ways to construct a tree and 
assign characters to its leaves. However, the finite number quickly becomes very 
large! Instead we will use the algorithm known as Huffman encoding. 


Huffman Encoding Algorithm 


Suppose, then, that we have m characters in a file and we know the percentage 
frequency of each character. The algorithm to build the tree works by maintain- 
ing a list L of nodes that are roots of binary trees. Initially L will contain m roots, 
each labeled with the frequency of one of the characters; the roots will be ordered 
according to increasing frequency, and each will have no children. A pseudocode 
description of the algorithm follows. 


ALGORITHM 


HUFFMANTREE 


HuffmanTree (node list L; integer m) 
//Each of the m nodes in L has an associated frequency f, and L is 
//ordered by increasing frequency; algorithm builds the Huffman tree 


for (i = 1 to m — 1) do 
create new node z 
let x, y be the first two nodes in L //minimum frequency nodes 
Ха) = Дх) + fly) 
insert z in order into L 
left child of = node x 
right child of z = node y //x and y are no longer in L 
end for 
end HuffmanTree 
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When this algorithm terminates, L consists of just one node, which is the root 
of the final binary tree. Codes can then be assigned to each leaf of the tree by trac- 
ing the path from the root to the leaf and accumulating 05 for left branches and 
15 for right branches. By the way the tree is constructed, every internal node will 
have exactly two children. 


We'll use algorithm HuffmanTree to build the tree of Figure 6.57, which is based 
on the data of Example 37. L initially contains the six nodes, ordered by frequency: 


4 9 10 12 17 48 
e ө 


Following the algorithm, we enter ће for loop for the first time. The x and y nodes 
are those with frequencies 4 and 9, respectively. A new node z with frequency 
4 + 9 = 13 iscreated and inserted in order into L, with the x node as its left child 
and the y node as its right child The new Z looks like the following: 


10 12 13 17 48 
° 9 A е е 
4 9 


This process is repeated four more times. The resulting L at each stage follows: 


13 17 22 48 
4 9 10 12 
22 30 48 

aX | 
13 im 

10 12) 
4 9 

48 52 


4(k) 9(с) 


At this point the tree is complete and the codes can be assigned. The code for с, for 
example, is 1101 (right branch, right branch, left branch, right branch). ө 
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PRACTICE 29 | Construct the Huffman tree for the following characters and frequencies: 


Character 
Frequency 


PRACTICE 30 


w q h e 
10 12 20 58 m 


Find the Huffman codes for the characters of Practice 29. 


Table 6.1 shows the steps in Huffman encoding/decoding for data compression. 


TABLE 6.1 


Encoding Step 1 


On the original CLEARTEXT file, perform a frequency analysis; that is, create a file 
FREQUENCY that contains data of the form 

a—18 

b—7 
and so forth. 


Encoding Step 2 


Encoding Step 3 


Using FREQUENCY, create a file CODETABLE that contains the Huffman code for each 
character, i.e., 

a—001 

b—1110 
and so forth. 
Using CLEARTEXT and CODETABLE, create a file called CODED that contains the 
compressed data. 


Decoding 


Using CODED and CODETABLE, decode the data to recover CLEARTEXT. 


The CODED file is the data-compressed version of CLEARTEXT, and presum- 
ably it requires less storage space. However, the CODETABLE file must also be 
stored in order to be able to decode the file. 


Justification 


Although the algorithm to construct the Huffman tree 715 easy enough to describe, 
we must justify that it gives us the minimum possible value for E(7). 

First, if we have an optimal tree Т for m characters, the nodes with the low- 
est frequencies can always be assumed to be the left and right children of some 
node. To prove this assumption, label the two nodes with the lowest frequencies 
x and y. If x and y are not siblings in the tree, then find two siblings p and q at 
the lowest level of the tree, and consider the case where x and y are not at that 
level (Figure 6.58a). Because f(x) is one of the two smallest values, we know that 
fx) = fp). I£f(x) < fp), then interchanging x and p in the tree would result in a 
new tree 7’ with E(T’) < E(T ) (Figure 6.58b: the larger frequency is now at a less- 
er depth—see Exercise 20a), but this would contradict the fact that T was optimal. 
Therefore f(x) = f(p), and x and p can be interchanged in the tree with no effect 
on E(7). Similarly, y and q can be interchanged, resulting in Figure 6.58c, in which 
x and y are siblings. If x or y are at the same level as p and q to begin with, they can 
certainly be interchanged with p or q without affecting A(T) (Figure 6.58d). 
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x P р 
y y d y 
р 4 x 4 x y р 4 x 


(a) (b) (c) (d) 
Figure 6.58 
T T' B' B 
frequency f(x) + f(y) frequency f(x) + f(y) 
x y x y. 
(а) (b) (c) (d) 
Figure 6.59 


Now again let f(x) and f(y) be the minimum frequencies, and suppose we 
have a tree T that is optimal for the other frequencies together with the sum 
f(x) + f(y) (Figure 6.593). This sum will be the frequency of a leaf node; cre- 
ate a tree 7" that has this node as an interior node with children x and y hav- 
ing frequencies f(x) and f(y) (Figure 6.59b). T’ will be optimal for frequencies 
fŒ), f(y), and the rest. The proof of this fact begins with some optimal tree B’ for 
frequencies f(x), f(y), and the rest. We know such an optimal tree exists (since 
it could be found by trial and error), and from the preceding paragraph, we can 
assume that x and y are siblings in B’ (Figure 6.59c). Now create a tree В by strip- 
ping nodes x and y from B’ and giving frequency f(x) + f(y) to their parent node, 
now a leaf (Figure 6.59d). Because T' is optimal for the other frequencies together 
with f(x) + f(y), we have 


E(T) = E(B) (1) 
But the difference between E(B) and E(B’) is one arc each for x and y; that 
is, E(B’) = E(B) + f(x) + f(y) (see Exercise 205). Similarly, we have 
E(T') = ЕТ) + f(x) + f(y). Thus, if we add f(x) + f(y) to both sides of (1), we get 


E(T') € E(B’) (2) 


EXAMPLE 40 
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Because B’ was optimal, it cannot be the case that E(T’) < E(B’), so E(T') = E(B’), 
and T” is optimal. 

Finally, a tree with a single node whose frequency is the sum of all the fre- 
quencies is trivially optimal for that sum. We can repeatedly split up this sum 
and drop down children in such a way that we end up with the Huffman tree. 
By the preceding paragraph, each such tree, including the final Huffman tree, is 
optimal. 


If we applied the process that preserves optimality to the tree of Figure 6.57, we 
would begin with a single node with frequency 100 and “grow” that tree down- 
ward, as shown in Figure 6.60. 


100 100 100 
е 


48 ГАХ 52 48 52 


22 30 


100 100 


48 52 48 52) 
22 30 22 30 


16, 17 
10 То 113} 17 


Figure 6.60 ө 


Application of Huffman Codes 


The cost of data storage has certainly declined in recent years, and the relatively low 
cost makes it seem as if practically unlimited storage should be available to all. But 
"relatively inexpensive" doesn’t mean “ее,” and during the same period in which 
data storage costs have decreased, the legal requirements on many businesses and 
other organizations to keep electronic records have increased. Much of this infor- 
mation is archival in nature and suitable for data compression. As just one example, 
the Sarbanes—Oxley Act became United States federal law in 2002. This law was 
enacted as a reaction to a series of high-profile scandals about accounting irregulari- 
ties and irresponsible corporate governance that shook financial markets and public 
trust. Congress authorized the Securities and Exchange Commission (SEC) to set up 
rules regarding retention of documents concerned with financial audits or reviews. 
The final SEC rule requires that all publicly traded companies must retain electronic 
records of all financial transactions for seven years. These records are to include not 
just formal documents but any e-mail, memos, working papers, and so forth, sent or 
received, that contain opinions, conclusions, analyses, or financial data. 
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As a more specific application of Huffman codes, let’s look at JPEG, a 
standardized image compression mechanism for photographic-quality images. 
JPEG stands for Joint Photographic Experts Group, the name of the group that 
developed this international standard. The need for improved image compres- 
sion was largely fueled by the desire to transmit images over the Internet. There 
are actually two versions of JPEG encoding, lossy and lossless, but the lossy 
version is by far the more common. A lossy compression scheme means that 
once the compressed data has been unencoded, it does not precisely match 
the original—some information has been “lost.” In the case of lossy JPEG 
compression, data loss comes from the preprocessing of the image before Huff- 
man encoding is applied; the Huffman encoding/decoding faithfully restores 
the data it starts with. 

JPEG compression is intended for images to be viewed by humans, and it 
takes advantage of the fact that the human eye is much more sensitive to gra- 
dients of light and dark than it is to small changes in color. The first step in the 
JPEG process is therefore to take the color image information, which is usually 
given as 24 bits per pixel, 8 bits for each of the red, green, and blue components, 
and transform each pixel into components that capture the luminance (lightness/ 
darkness) with reduced information about the color components. Next, pixels 
with similar color information are grouped together and an “average” color 
value is used, while more accurate luminance data are maintained. The data 
are then transformed into frequency data (that is, the data are represented as a 
combination of cosine waves of varying frequencies), which in turn go through 
a “quantization” process (basically rounding the results of a computation) to 
end up in integer form. Higher-frequency variations, to which the human eye is 
less sensitive, are lost in this process, but again the luminance data are treated 
at a finer grain than the color data. Huffman encoding is applied to the result. 
Areas of the image whose representations occur often will encode to smaller bit 
strings. 

A JPEG image file contains not only the compressed data but also the infor- 
mation needed to reverse the compression process (including the information to 
reverse the Huffman encoding). The resulting image has lost the high-frequency 
changes and color variations that were eliminated in the stages before the Huffman 
coding was applied. Parameters in the JPEG encoding process allow tradeoffs to 
be made between the amount of compression to be obtained and the faithfulness 
of the restored image to the original. Because of the nature of the algorithms used, 
JPEG encoding has little or no effect on black-and-white line drawings where 
there are no data to throw away. 
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TECHNIQUE 


REVIEW 


MAIN IDEA 


W) Find Huffman codes, given a set of characters and ° Given the frequency of characters in a collection of 


their frequencies. 


data, a binary encoding scheme can be found that 
minimizes the number of bits required to store the 
data but still allows for easy decoding. 
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EXERCISES 6.4 
1. Is the following code a prefix code? Why or why not? 


b 
100 


d 
011 


Character 
Encoding scheme 


т 


01 


w 
101 


2. Using the code of Exercise 1, decode the string 01101. 
. Given the codes 


i 


10 


Character 
Encoding scheme 


о 
110 


и 


01 111 


decode the sequences 
a. 11011011101 
b. 1000110111 
c. 010101 
. Given the codes 


b 
1000 


h 
1001 


% 
101 


Character 
Encoding scheme 


q 
0 
decode the sequences 

a. 10001001101101 

b. 11110 

c. 01001111000 


. Given the codes 


Character 


w ( 
Encoding scheme 110 11 


decode the sequences 

a. 111110101101110001 

b. 1010001110 

c. 1111111100111101110 
. Given the nonprefix codes 


1 
1 


3 
111 


5 
101 


Character 
Encoding scheme 


give all possible decodings of the sequence 111110101. 


) 
1110 


. Write the Huffman codes for a, b, c, and d in the binary tree shown. 


| Section 6.4 Huffman Codes 549 


8. Write the Huffman codes for r, s, t, u in the binary tree shown. 


9. a. Construct the Huffman tree for the following characters and frequencies. 


Character с а g m ІА 2 
Frequency 28 25 6 20 3 18 


b. Find the Huffman codes for these characters. 
10. a. Construct the Huffman tree for the following characters and frequencies. 


Character b n р S w 
Frequency 6 32 21 14 27 


b. Find the Huffman codes for these characters. 
11. а. Construct the Huffman tree for the following characters and frequencies. 


Character a 2 t e € 
Frequency 27 12 15 31 15 


b. Find the Huffman codes for these characters. 
12. a. Construct the Huffman tree for the following characters and frequencies. 


Character ? x w e t 5 а 
Frequency 14 3 11 27 18 22 5 


b. Find the Huffman codes for these characters. 


c. A file consisting of 100,000 instances of these seven characters is stored using a fixed-length binary 
encoding scheme. How many bits are required for each code and what is the total number of bits 
needed? 


d. Storing the same file using the Huffman code of part (b), how many bits are needed? 


In Exercises 13—14, the integers could represent the results of the “quantization” step in a JPEG image com- 
pression, and the number of occurrences of each in the image. (Note that these are instances of occurrence 
rather than percent frequencies; this simply means that the Huffman tree will not end up with the root value 
equal to 100.) 


13. Construct the Huffman tree and find the Huffman codes for the following integers and occurrences. 


Integer 82 664 327 349 423 389 
Occurrences 416 97 212 509 446 74 


14. Construct the Huffman tree and find the Huffman codes for the following integers and occurrences. 


Integer 190 205 514 333 127 901 277 
Occurrences 52 723 129 233 451 820 85 
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15. JPEG can achieve various compression levels; the higher the compression, the lower the quality of the 
reconstructed image. 


a. A compression ratio of 10:1 results in virtually imperceptible loss of image quality. A file of 850,000 
bytes that is compressed with a 10:1 compression ratio results in a compressed file of what size? 


b. Ata compression ratio of 25:1, some degradation is visible in the reconstructed image. A file of 850,000 
bytes that is compressed with a 25:1 compression ratio results in a compressed file of what size? 


16. Explain why JPEG encoding results in less compression for gray-scale images than for full-color images. 


17. Someone does a global substitution on the text file of Exercise 11, replacing all instances of “z” with “sh.” 
Find the new Huffman codes. 


18. Consider the following paragraph. 


However, in my thoughts I could not sufficiently wonder at the intrepidity of these diminutive mortals 
who durst venture to mount and walk upon my body, while one of my hands was at liberty, without 
trembling at the very sight of so prodigious a creature as I must appear to them.* 


If this paragraph were to be compressed using a Huffman code, what single character, aside from punctua- 
tion or uppercase characters, would be apt to have one of the longest codes? Which would have one of the 
shortest? 


19. Recall the problem posed at the beginning of this chapter. 


You work in the Information Systems Department at World Wide Widgets (WWW), the leading widget 
manufacturer. Part numbers consist of a leading character B, С, G, R, or S to identify the part type, fol- 
lowed by an 8-digit number. Thus 


C00347289 
B11872432 
845003781 


are all legitimate part numbers. WWW maintains a data file of the part numbers it uses, which, as it 
turns out, is most of the potential numbers. 


How can you compress this data file so it takes less storage space than the approximately 4.5 Gb required 
using the ASCII encoding scheme of eight bits per character? 


a. Running a frequency count on the WWW data file reveals the following information: 


Character B G G R S 0 1 2 3 4 5 6 7 8 9 
Frequency 2 5 1 2 1 18 13 7 12 9 6 11 7 2 4 


Construct a Huffman code for these characters. 
b. Compute the space requirements of the compressed file as a percent of the uncompressed file. 


20. In the justification that the Huffman algorithm produces an optimal tree, the following two assertions were 
made. Prove that each is true. 


a. Е(Т') < E(T) 
b. E(B’) = E(B) + f(x) + f(y) 


?From Gulliver s Travels by Jonathan Swift, London, 1726. 
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TERMINOLOGY 


acyclic graph (p. 482) 

adjacency list (p. 494) 

adjacency matrix (p. 492) 

adjacent nodes (p. 481) 

arc (edge) (p. 477, 478) 

binary search tree (p. 534) 

binary tree (p. 510) 

binary tree search (p. 534) 

bipartite complete graph (p. 483) 

children nodes (p. 509) 

complete binary tree (p. 510) 

complete graph (p. 482) 

connected graph (p. 482) 

cycle (p. 482) 

data compression (data 
compaction) (p. 540) 

data flow diagram (p. 480) 

decision tree (p. 529) 

degree of a node (p. 481) 

depth of a node (p. 510) 

depth (height) of a tree (p. 510) 

directed graph (digraph) (p. 478) 

endpoints (p. 478) 

Euler’s formula (p. 488) 

five-color theorum (p. 507) 


SELF-TEST 


forest (p. 510) 

full binary tree (p. 510) 
graph (p. 477, 478) 

graph colorability (p. 506) 
height of a tree (p. 510) 
homeomorphic graphs (p. 491) 
Huffman encoding (p. 542) 
infix notation (p. 518) 

initial point (p. 478) 

inorder traversal (p. 515) 
internal node (p. 510) 
isolated node (p. 481) 
isomorphic graphs (p. 485) 
isomorphism (p. 486) 
labeled graph (p. 479) 

leaf (p. 510) 

left child (p. 510) 

length of a path (p. 482) 
linked list (p. 494) 

loop (p. 481) 

loop-free graph (p. 481) 
map-coloring problem (p. 506) 
node (vertex) (p. 477, 478) 
nonrooted (free) tree (p. 509) 
null pointer (p. 495) 


Answer the following true—false questions. 
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1. A connected graph has an arc between any two 


nodes. 


optimal algorithm (p. 533) 
parallel arcs (p. 481) 
parent node (p. 509) 
path (pp. 482, 483) 
planar graph (p. 487) 
Polish notation (p. 518) 
postfix notation (p. 518) 
postorder traversal (p. 515) 
prefix code (p. 541) 
prefix notation (p. 518) 
preorder traversal (p. 514) 
reachable node (p. 483) 
reverse Polish notation 
(RPN) (p. 518) 
right child (p. 510) 
root of a tree (p. 509) 
simple graph (p. 481) 
sparse matrix (p. 494) 
subgraph (p. 482) 
terminal point (p. 478) 
tree (p. 509) 
tree traversal (p. 514) 
weighted graph (p. 479) 


5. The adjacency matrix of a directed graph is not 


2. If graph О, is isomorphic to graph G,, then a node 


of degree 5 in G, will be mapped to a node of de- 


gree 5 in С, 


3. No matter how a planar graph is drawn, its arcs will 


intersect only at nodes. 


4. If part of the adjacency list representation of a 


graph contains 
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symmetric. 


1. The depth of any node in a tree is less than or equal 


to the height of the tree. 


2. Because a tree is a graph, a complete tree is also a 


complete graph. 


3. In the left child-right child array representation of 
a binary tree, any row of the array that corresponds 


5 д 7 3 to a leaf will have all zero entries. | 
4. Postorder traversal of an expression tree results in 
an algebraic expression in reverse Polish notation. 
then node 2 is adjacent to node 3 and node 3 is 5. In preorder tree traversal, the root is always the first 


adjacent to node 4. 


node visited. 
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Section 6.3 


1 


. The root of a decision tree for the binary search 


algorithm acting on a sorted list of 11 items would 
represent the comparison of the target element with 
the sixth list item. 


. Searching for any target element x in a list of л ele- 


ments requires at least 1 + |log n| comparisons. 


. A binary tree search is done with a target element 


of 14 on a binary search tree whose root has the 
value 10; the right subtree will be searched next. 


‚ А binary search tree is unique for any given set of 


data. 


. A decision tree for sorting n elements must have a 


depth of at least л! 


ON THE COMPUTER 


For Exercises 1—4, write a computer program that pro- 
duces the desired output from the given input. 


1 


. Input: Adjacency list for a graph 


Output: Adjacency matrix for the graph 


. Input: Adjacency matrix for a graph 


Output: Adjacency list for the graph 


. Input: Adjacency list for a graph and the name ofa 


node л in the graph 
Output: Adjacency list for the graph with node n 
and its associated arcs removed 


. Input: List of n characters and their (integer) fre- 


quencies 

Output: Huffman code for the characters 

(Hint: Maintain a sorted linked list of records that rep- 
resent the roots of binary trees. Initially there will be n 
such records, each with no children; at the end, there 
will be one such record, the root ofthe Huffman tree.) 


Section 6.4 


1 


2. 


3. 


The ASCII encoding scheme requires 8 bits to store 
each character. 

In a prefix code, each code word is the prefix of 
another code word. 

In a Huffman code, characters that occur most fre- 
quently have the most Os in their binary string rep- 
resentation. 


. The maximum number of bits for any encoded 


character using a Huffman code will be the depth 
of the Huffman tree. 


. To be able to decode an encoded file, a frequency 


count from the original file must be stored along 
with the encoded file. 


‚ Write a program that allows the user to enter a list 


of integers and constructs a binary search tree with 
those integers as nodes. The user can then enter one 
integer at a time, and the program will do a binary 
tree search and indicate whether the given integer 
is in the list. 


. Write a program that allows the user to enter a list 


of integers and then constructs a binary search tree 
with those integers as nodes. The user can then en- 
ter the type of traversal desired (inorder, preorder, 
or postorder), and the program will write out the 
nodes in the appropriate order. 


. Write a program that carries out the first three steps 


in Table 6.1. That is, beginning with a text file, the 
program should produce a frequency count file, then 
a code table file, then an encoded version ofthe origi- 
nal file. Write a second program that uses the encod- 
ed file and the code table, and recreates the original 
file. 
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Chapter 
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Question: 


After studying this chapter, you will be able to: 


* Convert between adjacency matrix, adjacency relation, and directed graph 
representations. 

e Use the reachability matrix of a directed graph to determine whether one node 
is reachable from another. 

e Compute the reachability matrix of a directed graph either directly or by using 
Warshall’s algorithm. 

* Test a graph for the existence of an Euler path (solve the highway inspector 
problem). 

* Understand the Hamiltonian circuit problem (and the traveling salesman prob- 

lem) and how they are fundamentally different from the Euler path problem. 

* Use Dijkstra’s algorithm to find the shortest path between two nodes in a simple, 

weighted, connected graph. 

* Use Prim's algorithm to find the minimal spanning tree in a simple, weighted, 

connected graph. 

e Carry out depth-first search and breadth-first search in a simple, connected graph. 

* Understand how depth-first search can be used to test for reachability in a di- 
rected graph, perform a topological sort on a partially ordered set represented 
by a directed graph, and find the connected components of an unconnected 
graph. 

* Identify articulation points in a simple connected graph. 


You are the network administrator for a wide-area backbone network that serves your 
company's many offices across the country. Messages travel through the network by 
being routed from point to point until they reach their destination. Each node in the 
network therefore acts as a switching station to forward messages to other nodes ac- 
cording to a routing table maintained at each node. Some connections in the network 
carry heavy traffic, while others are less used. Traffic may vary with the time of day; 
in addition, new nodes occasionally come on line and existing nodes may go off line. 
Therefore you must periodically provide each node with updated information so that 
it can forward messages along the most efficient (that is, the least heavily traveled) 
route. 


How can you compute the routing table for each node? 
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If the network described is viewed as a graph, your task as network administrator is to 
find the "shortest" path from one node to another in the graph. Because graphs have so 
many applications, there is a great deal of interest in finding efficient algorithms to an- 
swer certain questions about graphs, directed graphs, or trees, and to perform certain 
tasks on them, such as finding shortest paths. All graph algorithms use one of the con- 
venient representations (adjacency matrix or adjacency list) presented in Chapter 6. 

This chapter covers many of the “classical” graph algorithms. Section 7.1 first 
relates directed graphs to binary relations, and reachability in a graph to the transi- 
tive closure of a binary relation. Then two different algorithms pertaining to reach- 
ability are given. 

In Section 7.2 we will look at algorithms that answer two historically interest- 
ing questions about graphs. These questions are known as the highway inspector 
problem and the traveling salesman problem. The highway inspector problem asks 
whether there is a path through a given graph that uses each arc exactly once, 
thereby providing an efficient way for a highway inspector to check all roads with- 
out going over the same road twice. The traveling salesman problem asks whether 
there is a cycle in a given graph that visits each node of the graph and, if so, which 
such cycle requires the minimum distance to travel. Recall that a cycle is a path 
that ends where it started and does not use any other node more than once; thus 
such a cycle would provide an efficient way for a salesperson to visit all cities in 
the sales territory only once and end up at home. 

Section 7.3 provides algorithmic solutions to the two problems of finding the 
minimum path between two nodes in a simple, connected graph and of minimizing 
the number of arcs used to connect all nodes in a simple, connected graph. Section 7.4 
discusses algorithms for traversing simple graphs—‘‘visiting” all the nodes in some 
systematic way. Section 7.5 uses one of these traversal algorithms to detect articulation 
points in a simple connected graph, points whose removal would disconnect the graph. 


SECTION 7.1 


DIRECTED GRAPHS AND BINARY RELATIONS; 
WARSHALL’S ALGORITHM 


In this section we confine our attention to (unweighted) directed graphs with no 
parallel arcs. (In a directed graph, two arcs from node a to node b would be par- 
allel, but one arc from a to b and another from b to a are not parallel arcs.) Con- 
sider the adjacency matrix of the graph (assuming some arbitrary ordering of the 
n nodes, which we always assume when discussing the adjacency matrix of a 
graph). This will be an п X n matrix, not necessarily symmetric. Furthermore, 
because there are no parallel arcs in the graph, the adjacency matrix will be a 
Boolean matrix, that is, a matrix whose only elements are 05 and 15. Conversely, 
given an 7 X n Boolean matrix, we can reconstruct the directed graph that the 
matrix represents, and it will have no parallel arcs. Thus there is a one-to-one 
correspondence, which we can picture as 


Directed graphs 
with п nodes, ---— 
no parallel arcs 


n x n Boolean 
matrices 


(1) 


Now we will see how binary relations tie in to this correspondence. 
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Directed Graphs and Binary Relations 

Suppose С is a directed graph with n nodes and no parallel arcs. Let N be the set 
of nodes. If (n;,n;) is an ordered pair of nodes, then there either is or is not an arc 
in G from л, to n;. We can use this property to define a binary relation on the set №: 


n; p n;< there is an arc in С from n; to n; 


This relation is the adjacency relation of the graph. 


For the directed graph of Figure 7.1, the adjacency relation is {(1, 2), (1,3), (3,3), 


(4, 1), (4, 2), (4,3)}. 


1 @2 
3 
4 
Figure 7.1 e 


Conversely, if p is a binary relation on a set N, we can define a directed graph G 
with N as the set of nodes, and an arc from л; to n; if and only if n; p n;. С will have 
no parallel arcs. 


For the set N = (1,2, 3, 4} and the binary relation {(1, 4), (2, 3), (2, 4), (4, 1)} on 


N, we obtain the associated directed graph shown in Figure 7.2. 


le >» 4 
2 >e 3 
Figure 7.2 ө 


We now have another one-to-one correspondence: 


Binary Directed graphs 
relations on <--> with n nodes, 
n-element sets no parallel arcs 


(2) 
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Of course, a one-to-one correspondence means the existence of a bijection. If 
function composition is carried out on the bijections in (1) and (2), the result is a 
bijection that gives us a one-to-one correspondence between binary relations and 
matrices. Thus we have three equivalent sets: 


Binary Directed graphs 
Д А Eu n X n Boolean 
relations on <--> with n nodes, <--> : 
matrices 
n-element sets no parallel arcs (3) 


An item from any of the three sets has corresponding representations in the other 
two sets. 


Give the collection of ordered pairs in the adjacency relation for the following Boolean 


D- OTG 
or CO о 


Recall the reflexive, symmetric, antisymmetric, and transitive properties of a 
binary relation on a set that we studied in Chapter 5. If a binary relation on a set 
N has a certain property, it will be reflected in the corresponding graph and the 
corresponding Boolean matrix. Conversely, certain characteristics of a directed 
graph or of a Boolean matrix imply certain properties of the corresponding adja- 


If p is a reflexive relation on a set N, then for each n; € №, n; p n;. In the corre- 
sponding directed graph there will be a loop at each node, and in the corresponding 
Boolean matrix there will be 1s on the main diagonal. e 


PRACTICE 1 
matrix; also draw the directed graph. 
0 0 
a 
TIN) 
0 0 
cency relation. 
PRACTICE 2 


Explain why the corresponding binary relation is not antisymmetric for the directed 
graph in Figure 6.13 reproduced here. 
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In Chapter 5 we represented partial orderings on a set by using a Hasse 
diagram. How does this representation differ from the directed graph representa- 
tion? The Hasse diagram is a simplification of the directed graph representation. 
Suppose that G is the directed graph representation of a partial ordering. Because 
a partial ordering is reflexive, G will have a loop at each node. We can eliminate 
these loops in the Hasse diagram without losing any information because we 
know that each node has a loop; that is, each node is related to itself. Because a 
partial ordering is transitive, if apb and bpc, then apc. In the directed graph 
there would be an arc from a to b, an arc from b to с, and an arc from a to c. In 
the Hasse diagram we can eliminate the arc from a to c without losing any in- 
formation if we keep the transitive property in mind. Finally, the Hasse diagram 
is not a directed graph at all, but we did impose the convention that if a is an 
immediate predecessor of b, then node a will appear below node b in the Hasse 
diagram. Thus we could achieve a directed graph from the Hasse diagram by 
making all the arc directions point upward. The antisymmetry property prevents 
any potential conflict where node a should be below node b and node b should be 
below node a. 

In Chapter 5 we also noted set operations that could be performed on two 
binary relations p and сопа set N, p Uo and р П с. The relation p U ø is the 
union of the ordered pairs in p or c, while p N ø is the intersection of the ordered 
pairs in p and o. Let R and S be the Boolean matrices for p and oc, respectively. 
The Boolean matrix for p U с will have a 1 in position i, j if and only if there is a 
1 in position i, j of R ora 1 in position i, j of S. Each entry in the Boolean matrix 
for p Ua is thus the maximum of the two corresponding entries in R and $, so 
the Boolean matrix for p U ø is R V S (see the discussion of Boolean matrix op- 
erations in Section 5.7). Similarly, the Boolean matrix for p N с will have a 1 in 
position i, j if and only if there is a 1 in position i, 7 of both R and S. Therefore the 
Boolean matrix for р Пс is R AS. 


Reachability 


The “reachability” property has an interesting interpretation in each of the three 
equivalent forms in (3)—directed graph, adjacency relation, and adjacency ma- 
trix. We already have a definition for this term for directed graphs from Section 6.1, 
which we'll restate now. 


* DEFINITION REACHABLE NODE 
In a directed graph, node л; is reachable from node n; if there is a path from n; 
to nj. 


| EXANPLE4 | In the directed graph of Figure 7.2, node 3 is not reachable from node 4 or node 1. 


Node 1 is reachable from node 2 by the path 2—4—1. ө 


In a system modeled by a directed graph (a data flow diagram, for example) 
with a “start node,” any node that is unreachable from the start node can never af- 
fect the system and thus can be eliminated. If the directed graph represents some- 
thing like airline routes or communication paths in a computer network, it would 
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be undesirable to have some node be unreachable from some other node. Thus the 
ability to test reachability has very practical applications. 

The adjacency matrix A of a directed graph С with п nodes and no parallel 
arcs will have a | in position i, if there is an arc from л; to n;. This would be a 
path of length 1 from n; to n;. The adjacency matrix by itself therefore tells us 
about a limited form of reachability, via length-1 paths. However, let us perform 
the Boolean matrix multiplication A X A. We'll denote this product by A? to 
distinguish it from A’, the result of A · A using ordinary matrix multiplication. 
Recalling from Section 5.7 the definition of Boolean matrix multiplication, the 
i, j entry of А? is given by 


AM) = V @ ^ ay) (4) 
REMINDER 
To compute A? [i, /] write If a term such as ар ^ a5; in this sum is 0, then either a; = 0 or a5; = 0 (or 
two copies of A side by both), and there is either no path of length 1 from n, to n, or no path of length 1 
side. Run a left-hand from m, to n; (or both). Thus there are no paths of length 2 from л; to n; passing 
finger along row i of the through n. If ap ^ a5; is not 0, then both ap = 1 and a; = 1. Then there is a path 
left copy and a right-hand of length 1 from n, to n, and a path of length 1 from n, to n;, so there is a path of 
finger down column j of length 2 from n; to n; passing through n. A path of length 2 from л, to n; will exist 
the right copy. The value if and only if there is a path of length 2 passing through at least one of the nodes 


is 1 if and only if both 
fingers hit a 1 at the same 
time. 


from 1 to n, that is, if and only if at least one of the terms in the sum (4) is | and 
therefore A®[;, j] = 1. Therefore the entries in A® tell us about reachability via 
length-2 paths. 


PRACTICE 3 | Find A for the graph of Figure 7.2 and compute АХ). What does the 2,1 entry indicate? = 


The matrix A® indicates the presence or absence of length-2 paths. We might 
surmise that this result holds for arbitrary powers and path lengths. 


* THEOREM ON BOOLEAN ADJACENCY MATRICES AND REACHABILITY 
If A is the Boolean adjacency matrix for a directed graph G with n nodes and no 
parallel arcs, then A” |i, j] = 1 if and only if there is a path of length m from 
node л, to node n;. 


Proof: A proof by induction on m is called for. We have already shown the result 
true for m = 1 (and m = 2). Suppose that A” [7,7] = 1 if and only if there is a 
path of length p from л; to n;. We know that 


АО = V (ALE, k] ^ ay) 
k=1 


which will equal 1 if and only if at least one term, say A [i,q] Ла, = 1, or 
A [i,q] = 1 and a,; = 1. This condition will be true if and only if there is a path 


Section 7.1 Directed Graphs and Binary Relations; Warshall's Algorithm 559 


of length p from 7; to n, (by the inductive hypothesis) and there is a path of length 1 
from n, to пу, which means there is a path of length p + 1 from n; to n;. End of proof. 


PRACTICE 4 | From the graph of Figure 7.2, what would you expect for the value of entry 2,1 in A“? 
Compute A(? and check this value. 


If node n; is reachable from node n,, it is by a path of some length. Such a 
path will be shown by a 1 as the i, j entry in A or А? or AO! and so on, but we 
cannot compute an infinite number of matrix products. Fortunately, there is a 
limit to how far in this list we have to look. If there are л nodes in the graph, then 
any path with n or more arcs, and therefore n + 1 or more nodes, must have a 
repeated node. This is a consequence of the pigeonhole principle—there are n 
"bins" (distinct nodes) into which we are putting more than n objects (the nodes 
in a path with л or more arcs). The section of a path lying between the repeated 
nodes is a cycle. If n; # nj, the cycle can be eliminated to make a shorter path; 
then if a path exists from n; to n,, there will be such a path of length at most n — 1. 
If n; = nj, then the cycle could be the entire path from л; to n; with maximum 
length п; although we could eliminate this cycle (noting that any node may be 
considered reachable from itself), we will retain it to show that a nontrivial path 
does exist from n; to n;. 

Consequently, whether n; = n; or n; 7 n;, we need never look for a path from 
n; to n; of length greater than n. Therefore to determine reachability, we need only 
consult element i, j in A, AO, ... , А. Alternatively, we can define a reachability 
matrix R by 


R=AV АО V---V AM 


Then л, is reachable from л; if and only if entry i, j in К is positive. 

We now see how reachability in a graph can be expressed in terms of the ad- 
jacency matrix. How is reachability represented in terms of the adjacency relation 
that corresponds to the graph? 

If p is the adjacency relation for a graph С, we let p® denote the binary rela- 
tion of reachability; that is, (п, л;) € p" exactly when there is a path in С from 
n; to n;. Then we can show that p* is the transitive closure of p. Recall from the 
definition of closure of a relation that the transitive closure of p 1s a relation that is 
transitive, contains p, and is a subset of any transitive relation containing p. 

To see that p? is transitive, let (n; nj) and (nj, т) belong to р“. Then there is 
a path in С from n; to n; and a path in С from n; to п. Therefore there is a path 
in С from n;to п, and (л, п) belongs to p". To see that р“ contains p, let (п, nj) 
belong to p. Then there is an arc from n; to n; in G, which means there is a path of 
length 1 from л; to л, and (n, n;) belongs to р“. Finally, suppose с is any transi- 
tive relation on the nodes of С that includes p, and let (л„ п) belong to pF. This 
means that there is a path from л, to n; using, say, nodes n; ny, ny, ... , Nw, nj. Then 
there is an arc from each node in this path to the next, and the ordered pairs (n; n), 
(n,,N,), ... „ (1,,;), all belong to p, and therefore all belong to с. Because ø is 
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transitive, (7; n;) belongs to c, and р“ is a subset of с. Therefore pP is the transitive 
closure of p. 

To summarize, corresponding to the three equivalent representations of adja- 
cency relation p, directed graph G, and adjacency matrix A, we have 


(n, п) belongs 
to the transitive <> 
closure of p 


п, is reachable R[ij] = 1 where 
from n; in С R-AVAOyv ... VAM 


Let G be the directed graph in Figure 7.3; G has 5 nodes. 


REMINDER 


Don't try to compute 
matrix products without 
writing the two matrices 
side by side; you'll surely 
make a mistake. 


2 
1 3 
e4 

5 

Figure 7.3 
The adjacency matrix A for G is 

0 1.00 0 
0.0 1.0 0 
А=|1 0 0 1 0 
0 0 0 0 0 
i @ л o @ 


The adjacency relation p is р = {(1, 2), (2, 3), (3, 1), (3, 4), (5, 1), (5, 3)}. 
The successive powers of A are 


@ @ ПЛ @ © го @ 1 o 
ї @ @ 1 0 отоо о 
AUC OT АСЮ Ооо 
000 0 0 0 0 6 0 0 
igi g Gii 
i 000 0 П @ @ 
0 ооо ї @ @ Л @ 
M= 0 0 Лл © АЕ 100 0 
©0000 00000 
ї @ П ПЛ @ лоо 
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These matrices indicate, for example, that there is a path of length 2 from 2 to 1 
because A®[2, 1] = 1 (the path is 2-3-1), and there is a path of length 4 from 5 to 
3 because, A?[5, 3] = 1 (the path is 5—3—1—2—3), but there is no path of length 3 
from 1 to 3 because A? 1, 3] = 0. 

The reachability matrix R is the Boolean sum of A, AO, AO, AO. and Аб): 


5 
Il 
—Ó OF m. nm 
= OF н н 
_ OF m. nm 
_ O = н н 
C» (m € 2 => 


The 1 values in R indicate that there are paths in G from nodes 1, 2, 3, and 5 to 
every node except 5, but no path from node 4 to anywhere, which can be confirmed 
by looking at Figure 7.3. 

We have proved that the 1 entries in R mark the ordered pairs of nodes that 
belong to the transitive closure of p. The transitive closure will therefore be the 
following set of ordered pairs: 


(а, 1), (1, 2), (1, 3), (1, 4), (2, 1), (2, 2), 02,3), 02, 4), 
ООСО Э ©) 


Beginning with р and following the ad hoc procedure described in Chapter 5 for 
finding the transitive closure of a relation, we see that to obtain transitivity, we must 
first add the pairs (1, 3), (2, 1), (2, 4), (3, 2), (5, 2), and (5, 4). Reviewing the new set, 
we see that we must also add (1, 1), (1, 4), (2, 2), and (3, 3). The resulting collection 
of ordered pairs is transitive (and agrees with what we obtained earlier). e 


PRACTICE 5 | Compute R for the directed graph of Figure 7.2. What information does column 2 convey? 


In Chapter 5, we promised a better algorithm to find the transitive closure 
of a relation. Here it is. Write the binary relation in adjacency matrix form and 
compute 


R-AVAOVv---VvAO 


How much work is required to carry out this algorithm? The expression for R 
indicates that Boolean matrix operations are to be done, but matrix operations in 
turn require Boolean and and Boolean or operations on matrix elements. We will 
therefore use Boolean and and Boolean or as the measure of work. In Section 5.7, 
we noted that ordinary matrix multiplication of two n X n matrices requires 
O(m) multiplications and additions; by a similar argument, Boolean matrix 
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multiplication of two n X n Boolean matrices requires Ө(п?) Boolean and/or 
operations. The algorithm to compute К requires n — 1 Boolean matrix multipli- 
cations (to find the products AO), AO), ... , A). To compute n — 1 such products 
require (n — 1)@(n*) = O(n*) Boolean operations. To compute C V D where С 
and D are two n X n Boolean matrices requires л? Boolean or operations. To com- 
pute В, n — 1 such matrix operations are required, so (n — 1)n* = @(n*) Boolean 
or operations are performed. The total amount of work is O(n) + @(n*) = O(n^). 

Next we discuss a more efficient algorithm for computing the transitive 
closure of a relation (or the reachability matrix of a graph). 


Warshall's Algorithm 


For a graph G with n nodes, Warshall’s algorithm computes a sequence of n + 1 
matrices M, M, Mb, ..., M,. For each k, 0 = k = п, Mjli,j] = 1 if and only if 
there is a path in С from n; to л; whose interior nodes (1.е., nodes that are not the 
endpoints of the path) come only from the set of nodes (n, n2, ... , п}. 

Let us examine the “end conditions.” When k = 0, the set {л}, n, ... , пу} is 
the empty set, so Mi i, j] = 1 if and only if there is a path in G from n; to n; whose 
interior nodes come from the empty set; that is, there are no interior nodes. The 
path from n; to n; must then consist only of the endpoints and one connecting arc, 
so n; and л; are adjacent nodes. Thus М, = A. The other end condition occurs 
when k = n. Then the set (7, m, ... , п„} consists of all the nodes in С, so there is 
really no restriction at all on the interior nodes in the path and M, (Li, j] = 1 if and 
only if there is a path from п; to л, which means that M, = К. 

Therefore Warshall’s algorithm begins with A = М, and successively com- 
putes Mj, M,, ... , M, = R. This computation can be defined inductively. The 
base case is to let M) = A. Now assume that M, has been computed, and consider 
how to compute M, , | or, more specifically, М, , |i, j]. We have М, ilij] = 1 
if and only if there is a path from n; to n; whose interior nodes come only from the 
set (m, n, ... , nj, 4). This can happen in two ways: 


1. АП the interior nodes come from {n;, ns, ...,n,h in which case 
Myli,j] = 1. We should therefore carry forward any 1 entries in M, into 
M,.;. 

2. Node n;,, is an interior node. We can assume that n; | is an interior node 
only once, because cycles can be eliminated from a path. Then there must 
be a path from n; to л; ; whose interior nodes come from (n, n2, ... , т} 
anda path from л; | to л; whose interior nodes come from Dii; fios sse p Agf. 
This means that M,[i, k + 1] = 1 and МДК + 1,j] = 1, which is to say 
that МД, k + 1] ^ МДЕ + 1,j] = 1; this condition can be tested be- 
cause our assumption is that М, has already been computed. 


In the following pseudocode version of Warshall’s algorithm, the initial value 
of matrix M is A. Each pass through the outer loop computes the next matrix in 
the sequence M, Mj, ..., М, = В. 
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ALGORITHM 


WARSHALL’S ALGORITHM 


Warshall(n X n Boolean matrix M) 
//Initially, М = adjacency matrix of a directed graph С with no parallel arcs 


for k = Oton — 1 do 
for і = 1 ton do 
for j — 1 ton do 
M[ij| = Milij] V (M[i, k + 1] ^ MÍ& + 1, j]) 
end for 
end for 
end for 
//at termination, M = reachability matrix of С 
end Warshall 


This pseudocode gives a nice neat description of Warshall’s algorithm, which can 
be implemented as computer code rather easily. These steps are confusing to do by 
hand, however, requiring some bookkeeping to keep track of all the indices. We can 
write the algorithm more informally, making it easier to do manually. Suppose again 
that matrix M, in the sequence exists and we are trying to write row i of the next 
matrix in the sequence. For the various values of j, we must evaluate the expression 


M[i j] V Mli, k + 11^ M[k + 1,j]) (5) 
If entry M[i, k + 1]is0, then M[i, К + 1] ^ M[k + 1,j] = 0 for all j. Expression 
(5) then reduces to 
M[i,j] V 0 = M[i,j] 
In other words, row i of the matrix remains unchanged. If, on the other hand, 


entry M[i, k + 1] is 1, then M[i,k + 11^ M[k + 1,/] = M[k + 1,7] for all j. 
Expression (5) then becomes 


MLi,j] V M[k + 1,j] 


In other words, row i of the matrix becomes the Boolean or of the current row i 
and the current row k + 1. 

Table 7.1 describes the (informal) steps to compute entries in M,., from 
matrix M,. 


TABLE 7.1 


1. Consider column k + 1 in M,. 

2. For each row with a 0 entry in this column, copy that row to M, +. 

3. For each row with a 1 entry in this column, or that row with row k + 1 and write 
the resulting row іп M,. ч. 
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| EXAMPLE6 | For the graph of Example 5, the initial matrix Mọ is the adjacency matrix. 


01000 
d Rl 0 
M,=|1 0 0 1 0 
00000 
ї @ ї @ @ 


We know M, (so k = 0) and we want to compute M; (k + 1 = 1). Using step 1 of 
Table 7.1, we consider column 1 of Ме. Using step 2 of Table 7.1, rows 1, 2, and 4 
of М, contain 05 in column 1, so these rows get copied directly to М}: 
O it © 0 
@ O0 1 9 @ 
М, = 
0 © @ O @ 


Now we finish up by using step 3 of Table 7.1 Row 3 of column 1 of M, contains a 
І, so row 3 of M, is or-ed with row 1 of M, and the result becomes the new row 3: 


M, = 


oro © 
уже ЫИ „Ж dem iS 
оо н © 
о є_ © о 
о o € c9 


Row 5 of column 1 of M, contains a 1, so row 5 is or-ed with row 1 and the result 
becomes the new row 5: 


M, = 


=. OF c © 
= = SS = 
[— €» ec LI €» 
оо н о о 
c eo © €» ӘӘ 


To compute the entries in М», consider column 2. Rows 2 and 4 (the 0 posi- 
tions in column 2) will be copied unchanged. Row 1 will be or-ed with row 2 to 
give the new row 1, row 3 will be or-ed with row 2 to give the new row 3, and row 
5 will be or-ed with row 2 to give the new row 5: 


011 0 0 
00 1 0 
M,=/1 1 1 1 0 
0000 0 
1 Л ї @ O 
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М; is computed in a similar fashion: 


Ё 

Il 
—Ó OF m. m 
_ O н m= n 
_ O н н н 
_ O н н Lm 
O | €» 2 €» 


M, and M; will be the same as M5; row 4 is all Os, so any row that gets or-ed 
with it will be unchanged, and column 5 1s all 0s so all rows are copied directly. In 
terms of the graph, no new 1 entries are produced because there are no paths from 
4 to any node or from any node to 5. Thus M, = M, = M; = К, as computed in 
Example 5. Note, however, that the matrices computed by Warshall’s algorithm, 
except for M, = A and M, = К, do not agree with the matrices that are powers of 
A used in our previous algorithm for R. e 


Each pass through the outer loop of Warshall's algorithm modifies in place 
the matrix that existed at the end of the previous pass. Warshall’s algorithm re- 
quires no additional storage for other matrices, even though we wrote down new 
matrices in our example. There is one more point we need to check. Because we 
are modifying the (only) matrix as we go along, during any one pass through 
the outer loop, some of the entries will belong to М, +, while others still belong 
to M,. Specifically, on pass k + 1, we may consider M[i, k + 1] ^ M[k + 1,j] 
in Expression (5), where these values have already been computed on this pass 
and therefore represent М, , 1, k + 1] and М, ,[k + 1,j] rather than the values 
Myli, k 1] and Mj[k + 1, j] we used in our justification for this algorithm. Can 
there be a case where the values М, ;[i, k + 1] and M,, [К + 1,/] are 1, so that 
a 1 value goes into M, [i, j], whereas the values Му, k + 1] and МДК + 1,7] 
are 0? No—if M,,[i, k + 1] = 1, there is a path from л; to пу ү with interior 
nodes drawn from the set {n,, n, ... , nj, 1}. However, because n,,, is an end- 
point and cycles can be eliminated, there must also be a path with interior nodes 
drawn from the set (1, и, ... , nj) so that Mj[i, k + 1] = 1. A similar argument 
holds for M, [А + 1,7]. 


PRACTICE 6 | Use Warshall's algorithm (formally or informally) to compute R for the graph of Figure 7.2. 
Compare your answer with that for Practice 5. 


How much work does Warshall’s algorithm require as measured by the num- 
ber of Boolean and/or operations? Consider the formal algorithm. The single as- 
signment statement in the algorithm lies within a triply nested loop; it will be 
executed л? times. Each execution of the assignment statement requires one and 
and one or; therefore, the total amount of work is 27? = Ө(т?). Recall that our 
previous algorithm for computing R was an O(n*) algorithm. 
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SECTION 7.1 REVIEW 


TECHNIQUES 


* Find any two of the adjacency relation, directed 
graph, or adjacency matrix representation, given 
the third. 

Q) Compute the reachability matrix R for a graph С 
(or, equivalently, find the transitive closure of the 
adjacency relation on G) by using the formula 
R=AVA®V---V А) and by using War- 
shall’s algorithm. 


EXERCISES 7.1 


MAIN IDEAS 


* There is a one-to-one correspondence between a di- 


rected graph G with no parallel arcs, the adjacency 
relation on G, and the adjacency matrix for G (with 
respect to some arbitrary ordering of the nodes). 
The reachability matrix of a graph G also represents 
the transitive closure of the adjacency relation on G. 
The reachability matrix for a graph can be com- 
puted with O(n) Boolean and/or operations by 
summing powers of the adjacency matrix A or with 
O(m) Boolean and/or operations by using War- 
shall’s algorithm. 


1. Find the adjacency matrix and adjacency relation for the following graph. 


Q 


1 


2. Find the adjacency matrix and adjacency relation for the following graph. 


1 €— ———————— 9 3 


3. Find the corresponding directed graph and adjacency relation for the following adjacency matrix. 


Il 
оо c o 
o= оо о 


— оо о c 


re CO о о н 
e» © O Eo 
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4. Find the corresponding directed graph and adjacency relation for the following adjacency matrix. 


» 

Il 
© © © кє_ о 
о-о © 
SG © E e 
—_ © о © 
© © © ~ © 


5. Given the adjacency relation р = {(1, 4), (1, 5), (1, 6), (6, 2), (6, 3), (6, 5)} on the set N = {1, 2, 3, 4, 5, 6}, 
find the corresponding directed graph and adjacency matrix. 

6. Given the adjacency relation р = {(2, 1), (3, 2), (3, 3), (3, 4), (4, 5), (6, 3), (6, 6)} on the set N = {1, 2, 3, 
4, 5, 6}, find the corresponding directed graph and adjacency matrix. 


7. Let p be a binary relation defined on the set (0, 1, 2, 3, 4, 5, 6} by xpy < y = x + 2. Draw the associ- 
ated directed graph. 


8. Let p bea binary relation defined on the set (0, + 1, +2, +4, +16) by xpy €» y = x’. Draw the associated 
directed graph. 


9. Describe a property of a directed graph whose adjacency matrix is symmetric. 


10. Describe the directed graph whose adjacency matrix has all 05 on the main diagonal and 15 everywhere 
else. 


11. Describe the directed graph whose adjacency matrix has all 15 in row 1 and column 1, and 05 else- 
where. 


12. Describe the directed graph whose adjacency matrix has 15 in positions (4,7; + 1) for l -izn-—1,a 
1 in position (л, 1), and 05 elsewhere. 


13. Describe a property of a directed graph whose adjacency relation is irreflexive (see Exercise 26, 
Section 5.1). 


14. Describe a property of the adjacency matrix of a graph whose adjacency relation is antisymmetric. 


15. Adjacency relations p and с have the following associated adjacency matrices R and S. Find the adja- 
cency matrices associated with the relations p U ø and pfo. 


1 0 1 1 0 0 
R=]1 1 0 S=|1 0 1 
0 0 1 0 0 1 


16. Adjacency relations p and с have the following associated adjacency matrices А and $. Find the adja- 
cency matrices associated with the relations p U ø and p П с. 


Ex opc 
or о н 
oS oq 
=. Or © 
—— 0 сз 
O == OQ m 
о о = о 
о _ о о 
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17. The two directed graphs that follow have adjacency relations p and o. Draw the graphs associated with 
the relations p Ua and p N o. 


18. The two directed graphs that follow have adjacency relations p and o. Draw the graphs associated with 
the relations p Ua and p Пс. 


1 2 1 2 
d ; $ vd 
3 4 5 3 4 5 


19. Let A be the matrix 


0 1 1 

А=|1 1 I 

0 0 1 

Find the products A? and AO. 
20. Let A be the matrix 

1 1 0 1 
n 0 0 1 0 
1 0 0 1 
1 0 1 0 


Find the products A? and AO. 


21. The definition of a connected graph can be extended to directed graphs. Describe the reachability matrix 
R for a connected, directed graph. 


22. Describe the directed graph with the following reachability matrix R. 


ооо - - - 
ооо - н – 
ooo 
Бо © © 
Бо © © 
=... © © © 
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23. For the following graph, write the reachability matrix R by simply inspecting the graph. 


4e 3 
24. For the following graph, write the reachability matrix R by simply inspecting the graph. 


1 2 


5e 4 


For Exercises 25—30, compute the reachability matrix R by using the formula R = A V AO \/...\/ A” 


25. Exercise 1 277. Exercise 3 29. Exercise 5 
26. Exercise 2 28. Exercise 4 30. Exercise 6 


For Exercises 31—36, compute the reachability matrix R by using Warshall's algorithm. 


31. Exercise 1 33. Exercise 3 35. Exercise 5 
32. Exercise 2 34. Exercise 4 36. Exercise 6 


37. Given the binary relation p = {(1, 3), (3, 2), (2, 3)} on the set (1, 2, 3}, use Warshall's algorithm to find 


the transitive closure of p. 


38. Given the binary relation p = ((1, 2), (2, 3), (4, 1)} on the set (1, 2, 3, 4), use Warshall’s algorithm to 


find the transitive closure of p. 


39. Use Warshall's algorithm to find the transitive closure of the following binary relations on the set (1, 2, 3) 


(see Exercise 23 in Section 5.1). 

а. p = {(1,3), (3, 3), (3, D, (2, 2), (2, 3), (1, 1), (1, 2)} 
b. p = {(1, 1), (3, 3), (2, 2)} 

c. p = {(1, D, (1,2), (2, 3), (3, 1), (1, 3) 

d. p = {(1, 1), (1, 2), (2, 3), (1, 3)} 


40. Use Warshall’s algorithm to find the transitive closure of the following binary relations on the set 


{0, 1, 2, 4, 6} (see Exercise 24 in Section 5.1). 


a. p = (0, 0), (1, 1), (2, 2), (4, 4), (6, 6), (0, 1), (1,2), (2, 4), (4, 6)} 
b. p = {(0, 1), (1, 0), (2, 4), (4, 2), (4, 6), (6, 4)} 

c. p = {(0, 1), (1, 2), 0, 2), (2, 0), (2, 1), (1, 0), (0, 0), (1, 1), (2, D} 
d. p = {(0, 0), (1, 1), (2, 2), (4, 4), (6, 6), (4, 6), (6, 4)} 

e p= Ø 
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41. The following directed graph represents a binary relation p on the nodes. Draw the directed graph that 
would represent the transitive closure of p. 


42. The following directed graph represents a binary relation p on the nodes. Draw the directed graph that 
would represent the transitive closure of p. 


Exercises 43—46 use regular matrix multiplication to obtain information about a graph. 


43. Let G be a directed graph, possibly with parallel arcs, and let A be its adjacency matrix. Then A may not 
be a Boolean matrix. Prove that the i, j entry of matrix A? is the number of paths of length 2 from node i 
to node j. 


44. Let A be the adjacency matrix of a directed graph G, possibly with parallel arcs. Prove that the 7, j entry 
of matrix A" gives the number of paths of length л from node i to node j. 


45. For the following graph, count the number of paths of length 2 from node 1 to node 3. Check by 
computing A?. 


46. For the following graph, count the number of paths of length 4 from node 1 to node 5. Check by 
computing А“. 
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SECTION 7.2 | EULER PATH AND HAMILTONIAN CIRCUIT 


b 


Figure 7.5 


Euler Path Problem 


The Euler path problem (the highway inspector problem) originated many years 
ago. Swiss mathematician Leonhard Euler (pronounced *oiler") (1707—1783) was 
intrigued by a puzzle popular among the townsfolk of Kónigsberg (an East Prus- 
sian city later called Kaliningrad, which is in Russia). The river flowing through 
the city branched around an island. Various bridges crossed the river as shown in 
Figure 7.4. 


Figure 7.4 


The puzzle was to decide whether a person could walk through the city crossing 
each bridge only once. It is possible to answer the question by trial and error, list- 
ing (or walking) all possible routes, so some dedicated Königsberger could have 
solved this particular puzzle. Euler solved this problem, and indeed a more general 
version of the problem, by a better mechanism than trial and error. The problem 
is usually represented as a graph (Figure 7.5) where the bridges are arcs and the 
land masses (labeled a through d) are nodes. The more general question is to de- 
termine when an Euler path exists in any graph.! 


‘Euler did not actually represent the problem as a graph, and such a representation was not connected to 
Euler's work until over 150 years later. See "The Truth about Kónigsberg" by Brian Hopkins and Robin 
Wilson, The College Mathematics Journal, May 2004. 
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* DEFINITION EULER PATH 
An Euler path in a graph G is a path that uses each arc of С exactly once. 


PRACTICE 7 | Do Euler paths exist for either graph in Figure 7.6? (Use trial and error to answer. This 
is the old children’s game of whether you can trace the whole graph without lifting your 
pencil and without retracing any arcs.) 


1 2 1 2 
X O 
4 2 5 4 
(а) (b) 


Figure 7.6 


For this discussion we will assume that all graphs are connected, since an 
Euler path generally cannot exist otherwise. Whether an Euler path exists in a 
given graph hinges on the degrees of its nodes. A node is even if its degree is even 
and odd if its degree is odd. It turns out that every graph has an even number of 
odd nodes. To see this, choose any graph and let N be the number of odd nodes in 
it, N(1) the number of nodes of degree 1, N(2) the number of nodes of degree 2, and 
so on. Then the sum S of the degrees of all the nodes of the graph is 


S-1:NI) + 2: NQ -3- NO) +++ k- NW (1) 


for some k. This sum is, in fact, a count of the total number of arc ends in the 
graph. Because the number of arc ends is twice the number of arcs, S is an even 
number. We will reorganize equation (1) to group together terms for odd nodes 
and terms for even nodes: 


S= 2-NQ) + 4: NA) +--+ + 2m: Nm) 


even nodes 
+1- NI) *3-NQ3) t: + Qn + D): N2n + 1) 
odd nodes 


The sum of the terms representing even nodes is an even number. If we subtract it 
from both sides of the equation, we get a new equation— 


S' = 1- NO) + 3: NO) ++ (2л + 1): NOn + 1) (2) 


—where 5” (the difference of two even numbers) is an even number. Now if we 
rewrite equation (2) as 


N(1) terms N(3) terms 


F(2m- 1) + Qa 1) te Qn + 1) 
N(2n + 1) terms 
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we see that there are N terms altogether in the sum (the number of odd nodes) and 
that each term is an odd number. For the sum of N odd numbers to be even, N 
must be even. (Can you prove this statement?) We have thus proved the following 
theorem. 


* THEOREM ON ODD NODES IN A GRAPH 
The number of odd nodes in any graph is even. 


Now suppose a graph has an odd node n of degree 2k + 1 and that an Euler 
path exists in the graph but does not start at л. Then for each arc we use to enter 
n, there is another unused arc for leaving n until we have used k pairs of arcs. The 
next time we enter n, there is no new arc on which to leave. Thus, if our path does 
not begin at л, it must end at n. The path either begins at n or it does not, and in 
the latter case it ends at и, so the path either begins or ends at this arbitrary odd 
node. Therefore, if there are more than two odd nodes in the graph, there can be 
no path. Thus, there are two possible cases where an Euler path may exist—on a 
graph with no odd nodes or on one with two odd nodes. 

Consider the graph with no odd nodes. Pick any node т and begin an Euler 
path. Whenever you enter a different node, you will always have another arc on 
which to exit until you get back to т. If you have used up every arc of the graph, 
you are done. If not, there is some node m’ of your path with unused arcs. Then 
construct an Euler path beginning and ending at m’, much as you did the previous 
section of path, using all new arcs. Attach this cycle as a side trip on the original 
path. If you have now used up every arc of the graph, you are done. If not, continue 
this process until every arc has been covered. 

If there are exactly two odd nodes, an Euler path can be started beginning at 
one odd node and ending at the other. If the path has not covered all of the arcs, 
extra cycles can be patched in as in the previous case. 

We now have a complete solution to the Euler path problem. 


* THEOREM ON EULER PATHS 
An Euler path exists in a connected graph if and only if there are either no odd 
nodes or two odd nodes. For the case of no odd nodes, the path can begin at any 
node and will end there; for the case of two odd nodes, the path must begin at one 
odd node and end at the other. 


PRACTICE 8 | Using the preceding theorem, work Practice 7 again. 


PRACTICE 9 | Is the Kónigsberg walk possible? 


The theorem on Euler paths is actually an algorithm to determine whether 
an Euler path exists on an arbitrary connected graph. To make it look more like 
an algorithm, we'll rewrite it in pseudocode, but first we'll make a simplifying 
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assumption that the graph has no loops. If the graph G has loops, we can strip 
them off and consider the modified graph H. If H has an Euler path, then so does 
G—whenever we come to a node with a loop, we traverse the loop. If H has no 
Euler path, then neither does G. 

In the accompanying algorithm (algorithm EulerPath), the input is a connect- 
ed graph with no loops represented by an n X n adjacency matrix A. The essence 
of the algorithm is to count the number of nodes adjacent to each node and to 
determine whether this is an odd or an even number. If there are too many odd 
numbers, an Euler path does not exist. The variable total keeps track of the num- 
ber of odd nodes found in the graph. The degree of any particular node, degree, 
is found by adding the numbers in that node's row of the adjacency matrix. (This 
is why we exclude loops; a loop at node i adds only 1 to the adjacency matrix at 
position [i, i], yet such a loop contributes 2 arc ends.) The function odd results in 
a value "true" if and only if the argument is an odd integer. 


ALGORITHM 


EULERPATH 


EulerPath (n X n matrix A) 

//Determines whether an Euler path exists in a connected graph with 
//no loops and adjacency matrix A 

Local variables: 

integer total — //number of odd nodes so far found 

integer degree //the degree of a node 


integer i, j //array indices 
total — 0 
i=l 
while total <= 2 andi <= n do 
degree = 0 


for j = 1 ton do 
degree = degree + A|i,j] //бпа degree of node i (*) 
end for 
if odd(degree) then 
total = total + 1 //another odd degree node found 
end if 
і=і+ 1 
end while 


if total > 2 then 
write (“No Euler path exists") 
else 
write (“Euler path exists") 
end if 
end EulerPath 


| Section 7.2 Euler Path and Hamiltonian Circuit 575 


| EXAMPLE? | The adjacency matrix for the graph of Figure 7.6a follows. 


So mS em 
со © = O ә 
н шо н н 
ко с = с 
Some dem = 


When the algorithm first enters the while loop, total is 0 and i is 1. Then degree is 
initialized to 0. Within the for loop, the values of row 1 of the adjacency matrix are 
added in turn to degree, resulting in a value for degree of 3. The odd function applied 
to degree returns the value “true,” so the value of total is increased from 0 to 1; one 
node of odd degree has been found. Then i 15 incremented to 2. Neither the bounds on 
total nor the bounds on the array size have been exceeded, so the while loop executes 
again, this time for row 2 of the array. Once again, degree is found to be odd, so the 
value of total is changed to 2. When the while loop is executed for row 3 of the ar- 
ray, the value of degree is even (4), so total does not change, and the while loop is 
executed again with i = 4. Row 4 again produces an odd value for degree, so total 
is raised to 3. This terminates the while loop. The bad news is written that there is no 
Euler path because the number of odd nodes exceeds 2. ө 


PRACTICE 10 | Write the adjacency matrix for the Kónigsberg walk problem and trace the execution of 
algorithm EulerPath. 


Let us analyze algorithm EulerPath. The important operation done by the al- 
gorithm is an examination of the elements of the adjacency matrix, which occurs 
in the line marked at the end with an asterisk (*). In the worst case, the while loop 
in the algorithm is executed n times, once for each row. Within the while loop, the 
for loop, containing line (*), is executed и times, once for each column. EulerPath 
is therefore an O(n’) algorithm in the worst case. 

At the cost of some extra decision logic, we could modify the algorithm be- 
cause we never have to examine the last row of the matrix. We know from the 
theorem on Euler paths that the total number of odd nodes is even. If the number 
of odd nodes after processing the next-to-last row is odd, then the last row must 
represent an odd node; if that number is even, then the last row must represent an 
even node. This modification results in (n — 1)n elements to examine in the worst 
case, which is still O(n’). 

If we represented the graph G by an adjacency list rather than an adjacency ma- 
trix, then the corresponding version of the algorithm would have to count the length 
of the adjacency list for each node and keep track of how many are of odd length. 
There would be n adjacency lists to examine, just as there were n rows of the adja- 
cency matrix to examine, but the length of each adjacency list might be shorter than 
n, the length of a row of the matrix. It is possible to reduce the order of magnitude 
below n? if the number of arcs in the graph is small, but the worst case is still O(n”). 
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Hamiltonian Circuit Problem 


Another famous mathematician, William Rowan Hamilton (1805—1865), posed a 
problem in graph theory that sounds very much like Euler’s. He asked how to tell 
whether a graph has a Hamiltonian circuit. 


* DEFINITION HAMILTONIAN CIRCUIT 
A Hamiltonian circuit in a graph is a cycle using every node of the graph. 


(Recall that in a cycle, only the node that forms the beginning and the ending of 
the cycle is repeated.) 

An Euler path in a graph requires that each and every arc of the graph be used 
once and only once, but nodes can be repeated. A Hamiltonian circuit requires 
that each and every node of the graph be visited once and only once (except for the 
start node, which is also the end node) but there can be unused arcs; no arc can be 
used more than once because that would involve revisiting a node. 


PRACTICE 11 | Do Hamiltonian circuits exist for the graphs of Figure 7.6? (Use trial and error to answer.) „ 


Like the Euler path problem, the Hamiltonian circuit problem can be solved 
for a given graph by trial and error. The algorithm is as follows: Start from one 
node of the graph and try some path by choosing various arcs. If the path results 
in a repeated node, it is not a cycle, so throw it away and try a different path. If 
the path can be completed as a cycle, then see whether it visited every node; if not, 
throw it away and try a different path. Continue in this fashion until all possible 
paths have been tried or a Hamiltonian circuit has been found. This will involve 
some careful record keeping so that no path is tried more than once. The trial-and- 
error approach is theoretically possible—but it is practically impossible! In all but 
the smallest of graphs, there will simply be too many paths to try. 

Euler found a simple, efficient algorithm to determine, for an arbitrary graph, 
if an Euler path exists. Although the Hamiltonian circuit problem sounds very 
similar to the Euler path problem, there is a basic difference. No efficient algo- 
rithm has ever been found to determine if a Hamiltonian circuit exists. In fact, 
there is some evidence (see Section 9.3) to suggest that no such algorithm will 
ever be found. 

In certain types of graphs we can easily determine whether a Hamiltonian 
circuit exists. For example, an unconnected graph cannot have a Hamiltonian cir- 
cuit because there would be no way to construct a path reaching all nodes. A 
complete graph with n > 2 has a Hamiltonian circuit because for any node on 
the path, there is always an arc to travel to any unused node and finally an arc 
to return to the starting point. Exercise 37 describes an additional condition that 
guarantees the existence of a Hamiltonian circuit. But in general—that is, for an 
arbitrary graph—we cannot readily make a determination about the existence of 
a Hamiltonian circuit. 

Suppose we are dealing with a weighted graph. If a Hamiltonian circuit ex- 
ists for the graph, can we find one with minimum weight? This is the traveling 
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salesman problem. Once again it can be solved using trial and error by tracing all 
possible paths and keeping track of the weights of those paths that are Hamilto- 
nian circuits, but, again, this is not an efficient algorithm. (Incidentally, the travel- 
ing salesman problem for visiting all 48 capitals of the contiguous United States 
has been solved—a total of 10,628 miles is required!) 


SECTION 7.2 REVIEW 


TECHNIQUE MAIN IDEAS 


W) Using algorithm EulerPath, determine whether an * There is a simple criterion for determining whether 
Euler path exists in a graph. Euler paths exist in a graph but no such criterion 
for whether Hamiltonian circuits exist. 


* An algorithm that is @(n’) in the worst case can 
determine the existence of an Euler path in a con- 
nected graph with л nodes. 


EXERCISES 7.2 


1. Rework Example 3 of Chapter 2 using the theorem on Euler paths. Here is the graph, where the nodes 
have been numbered. 


A 
чө 


2. а. Adda single arc to the graph of Exercise 1 so that there is an Euler path. 
b. List the nodes in such a path. 


For Exercises 3—12, determine whether the given graph has an Euler path by using the theorem on Euler paths. 
If so, list the nodes in such a path. 


3. 1 4. 1 2 
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5.1 2 3 9. 
1 


4 
6. 10. үй 
3 
7. 11. 
8. 1 2 12 
3 
4 5 
6 


13. Draw the adjacency matrix for the graph of Exercise 3. In applying algorithm EulerPath, what is the value 
of total after the second pass through the while loop? 


14. Draw the adjacency matrix for the graph of Exercise 5. In applying algorithm EulerPath, what is the value 
of total after the fourth pass through the while loop? 


15. Draw the adjacency matrix for the graph of Exercise 7. In applying algorithm EulerPath, what is the value 
of i after the while loop is exited? 


Draw the adjacency matrix for the graph of Exercise 9. In applying algorithm EulerPath, what is the value 
of i after the while loop is exited? 


1 


e 
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The definition of an Euler path extends to directed graphs. Instead of just the degree of a node as the total num- 
ber of arc ends, we must now keep track of arcs coming into a node and arcs leaving a node. The total number 
of arc ends coming into a node is its in-degree; the total number of arc ends leaving a node is its out-degree. 
Exercises 17—20 talk about Euler paths in directed graphs. 


17. Describe two conditions on a connected directed graph, either of which would guarantee the existence of 
an Euler path. 


18. Determine whether this graph has an Euler path. If so, list the nodes in such a path. 


19. Determine whether this graph has an Euler path. If so, list the nodes in such a path. 


20. Determine whether this graph has an Euler path. If so, list the nodes in such a path. 


For Exercises 21—28, decide by trial and error whether Hamiltonian circuits exist for the graphs of the given 
exercise. If so, list the nodes in such a cycle. 

21. Exercise 3 23. Exercise 5 25. Exercise 7 27. Exercise 9 

22. Exercise 4 24. Exercise 6 26. Exercise 8 28. Exercise 11 


29. Prove that any graph with a Hamiltonian circuit is connected. 


30. Find an example of an unconnected graph that has an Euler path. (Hint: Because this seems intuitively 
contradictory, you should look for a trivial case.) 


31. 


32. 


33. 


34. 


35. 
36. 


37. 
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Consider a simple, complete graph with n nodes. Testing for a Hamiltonian circuit by trial and error could 
be done by selecting a fixed starting node and then generating all possible paths from that node of length n. 


a. How many paths of length n are there if repetition of arcs and nodes is allowed? 

b. How many paths of length n are there if repetition of arcs and nodes is allowed but an arc may not be 
used twice in succession? 

c. How many paths of length n are there if nodes and arcs cannot be repeated except for the starting node? 
(These are the Hamiltonian circuits.) 

d. To solve the traveling salesman problem in a weighted graph, assume a fixed starting point at node 1 
and generate all possible Hamiltonian circuits of length n to find one with minimum weight. If it takes 
0.000001 seconds to generate a single Hamiltonian circuit, how long will this process take in a simple, 
complete graph with 15 nodes? 


Is it possible to walk in and out of each room in the house shown in the following figure so that each door 
of the house is used exactly once? Why or why not? 


NE EN 


| a 


Recall that K,, denotes the simple, complete graph of order л. 


a. For what values of л does an Euler path exist in K,? 
b. For what values of п does a Hamiltonian circuit exist in K,? 


Recall that K,, „ denotes a bipartite, complete graph with т + n nodes. 
a. For what values of m and n does an Euler path exist in К, n? 


m,n* 
b. For what values of m and n does a Hamiltonian circuit exist in K,, „? 


Prove that a Hamiltonian circuit always exists in a connected graph where every node has degree 2. 


Consider a connected graph with 2n odd vertices, n = 2. By the theorem on Euler paths, an Euler path 
does not exist for this graph. 


a. What is the minimum number of disjoint Euler paths, each traveling some of the arcs of the graph, 
necessary to travel each arc exactly once? 
b. Show that the minimum number is sufficient. 


Ore's theorem (Oystein Ore, 1960) states that a Hamiltonian circuit exists in any graph G with the follow- 
ing properties: 

1. Gis a simple graph with n nodes, n = 3. 

2. For any two nonadjacent nodes x and y, degree(x) + degree(y) = n. 


Ore's Theorem is proved by contradiction in the following steps. 


a. Assume that a graph G with properties 1 and 2 above does not have a Hamiltonian circuit. Beginning 
with G, add new edges to produce a simple graph H that does not have a Hamiltonian circuit but would 
have such a circuit with the addition of any single new arc. Describe a process for creating H. 

b. Prove that H has a Hamiltonian path, that is, a path that visits each node exactly once. 

c. Denote the nodes on the Hamiltonian path by p = x), X», X4, ... , X, ,, x, = q. Prove that for any node 
х2 <i<n — l,ifan arc exists in Н between x; and p, then no arc exists in H between x;. , and q. 
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d. Using the result from part (c), prove that in graph H, degree( р) + degree(q) < n. 
e. Prove that in graph С, degree(p) + degree(q) < n. 
f. Conclude from part (e) that G has a Hamiltonian circuit. 

38. Ore's theorem (Exercise 37) gives a sufficient condition for a Hamiltonian circuit to exist, but it is not a 
necessary condition. Find a simple graph G with л nodes, n > 3, that has a Hamiltonian circuit but for 
which condition (2) does not hold. 


SECTION 7.3 | SHORTEST PATH AND MINIMAL SPANNING TREE 


Shortest-Path Problem 


Assume that we have a simple, weighted, connected graph, where the weights are 
positive. Then a path exists between any two nodes x and y. Indeed, there may be 
many such paths. The question is, How do we find a path with minimum weight? 
Because weight often represents distance, this problem has come to be known as 
the “shortest-path” problem. It is an important problem to solve for a computer 
or communications network, where information at one node must be routed to 
another node in the most efficient way possible, or for a transportation network, 
where products in one city must be shipped to another. 

The traveling salesman problem is a minimum-weight path problem with such 
severe restrictions on the nature of the path that such a path may not exist at all. 
In the shortest-path problem, we put no restrictions (other than minimum weight) 
on the nature of the path, and because the graph is connected, we know that such 
a path exists. For this reason we may hope for an efficient algorithm to solve the 
problem, even though no such algorithm is known for the traveling salesman prob- 
lem. Indeed such an algorithm does exist; it was published in 1959 by Edsger W. 
Dijkstra, a prominent computer scientist of the twentieth century. 

The shortest-path algorithm known as Dijkstra's algorithm works as follows. 
We want to find the minimum-distance path from a given node x to a given node 
y. We build a set (we'll call it IN) that initially contains only x but grows as the 
algorithm proceeds. At any given time IN contains every node whose shortest 
path from x, using only nodes in IN, has so far been determined. For every node 2 
outside IN, we keep track of the shortest distance d[z] from x to that node, using 
a path whose only non-IN node is z. We also keep track of the node adjacent to z 
on this path, 5[2]. 

How do we let IN grow; that is, which node should be moved into IN next? 
We pick the non-IN node with the smallest distance d. Once we add that node, 
call it p, to IN, then we have to recompute d for all the remaining non-IN nodes, 
because there may be a shorter path from x going through p than there was before 
p belonged to IN. So we compare the current distance of z from x, d[z], with the 
distance of p from x, d| p], plus the distance from p to 2, А[р, z] where A is the 
adjacency matrix. If there is a shorter path, we must also update 5[2 | so that p is 
now shown to be the node adjacent to z on the current shortest path, that is, s[z] 
is the node just before z on this path from x. As soon as y is moved into IN, IN 
stops growing. The current value of d| y] is the distance for the shortest path, and 
its nodes are found by looking at y, s[ y], s[s| у ]], and so forth, until we have traced 
the path back to x. 

A pseudocode form of the algorithm is given in the accompanying box. The 
input is the adjacency matrix for a simple, connected graph G with positive weights 
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and nodes x and y; the algorithm writes out the shortest path between x and y and 
the distance for that path. Here shortest path means minimum-weight path. We 
actually assume a modified adjacency matrix A, where A[i, j] is the weight of the 
arc between i and j if one exists and Ali, j] has the value o» if no arc exists (here 
the symbol © denotes a number larger than any weight in the graph). 


ALGORITHM 


DIJKSTRA’S ALGORITHM 


Dijkstra (п X п matrix A; nodes x, у) 

//Computes the shortest path between a source node x and a destination node y 
/hn a simple, connected graph with positive weights. A is a modified adjacency 
//matrix. Writes out nodes in the shortest path from x to y, and the 

//distance for that path. 

Local variables: 


set of nodes /N //set of nodes whose shortest path from x is known 
nodes z, p //temporary nodes 

array of integers d //for each node, the distance from x using nodes in /N 
array of nodes s //for each node, the previous node in the shortest path 
integer OldDistance // distance to compare against 


/hnitialize set JN and arrays d and s 
JE 
aix 
for all nodes z not in JN do 
dizl = АЕА 
slal = z 
end for 


//process nodes into JN 

while y not in ZN do 
//add minimum-distance node not in IN 
p = node z not іп ZN with minimum d [2] 
IN = INU {p} 


//recompute d for non JN nodes, adjust s if necessary 
for all nodes z not in JN do 
OldDistance = d|z] 
42] = min(a[z],d[p] + Alp, 2]) 
if d[z] 5 OldDistance then 
slz] =p 
end if 
end for 
end while 
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//write out path nodes 
write (“In reverse order, the path is”) 
write (у) 
а = 
repeat 
write (s[z]) 
z = s[z] 
until z — x 


// write out path distance 
write (“The path distance is", d| y]) 
end Dijkstra 


ТТЕ Consider the graph in Figure 7.7 and the corresponding modified adjacency matrix 


shown in Figure 7.8. 


x i 23 а > 
хо 3 8 4 o 10 
il 3 о о 6 со со 
A || des G9 Ci G9 7 со 
3|4 6 vo o | 3 
4| о o 7 1 o || 
y{10 © o 3 | eo 

Figure 7.7 Figure 7.8 


Let's trace Dijkstra's algorithm on this graph. At the end of the initialization phase, 
IN contains only x, d contains all the direct distances (arc weights) from x to other 
nodes, and x is the immediate predecessor of all nodes except x. (Because of the 
оо in position A[x, 4], there is no arc from x to 4, so s[4] is meaningless here but 
it simplifies the initialization.) 


IN = +} 
| х 1 2 3 4 y 
d | 0 3 8 4 c 10 
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In Figure 7.9, circled nodes are those in set JN, heavy lines show the current 


shortest paths, and the d value for each node is written along with the node label. 
Figure 7.9a is the picture after initialization. 


Figure 7.9 


We now enter the while loop and search through the d values for the node of 
minimum distance that is not in ZN; this turns out to be node 1, with d|1] = 3. We 
throw node 1 into JN, and in the for loop we recompute all the d values for the 


remaining nodes, 2, 3, 4, and y. 
р= 1 
IN = {x, 1} 
d|2] = min(8, 3 + A[1,2]) = min(8, ~) = 8 
d|3] = min(4, 3 + АИ, 3) = minG, 9) = 4 
d|4] = min(», 3 + A[1, 4]) = min(, o») = © 
d|y] = min(10, 3 + A[1, y]) = min(10, o») = 10 
There were no changes in the d values, so there were no changes in the s values 


(there were no shorter paths from x by going through node 1 than by going directly 
from x). Figure 7.9b shows that 1 is now in ZN. 


The second pass through the while loop produces the following: 


p = 3 (3 has the smallest d value, namely 4, of 2, 3, 4, or y) 
ЕИО 


REMINDER 


Distances in Dijkstra’s 
algorithm are always 
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d[2] = min(8, 4 + A[3, 2]) = min(8,4 + о) = 8 
44] = min(o, 4 + A[3, 4]) = min(~, 4 + 1) = 5 (a change, so update s[4] to 3) 


recomputed relative to the d| y] = min(10, 4 + A[3, y]) = min(10, 4 + 3) = 7 (a change, so update s[y] to 3) 


node most recently added 
to IN. 


1 2 9 4 y 
9 7 
X X X ©) 9 


Shorter paths from x to the two nodes 4 and y were found by going through 3, as 
reflected in Figure 7.9c. 
On the next pass, 


p = 4 (d value = 5) 

DS 

d|2] = min(8,5 + 7) = 8 

d|y] = min(7, 5 + 1) = 6 (a change, update s| y ]) 


| x 1 2 3 4 y 
d 0 3 8 5 6 
S = X X X 3 4 


See Figure 7.9d. 
Processing the while loop again, we get 
р=у 
Е 1, 3, 4, y} 
d|2| = min(8, 6 + о) = 8 


See Figure 7.9e. 

Now that y is part of IN, the while loop terminates. The path goes through 
y, sly] = 4,s[4] = 3, and s[3] = x. Thus the path uses nodes x, 3, 4, and y. (The 
algorithm gives us these nodes in reverse order.) The distance for the path is 
d| y] = 6. By looking at the graph in Figure 7.7 and checking all the possibilities, 
we can see that this is the shortest path from x to y. ө 


Dijkstra's algorithm terminates when y is put into ZN, even though there may 
be other nodes in the graph not yet in /N (such as node 2 in Example 8). How do 
we know that a still shorter path cannot be found through one of these excluded 
nodes? If we continue processing until all nodes have been included in JN, the 
d values then represent the shortest path from x to any node, using all the values 
in IN, that is, the shortest path using any nodes of the graph. But new nodes are 
brought into ZN in order of increasing d values. A node z that is brought into JN 
later than y must have as its shortest path from x one whose distance is at least as 
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great as the d value of y when y was brought into /N. Therefore there cannot be a 
shorter path from x to y via z because there is not even a shorter path just between 
x and z. 


PRACTICE 12 | Trace Dijkstra's algorithm on the graph shown in Figure 7.10. Show the values for p and /N 
and the d values and s values for each pass through the while loop. Write out the nodes of 
the shortest path and the distance of the path. 


Figure 7.10 L| 


When looking for the next node to bring into ZN in Dijkstra's algorithm, more 
than one node p may have a minimum d value, in which case p can be selected ar- 
bitrarily. There may also be more than one shortest path between x and y in a graph. 

Dijkstra's algorithm also works for directed graphs if the adjacency matrix 15 
in the appropriate form. And it works for unconnected graphs; if x and y are not 
in the same component, then d[ y] will remain œ throughout. After y has been 
brought into JN, the algorithm will terminate, and this value of о for d| y] will 
indicate that no path exists between x and y. 

We may think of Dijkstra's algorithm as being a “nearsighted” algorithm. It 
cannot see the entire graph at once to pick out overall shortest paths; it only picks 
out shortest paths relative to the set /N at each step. Such an algorithm 15 called 
a greedy algorithm—it does what seems best based on its limited immediate 
knowledge. In this case, what seems best at the time turns out to be best overall. 

How efficient is Dijkstra's algorithm? Most of the work seems to take place 
within the for loop that modifies the d and s arrays. Here the algorithm checks 
all n nodes to determine which nodes z are not in JN and recomputes d[z] for 
those nodes, possibly also changing s[z]. The necessary quantities d[z], d| p], and 
A[ p, z] for a given z are directly available. Therefore the for loop requires O(n) 
operations. In addition, determining the node p to add to /N can also be done in 
O(n) operations by checking all n nodes. With the additional small amount of 
work to add p to JN, each execution of the while loop takes O(n) operations. In the 
worst case, y is the last node brought into JN, and the while loop will be executed 
n — | times. Therefore the total number of operations involved in the while loop 
is Ө(п(п — 1)) = O(n’). Initialization and writing the output together take O(n) 
operations, so the algorithm requires O(n + п?) = O(n’) operations іп the worst 
case. 

What if we keep JN (or rather the complement of JV) as some sort of linked 
list, so that all the nodes of the graph do not have to be examined to see which 
are not in JN? Surely this would make the algorithm more efficient. Note that the 
number of nodes not in JN is initially n — 1, and that number decreases by 1 for 
each pass through the while loop. Within the while loop the algorithm thus has to 
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perform on the order of л — 1 operations on the first pass, then n — 2, then n — 3, 
and so on. But, as proof by induction will show, 


(n—1)* (n—2) 127 (n — 1)л/2 = O(n’) 


Thus the worst-case situation still requires O(n”) operations. 


Minimal Spanning Tree Problem 


A problem encountered in designing networks is how to connect all the nodes effi- 
ciently, where nodes can be computers, telephones, warehouses, pumping stations, 
and so on. A minimal spanning tree may provide an economical solution, one that 
requires the least cable, pipeline, or whatever the connecting medium is. For reli- 
ability, however, the minimal spanning tree usually would be supplemented with 
additional arcs so that if one connection were broken for some reason, an alterna- 
tive route could be found. 


SPANNING TREE 

A spanning tree for a connected graph is a nonrooted tree whose set of nodes 
coincides with the set of nodes for the graph and whose arcs are (some of) the 
arcs of the graph. 


A spanning tree thus connects all the nodes of a graph with no excess arcs (no 
cycles). There are algorithms for constructing a minimal spanning tree, a span- 
ning tree with minimal weight, for a given simple, weighted, connected graph. 
One of these algorithms, called Prim’s algorithm, proceeds very much like Dijks- 
tra’s algorithm. There is a set JN, which initially contains one arbitrary node. For 
every node z not in JN, we keep track of the shortest distance d[z] between z and 
any node in JN. We successively add nodes to JN, where the next node added is one 
that is not in JN and whose distance d[z] is minimal. The arc having this minimal 
distance is then made part ofthe spanning tree. Because there may be ties between 
minimal distances, the minimal spanning tree of a graph may not be unique. The 
algorithm terminates when all nodes of the graph are in JN. 

The key difference in the implementation of the two algorithms comes in the 
computations of new distances for the nodes not yet in /N. In Dijkstra's algorithm, 
if p is the node that has just been added to JN, distances for non-/N nodes are 
recalculated by 


d[z] = min(d[z], dlp] + Alp, z) 


that is, by comparing the current distance of z from x with the distance of p from 
x plus the distance of z from p. In Prim's algorithm, if p is the node that has just 
been added to JN, distances for non-/N nodes are recalculated by 


42] = min(d[z , Alp, z) 


that 15, by comparing the current distance of z from JN with the distance of z from р. 

We won't write out the algorithm (which, like Dijkstra's algorithm, requires 
Ө(п?) operations in the worst case and is a greedy algorithm); we will simply 
illustrate it with an example. 
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We will find a minimal spanning tree for the graph of Figure 7.7. We let node 1 be 
the arbitrary initial node in ZN. Next we consider all the nodes adjacent to any node 
in /N, that is, all nodes adjacent to 1, and select the closest one, which is node x. 
Now IN = (1, x}, and the arc between 1 and x is part of the minimal spanning tree. 
Next we consider all nodes not in /N that are adjacent to either 1 or x. The closest 
such node is 3, which is 4 units away from x. The arc between 3 and x is part of the 
minimal spanning tree. For IN = 11, х, 3}, the next closest node is node 4, 1 unit 
away from 3. The remaining nodes are added in the order y and then 2. Figure 7.11 
shows the minimal spanning tree. 


Figure 7.11 e 


PRACTICE 13 | Find a minimal spanning tree for the graph of Figure 7.10. 


Seismic sensing instruments are to be distributed at sites along a volcanic rift zone, 
as shown in Figure 7.12a, where the distances in meters between sites are given. 
(Distances between some sites are not shown because of natural hazards that would 
prevent a direct connection.) The most economical way to wire the devices so that 
they are all connected is to create a minimal spanning tree for the graph, as shown 
in Figure 7.12b. The total length of wire involved is 975 meters. 


172 172 


ЗЭ 
592 592 


321 


(а) (6) 


Figure 7.12 e 


SPECIAL INTEREST PAGE 


Pathfinding 


Pathfinding—as the name suggests—seeks to find a path 
from point x to point у, more specifically, the shortest 
path. This is a problem encountered in video games, 
when entities must move to a target point while avoid- 
ing obstacles. On a much larger scale, Google Maps and 
other Web mapping sites allow us to enter a source x and 
a destination y and, in a very short time, receive driving 
directions to get from x to y. 

Well, don’t we already know how to do that? 
Dijkstra’s algorithm is the classic shortest-path algo- 
rithm, but it's an G(r?) algorithm. In the context of a 
real-time video game, quick response is essential. For 
Google maps, an Ө(п?) algorithm might not be a prob- 
lem for a graph with 50, 500, or 2000 nodes, but scale 
up to the hundreds of thousands of cities and towns, and 
you'll want to find a way to speed up the search. This 
is an important problem, and much research has been 
directed toward improving shortest-path performance. 

One generalization of Dijkstra's algorithm, called 
the A* algorithm, is used by many video game design- 
ers. The exact techniques used by Google and other sites 
aren't publicly revealed, but the A* algorithm may well 
be part of their arsenal. The A* algorithm generalizes 
Dijkstra's algorithm by adding ideas borrowed from 
artificial intelligence. In fact, A*, strictly speaking, isn't 
even an algorithm because there's a big hole where one 
must plug in a “heuristic” (read an “educated guess"). 
The heuristic function used varies with the specifics of 
the application, and it's the hard part of using A*. 

Here's an outline of how A* works. The general 
problem is that there is a graph with a source node x 
and a destination node y (for simplicity, assume that it's 
a connected graph so that a path from x to y exists), and 
the arcs of the graph are weighted with known distances 
between nodes. At any point in time, a "closed set" 
represents nodes that have been examined and need not 
be considered further. There is also an “open set" that 
represents nodes available for evaluation as the next 
node along the path. Initially, only node x is in the open 
set. Evaluation of a node n consists of three values that 
are maintained for each node in the open set: 


G — the distance to get from x to n along the path 
being constructed 
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Н = the result of the heuristic function that “guesses” 
the distance to get from л to y. This is a guess because 
the exact path from л to y is not yet known. 
F=Gt+H 


Pseudocode: 
Put node x into the open set 
Repeat the following process until y gets moved to the 
closed set: 
Select the node p from the open set with the lowest F 
value (with some tie-breaking rule) 
Move p to the closed set 
For each node z that is adjacent to p and not in the 
closed set (you don’t want to go back to a previ- 
ous node on the path), do the following: 

If z is not in the open set, move z to the open set, 
compute its G, H, and F values (the G value 
is just the G value for p + the weight of the 
D-z arc), and set p as z's parent node. 

Ifz is in the open set, compute a new G value for 
z by going through node p; if this value is 
lower than z's current G value, then recom- 
pute z's F value (which will also be lower 
than before) and set p as z's parent node. 


Once the target node y is in the closed set, walk back 
through the parents to the start node x. Reversing this 
walk gives the shortest path from x to y. 

Note the similarities to Dijkstra's algorithm (DA). 
There is a closed set (like the /N set of DA) that begins 
with the start node and eventually includes the end 
node. There is a next node p that gets moved into the 
closed set (moved into /N in DA). Once p is moved into 
the closed set, G values (distances from x in DA) are 
recomputed to see whether going through node p is an 
improvement. The algorithm terminates when the tar- 
get node y is moved into the closed set (moved into JN 
in DA). The path 15 found by walking backward from y 
to x through a parent list (the s array in DA). 

The difference between A* and Dijkstra's algo- 
rithm is that in A* the next node p is chosen based on 
the lowest F value, whereas in Dijkstra's algorithm p is 
chosen based on the shortest distance from the source 
node (which is the same as the G value). So A* uses 
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F to make its choice and Dijkstra’s algorithm uses G, 
but F = G + Н. Dijkstra's algorithm is therefore A* 
with a heuristic function that always returns 0. By us- 
ing a non-zero heuristic H value, A* makes smarter 
choices (and therefore closes in on the shortest path 
faster) provided, of course, that the heuristic function 
is a good guess. If the heuristic function consistently 
overestimates the distance to the target, then the path 
computed will not be the shortest path. Another way 
to look at the difference between A* and Dijkstra’s 


algorithm is to remember that Dijkstra's algorithm is 
a greedy algorithm—it makes its decisions based on 
the local knowledge of how close adjacent nodes are to 
p. A* uses global knowledge (or assumed global 
knowledge) about not only how close adjacent nodes 
are to p but also how far they are from the target node. 


http://www.policyalmanac.org/games/aStarTutorial.htm 
http://www.heyes-jones.com/astar.html 
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SECTION 7.3 REVIEW 


TECHNIQUES MAIN IDEA 
(0) Find a shortest path from x to y in a graph (using ° Algorithms that are Ө(п?) in the worst case can 
Dijkstra’s algorithm). find a shortest path between two nodes or a mini- 
(D Find a minimal spanning tree for a graph (using mal spanning tree in a simple, positively weighted, 
Prim's algorithm). connected graph with п nodes. 
EXERCISES 7.3 


For Exercises 1-4, use the graph that follows. Apply Dijkstra's algorithm for the pairs of nodes given; show the 
values for p and /N and the d values and s values for each pass through the while loop. Write out the nodes in 
the shortest path and the distance of the path. 


1. From 2 to 5 
2. From 3 to 6 
3. From 1 to 5 
4. From 4 to 7 
For Exercises 5 and 6, use the graph that follows. Apply Dijkstra's algorithm for the pairs of nodes given; show 


the values for p and JN and the d values and s values for each pass through the while loop. Write out the nodes 
in the shortest path and the distance of the path. 


5. Froma toe 


6. Fromd toa 
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For Exercises 7 and 8, use the directed graph that follows. Apply Dijkstra's algorithm to the nodes given; show 
the values for p and JN and the d values and s values for each pass through the while loop. Write out the nodes 
in the shortest path and the distance of the path. 


7. From 1 to 7 

8. From 3 to 1 

9. a. Modify Dijkstra's algorithm so that it finds the shortest paths from x to all other nodes in the graph. 
b. Does this change the worst-case order of magnitude of the algorithm? 

10. Give an example to show that Dijkstra's algorithm does not work when negative weights are allowed. 


Another algorithm for finding shortest paths from a single source node to all other nodes in the graph is the 
Bellman—Ford algorithm. In contrast to Dijkstra's algorithm, which keeps a set of nodes whose shortest path 
(minimum-weight path) of whatever length (that is, number of hops) has been determined, the Bellman-Ford 
algorithm performs a series of computations that seeks to find successively smaller-weight paths of length 1, 
then of length 2, then of length 3, and so on, up to a maximum of length n — 1 (ifa path exists at all, then there 
is a path of length no greater than n — 1). A pseudocode description of the Bellman-Ford algorithm is given 
in the accompanying box; when using this algorithm, the adjacency matrix A must have A[i, i] = 0 for all i. 


ALGORITHM BELLMAN-FORD ALGORITHM 


Bellman—Ford(n X n matrix A; node x; array of integers d; array of nodes s) 
//Computes the shortest path between a source node x and all other nodes in a simple, 
/ weighted, connected graph. A is a modified adjacency matrix with A[i, i] = 0. 
//When procedure terminates, the nodes in the shortest path from x to a node y 
//are y, sLy], sls[y]], ... , x; the distance for that path is d| y]. 

Local variables: 

nodes z, p //temporary nodes 

array of integers / //temporary distance array created at each iteration 


//initialize arrays d and s; this establishes the shortest 1-length paths from x 
@| =O 
for all nodes z not equal to x do 
dizl = АБ 
sel =x 
end for 


//find shortest paths of length 2, 3, etc. 
for i = 2 ton — 1 do 
t=d //copy current array d into array t 
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//modify t to hold shortest paths of length i 

for all nodes z not equal to x do 
//find the shortest path with one more link 
р = node іп С for which (d|p] + А[р, 2]) is minimum 
[2] = dlp] + Alp.z] 


if p Az then 
slz] = p 
end if 
end for 
d=t;  //copy array t back into d 
end for 


end Bellman—Ford 


For Exercises 11—14 use the Bellman-Ford algorithm to find the shortest path from the source node to any other 
node. Show the successive d values and s values. 


11. Graph for Exercises 1—4, source node — 2 (compare your answer to Exercise 1) 
12. Graph for Exercises 1—4, source node = 1 (compare your answer to Exercise 3) 
13. Graph for Exercises 7-8, source node — 1 (compare your answer to Exercise 7) 
14. а. Accompanying graph, source node = 1 (compare your answer to Exercise 10) 
b. What does this say about the Bellman-Ford algorithm as opposed to Dijkstra's algorithm? 


5 diii 
1 


2 3 


To compute the distance for the shortest path between any two nodes in a graph, Dijkstra’s algorithm could be 
used repeatedly, with each node in turn as the source node. A different algorithm, Floyd 5 algorithm, can also 
be used to solve this “all pairs" shortest-path problem, but while Floyd’s algorithm produces the weight of 
all shortest paths, it does not calculate what the shortest paths actually are, that is, what nodes are on a given 
shortest path. Floyd's algorithm is very similar to Warshall’s algorithm. A description follows, where A is the 
adjacency matrix of the graph with Ali, i] = 0 for all i. 


ALGORITHM FLOYD’S ALGORITHM 


Floyd (n X n matrix A) 
//Computes the shortest path between any two nodes in a simple, weighted, 
//connected graph; A is a modified adjacency matrix with A[i,i] = 0. 
//Upon termination, А will contain all the shortest-path distances 
for k — 1 ton do 
for i = 1 ton do 
for j — 1 ton do 
if Ali, k] + A[ j] < A[i, j] then 
Alij] = Al; k] + Alk, | 
end if 
end for 
end for 
end for 
end Floyd 
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For Exercises 15 and 16, use Floyd’s algorithm to find the distances for all the shortest paths. Show the successive 
values of the A matrix for each pass through the outer loop. 


15. Figure 7.10 
16. Graph for Exercises 1—4 


For Exercises 17—20, use Prim’s algorithm to find a minimal spanning tree for the graph in the specified figure. 
17. Graph for Exercises 1—4 
18. 


19. 


Kruskal’ algorithm is another algorithm for finding a minimal spanning tree in a connected graph. Whereas 
Prim's algorithm "grows" the tree from an arbitrary starting point by attaching adjacent short arcs, Kruskal's 
algorithm adds arcs in order by increasing distance wherever they may be in the graph. Ties are resolved 
arbitrarily. The only restriction is that an arc is not added if adding it would create a cycle. The algorithm 


terminates when all nodes have been incorporated into a connected structure. A (very informal) pseudocode 
description follows: 


ALGORITHM KRUSKAL’S ALGORITHM 


Kruskal (n х n matrix A; collection of arcs Т) 
//Finds a minimal spanning tree; 7 15 initially empty; 
//at termination, T = minimal spanning tree 
order arcs in G by increasing distance 
repeat 
if next arc in order does not complete a cycle then 
add that arc into Т 
end if 
until 7 is connected and contains all nodes of G 
end Kruskal 
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For Exercises 21—24 use Kruskal’s algorithm to find the minimal spanning tree. 
21. Graph for Exercises 1—4 23. Graph for Exercise 19 
22. Graph for Exercise 18 24. Graph for Exercise 20 
25. Give an example to show that adding the node closest to JN at each step, as is done in Prim’s minimal 
spanning tree algorithm, will not guarantee a shortest path. 
26. Let a be the arc of lowest weight in a weighted graph. Show that a must be an arc in any minimal spanning tree. 
27. Acity plans to lay out bike paths connecting various city parks. A map with the distances between the parks 


is shown in the figure. (Some direct connections would have to cross major highways, so these distances 
are not shown in the map.) Find which paths to pave so that all parks are connected but the cost is minimal. 


28. Assume that arc weights represent distance. Then adding new nodes and arcs to a graph may result in a 
spanning tree for the new graph that has less weight than a spanning tree for the original graph. (The new 
spanning tree could represent a minimal-cost network for communications between a group of cities ob- 
tained by adding a switch in a location outside any of the cities.) 


a. Find a spanning tree of minimum weight for the following labeled graph. What is its weight? 


100 


100 100 


b. Put a node in the center of the square. Add new arcs from the center to the corners. Find a spanning tree 
for the new graph, and compute its (approximate) weight. 


29. At the beginning of this chapter, you received the following assignment: 


You are the network administrator for a wide-area backbone network that serves your company’s many offices 
across the country. Messages travel through the network by being routed from point to point until they reach 
their destination. Each node in the network therefore acts as a switching station to forward messages to other 
nodes according to a routing table maintained at each node. Some connections in the network carry heavy traf- 
fic, while others are less used. Traffic may vary with the time of day; in addition, new nodes occasionally come 
on line and existing nodes may go off line. Therefore you must periodically provide each node with updated 
information so that it can forward messages along the most efficient (that is, the least heavily traveled) route. 


How can you compute the routing table for each node? 


You realize that you can represent the network as a weighted graph, where the arcs are the connections 
between nodes and the weights of the arcs represent traffic on the connections. The routing problem then be- 
comes one of finding the shortest path in the graph from any node to any other node. Dijkstra's algorithm can 
be used to give the shortest path from any one node to all other nodes (see Exercise 9), so you could use the 
algorithm repeatedly with different start nodes. Or you could use Floyd's algorithm. Discuss the advantages 
and disadvantages of each approach, including an analysis of the order of magnitude of each approach. 
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SECTION 7.4 


TRAVERSAL ALGORITHMS 


So far this chapter has considered various path questions about a graph G. Is there 
a path in G from node x to node y? Is there a path through G that uses each arc 
once? Is there a path through G that ends where we started and uses each node 
once? What is the minimum-weight path between x and y? In this section we deal 
with a simpler problem—we only want to write down all the nodes of a simple, 
connected graph G in some orderly way. This means we must find a path that 
visits each node at least once, but we can visit it more than once if we don’t write 
it down again. We can also retrace arcs on the graph if necessary, and clearly this 
would in general be necessary if we were to visit each node in a tree. This process 
is called graph traversal. We already have several mechanisms for tree traversal 
(Section 6.2). The two algorithms in this section generalize traversal to apply to 
any simple, connected graph. 


Depth-First Search 


In the depth-first search algorithm for graph traversal, we begin at an arbitrary 
node a of the graph, mark it visited, and write it down. We then strike out on a 
path away from a, visiting and writing down nodes, proceeding as far as pos- 
sible until there are no more unvisited nodes on that path. We then back up the 
path, at each node exploring any new side paths, until finally we retreat back to 
a. We then explore any new paths remaining from a. Figure 7.13 shows a graph 
after the first few nodes (marked by circles) have been visited using depth-first 
search. 


Figure 7.13 


For a more formal description of the depth-first search algorithm we will use 
recursion, where the algorithm invokes itself in the course of its execution. In the 
following algorithm, the input is a simple, connected graph G and a specified node 
а; the output is a list of all nodes in G in depth-first order from a. 
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ALGORITHM 


DEPTHFIRST 


DepthFirst(simple, connected graph G; node a) 
//Writes nodes in graph G in depth-first order from node a 


mark a visited 

write (a) 

for each node n adjacent to a do 
if n not visited then 

DepthFirst(G, n) 

end if 

end for 

end DepthFirst 


REMINDER 


In a depth-first search, go 
as far as possible, then 
back up, catching any 
paths missed on the way 
down. 


In the recursive step, the algorithm is invoked with a new node specified as the 
starting point. We have not indicated here how to mark visited nodes or how to 
find those nodes n that are adjacent to a. 


We will apply depth-first search to the graph of Figure 7.13, where a is the initial 
node. We first mark that we have visited a (it’s helpful in tracing the execution of 
the algorithm to circle a visited node), and then we write out a. Next we search the 
nodes adjacent to a for an unvisited node. We have a choice here (b, e, h, and i); let us 
select node b. (Just so we all get the same answers, let's agree to choose the node 
that is alphabetically first when we have a choice; in practice, the choice would 
be determined by how the vertices were stored in the graph representation.) Then 
we invoke the depth-first search algorithm beginning with node b. 

We go back to the beginning of the algorithm, where the specified node is 
now b rather than a. Thus we first mark b visited and write it out. Then we search 
through nodes adjacent to b to find an unmarked node. Nodes a and c are adjacent 
to b, but node a is marked as already visited. Node c will do, and we invoke the 
depth-first search algorithm beginning with node c. 

Node c is marked and written out, and we look for unmarked nodes adjacent to 
c. By our alphabetical convention, we select node d from the set (d, f }. Continuing 
in this fashion, after visiting node d we next visit node fand then node g. When we 
get to node g, we have reached a dead end because there are no unvisited adjacent 
nodes. Thus the for loop of the instance of the algorithm invoked with node g is 
complete. (The graph at this point looks like Figure 7.13.) 

We are therefore done with the algorithm for node g, but node g was (possibly 
one of) the unmarked nodes adjacent to node /, and we are still in the for loop for the 
instance of the algorithm invoked with node f. As it happens, g is the only unvisited 
node when we are processing f; therefore we complete the for loop and thus the algo- 
rithm for node f. Similarly, backing up to node d, the algorithm finds no other adjacent 
unmarked nodes, and it backs up again to the instance of the algorithm invoked with 
node c. Thus, after processing node d and everything that came after it until the dead 
end, we are still in the for loop for the algorithm applied to node c. We look for other 
unmarked nodes adjacent to c and find one—node e. Therefore we apply depth-first 
search to node e, which leads to nodes Л, i, and k before another dead end is reached. 
Backing up, we have a final new path to try from node Л, which leads to node j. The 
complete list of the nodes, in the order in which they would be written out, is 
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Example 11 makes the depth-first search process sound very complex, but it is 
much easier to carry out than to write down, as you will see in Practice 14. 


PRACTICE 14 | Write the nodes in a depth-first search of the graph in Figure 7.14. Begin with node a. 


Figure 7.14 


Breadth-First Search 


In breadth-first search, beginning at an arbitrary node a, we first fan out from node 
a to visit nodes that are adjacent to a; then we fan out from those nodes, and so on, 
almost like the concentric circles of ripples in a pond. Figure 7.15 shows the first few 
nodes visited in the same graph as Figure 7.13, this time using breadth-first search. 


Figure 7.15 
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To write the breadth-first search algorithm in an elegant fashion, we will use a 
queue structure. A queue is simply a line in which new arrivals are added at the back 
and departures take place at the front. A checkout line in a grocery store is an example 
of a queue of customers—a new customer joins the line at the back and departures 
take place from the front of the line as customers are checked through. The addition of 
an entry at the back of a queue is called an enqueue operation, and a departure from 
the front of the queue is called a dequeue operation. Thus the notation enqueue(a, О) 
denotes adding a to the end of a queue called О, and dequeue(Q) denotes removal of 
the entry currently at the front of О. We'll also use a function front(Q), which returns 
the value of the entry currently at the front of О but does not remove that entry. In the 
following algorithm, the input is a simple, connected graph G and a specified node a; 
the output is a list of all nodes in G in breadth-first order from a. 


ALGORITHM BREADTHFIRST 


BreadthFirst(simple, connected graph G; node a); 

//writes nodes in graph С in breadth first order from node a 
Local Variable: 

queue of nodes О 


initialize О to be empty 
mark a visited 
write(a) 
enqueue(a, Q) 
while Q is not empty do 
for each node n adjacent to front(Q) do 
if n not visited then 
mark n visited 
write(n) 
enqueue(n, О) 
end if 
end for 
dequeue(Q) 
end while 
end BreadthFirst 


Let's walk through the algorithm for a breadth-first search of the graph of Figure 
7.15 beginning at node a (this is the same graph on which we did the depth-first 
search in Example 11). We begin by initializing an empty queue Q, marking node a 
as visited, writing it out, and adding it to the queue. When we first reach the while 
loop, the queue is not empty and a is the entry at the front of the queue. In the for 
loop, we look for unvisited nodes adjacent to a to visit, write them out, and add 
them to the back of the queue. We may have a choice of nodes to visit here; as be- 
fore, and purely as a convention, we will agree to visit them in alphabetical order. 
Thus the first time we complete the for loop, we have visited and written out b, e, 
h, and i, in that order, and added them to the queue. The graph at this point looks 
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like Figure 7.15. We then remove a from the front of the queue, which as a result 
contains (from front to back) 


b,e,h,i 


In the next iteration of the while loop, 5 is the front element in the queue, and the 
for loop searches for unvisited nodes adjacent to 5. The only previously unvisited 
node here is c, which gets marked as visited, written out and added to the queue. 
After removing b, the queue contains 


@, (D. d (€ 


Performing the while loop again, e is at the front of the queue. A search of the 
nodes adjacent to node e produces one new node, node j. The graph now looks like 
Figure 7.16, and after removing e the queue contains 


h, il CH 


Figure 7.16 


When searching for nodes adjacent to Л, we pick up one new node, node k. When 
searching for nodes adjacent to i, no new nodes are added to the queue. When c 
becomes the first element in the queue, a search for nodes adjacent to c turns up 
two new nodes, d and f. After adding these to the queue (and removing c), the 
queue contains 


J, k, d, f 


Looking for nodes adjacent to j and then to k adds no new nodes to the queue. 
When the front of the queue is d, a new node g is found, and the queue (after re- 
moving d) is 


DE 


Section 7.4 Traversal Algorithms 601 


Processing f and then g yields no new nodes. After g is removed from the queue, 
the queue is empty. The while loop—as well as the algorithm—terminates. The list 
of nodes written out by this process, that is, the nodes in breadth-first order from 
a, are 


a, b, Є. h, i, GJ k, @ Jf, 8 е 


Like the depth-first search, the breadth-first search is not difficult to trace; one 
must just keep track of the nodes that have been visited and the current contents 
of the queue. 


PRACTICE 15 | Write the nodes in a breadth-first search of the graph in Figure 7.14, beginning with node a. 


Analysis 


How much work do the depth-first search and breadth-first searches perform? 
Both algorithms look for all unvisited nodes adjacent to a given node. Suppose 
the graph contains n nodes and m arcs. One of the advantages of representing a 
graph as an adjacency list rather than an adjacency matrix is that this particular 
operation is more efficient; to find nodes adjacent to node i requires traversing i's 
adjacency list, which may be short, rather than traversing row i of the adjacency 
matrix, which must contain n entries. Therefore we will assume an adjacency list 
representation of the graph. 

In breadth-first search, the algorithm searches all at one time the entire ad- 
jacency list of the node at the front of the queue, marking, writing out, and en- 
queuing the unvisited nodes found. In the depth-first search, the algorithm may be 
interrupted many times while traversing the adjacency list of a given node to go 
off (by virtue of the recursion) and process sections of the adjacency lists of other 
nodes. Eventually, however, every adjacency list is completely covered. 

Traversing the adjacency lists of the graph drives the amount of work done in 
either search. There are n adjacency lists, so the amount of work is at least O(n) 
because each adjacency list must be checked. Because there are m arcs, the work 
in traversing the total length of all the adjacency lists is at least O(m). Therefore 
both depth-first search and breadth-first search are O(max(n, m)) algorithms. If 
there are more arcs than nodes (the usual case), then O(max(n, m)) = Ө(т). 


Applications 


Depth-first search and breadth-first search can be used as the basis for performing 
other graph-related tasks, some of which we have solved before. A nonrooted tree 
structure that is a subgraph ofthe original graph can be associated with each search. 
When traversing node 7's adjacency list, if node j is adjacent to i and is previously 
unvisited, then the 7—/ arc is added to this subgraph. Because no arc to a previously 
visited node is used, cycles are avoided and the subgraph is a nonrooted tree. Be- 
cause all nodes ultimately are visited (for the first time), these trees are spanning 
trees for the graph. Each tree has n — 1 arcs, the minimal number of arcs to connect 
n nodes. Here we are assuming that arcs are unweighted, but if we consider them to 
be weighted arcs, each with weight 1, then these trees are minimal spanning trees. 


PRACTICE 16 
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The dark lines in Figure 7.13 are part of the depth-first search tree associated 
with the search of Example 11, and the dark lines in Figures 7.15 and 7.16 are part 
of the breadth-first search tree associated with the search of Example 12. 


a. Complete the depth-first search tree for Example 11. 
b. Complete the breadth-first search tree for Example 12. a 


The depth-first search and breadth-first search algorithms apply equally well 
to directed graphs and in the process yield a new algorithm for reachability. To 
determine whether node j is reachable from node i, do a depth-first (or breadth- 
first) search beginning at node i; when the algorithm terminates, check whether 
node j has been visited. “АП pairs" reachability, that 1s, which nodes are reach- 
able from which nodes, can thus be determined by running depth-first or breadth- 
first searches using each node in turn as the source node. This process would 
require O(n * max(n, m)) work. If the graph is very sparse, in which case we have 
max(n, т) = n, we would have an O(n’) algorithm for reachability. Recall that 
Warshall's algorithm (Section 7.1) was an Ө(т?) algorithm. The improvement 
comes about because in a sparse graph, most adjacency lists will be short or emp- 
ty, whereas Warshall’s algorithm processes entries in the adjacency matrix even if 
those entries are Os. But if the graph is not sparse, the number of arcs can be O(n’), 
in which case O(n * min(n, т)) = O(m), the same as Warshall's algorithm. In ad- 
dition, Warshall’s algorithm has the advantage of succinct implementation. 

In Section 5.2 we defined a topological sort as a way to extend a partial order- 
ing on a finite set to a total ordering. Let the partially ordered set be represented 
by a directed graph. The topological sort will be achieved by counting the nodes, 
so let the initial value of the count be 0. Pick a node as a source node and perform 
a depth-first search from this node. Whenever the search backs up from a node for 
the final time, assign that node the next counting number. When the depth-first 
search algorithm terminates, pick an unvisited node (if one exists) to be the source 
for another depth-first search, and continue to increment the counting number. 
Continue this process until there are no unvisited nodes left in the graph. A to- 
pological sort results by ordering the nodes in the reverse order of their counting 
number. This process for topological sorting works because we assign the count- 
ing number when we back up from a node for the final time. Its counting number 
will then be higher than the numbers of all the nodes reachable from it, that is, all 
the nodes of which it is a predecessor in the partial ordering, 


Figure 7.17a is a directed graph that represents a partial ordering. Choosing d 
(arbitrarily) as the source node and performing a depth-first search, we visit e and 
f, at which point we must back up. Node fis assigned the counting number 1, but 
we are not yet done with e, because we can go on to visit g. Backing up from g, 
g is assigned the counting number 2. At this point we back up from e for the final 
time and assign e the number 3 and then d the number 4. We choose a as the source 
node for another search. We visit node c and then must back up, so c and a are as- 
signed the numbers 5 and 6, respectively. Beginning with b as a source node, there 
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is nowhere to go, and b is assigned the number 7. There are no unvisited nodes 
left in the graph, so the process stops. The numbering scheme is shown in Figure 


ТЫП 


а (6) 


b (7) 


(а) (b) 


Figure 7.17 


In reverse order of the counting numbers, we get 


Ss 
© 
Су 
Q 
« 
09 № 
Er 


which is a topological ordering. 


PRACTICE 17 | Use the depth-first search algorithm to do a topological sort on the graph in Figure 7.18. 
Indicate the counting numbers on the graph. 


Figure 7.18 


8 (2) 


Now consider a graph G (undirected) that may not be a connected graph. A 
connected component of G is a subgraph of G that is both connected and not a 
subgraph of a larger connected subgraph. In Figure 7.19 there are three connected 
components. Of course, if the original graph is connected, then it has only one 


connected component. 


Figure 7.19 
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A depth-first or breadth-first search can be used to find the connected com- 
ponents of a graph. We pick an arbitrary node as a source node and then conduct 
a search. When the algorithm terminates, all visited nodes belong to one compo- 
nent. We then find an unvisited node in the graph to serve as a source for another 
search, which will produce a second component. We continue this process until 
there are no unvisited nodes in the graph. 

Although we defined reachability only for directed graphs, the concept also 
makes sense for undirected, unconnected graphs. Let us consider only simple un- 
directed, unconnected graphs but impose the convention that, even though there 
are no loops, each node is reachable from itself. Reachability then becomes an 
equivalence relation on the set of nodes of the graph; our convention imposes the 
reflexive property, and symmetry and transitivity follow because the graph is un- 
directed. This equivalence relation partitions the nodes of the graph into equiva- 
lence classes, and each class consists of the nodes in one connected component 
of the graph. Warshall’s algorithm can be applied to undirected graphs as well as 
directed graphs. Using Warshall’s algorithm results in a matrix from which the 
nodes making up various components of the graph can be determined, but this 
requires more work than using the depth-first search. 

As a final remark about depth-first search, we saw in Section 1.5 that the 
programming language Prolog, when processing a query based on a recursive 
definition, pursues a depth-first search strategy (Example 40). 


SECTION 7.4 REVIEW 


TECHNIQUES MAIN IDEAS 
Ù Conduct a depth-first search of a graph. * Algorithms exist to visit the nodes of a graph 
QJ Conduct a breadth-first search of a graph. systematically. 


EXERCISES 7.4 


* Depth-first and breadth-first searches can serve as a 
basis for other tasks. 


For Exercises 1—6, write the nodes in a depth-first search of the following graph, beginning with the node specified. 


/_ 


8 
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For Exercises 7-10, write the nodes in a depth-first search of the following graph, beginning with the node 
specified. 


7. a 8. е 9. f 10. A 


For Exercises 11—16, write the nodes in a breadth-first search of the graph for Exercises 1—6, beginning with 
the node specified. 


ll.a 12. c 13. d 14. g 15. e 16. А 


For Exercises 17—20, write the nodes in a breadth-first search of the graph for Exercises 7—10, beginning with 
the node specified. 


ILa 18e 19.7  20.h 


For Exercises 21—24, write the nodes in a depth-first search of the following graph, beginning with the node 
specified. 
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For Exercises 25—28, write the nodes in a breadth-first search of the graph for Exercises 21—24, beginning with 
the node specified. 


25.a 26. g 27. f 28. e 

29. In the computer network in the accompanying figure, the same message is to be broadcast from node C 
to nodes A, E, F, and С. One way to do this is to find the shortest path from C to each of these nodes and 
send out multiple copies of the same message. A more efficient approach is to send one copy out from C 


along a spanning tree for the subgraph containing the nodes involved. Use the depth-first search algorithm 
to find a spanning tree for the subgraph. 


F 


30. Using the graph for Exercise 29, use the breadth-first search algorithm to find a spanning tree for the subgraph. 


3]. Use the depth-first search algorithm to do a topological sort on the following graph. Indicate the counting 
numbers on the graph. Also state the starting node or nodes for the search. 


b 


32. Use the depth-first search algorithm to do a topological sort on the following graph. Indicate the counting 
numbers on the graph. Also state the starting node or nodes for the search. 


b 


33. The data structure used to implement a breadth-first search is a queue. What is the appropriate data structure 
to implement a depth-first search? 


34. Find a way to traverse a tree in level order, that is, so that all nodes at the same depth are listed from left 
to right for increasing depth. (Hint: We already have a way to do this.) 


35. Describe how the depth-first search algorithm can be used in a connected (undirected) graph to detect 
the presence of cycles in the graph. (While it is simple to look at Figure 7.13 and see that a—b—c—e-a, for 
example, is a cycle, in a huge graph with thousands of node and arcs, a cycle may be less easy to spot, in 
addition to which you might not even have a visual representation.) 


36. a. Describe the order in which nodes are visited in a breadth-first search of the bipartite complete graph К, 
b. Describe the order in which nodes are visited in a depth-first search of the bipartite complete graph K, 
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SECTION 7.5 


* DEFINITION 


© DEFINITION 


ARTICULATION POINTS AND COMPUTER NETWORKS 


The Problem Statement 


In a graph that represents a computer network, the nodes denote the communicat- 
ing entities (end-user computers, servers, routers, and so on) and the arcs denote 
the communications medium (coaxial cable, fiber optic, and so on). Such a graph 
should be a connected graph, so that there is a path between every pair of nodes. 
To minimize the length of cable or wire required, we would choose a minimum 
spanning tree. However, if an arc in a minimum spanning tree is removed (for 
example, that section of cable or wire is damaged or broken), then the graph is no 
longer connected. Each arc becomes a single point of failure for the network. That 
is why such a network usually contains more arcs than just those of a minimal 
spanning tree. However, even in a graph sufficiently rich in arcs to withstand the 
loss of a single arc, a node may be a single point of failure. If the node fails (and 
thus is logically removed), the arcs of which that node is an endpoint are disabled 
and the result may be a disconnected graph. 


ARTICULATION POINT 
A node in a simple, connected graph is an articulation point if its removal 
(along with its attached arcs) causes the remaining graph to be disconnected. 


Node d in the graph of Figure 7.20a is an articulation point. Removing d results in 
the disconnected graph of Figure 7.20b. 


а а 

Ь e 5 © 

d 
е j e f 
8 8 
(а) (b) 

Figure 7.20 e 

BICONNECTED GRAPH 


A simple, connected graph is biconnected if it has no articulation points. 
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The presence of articulation points is clearly an undesirable feature of a net- 
work. Although it is easy to spot an articulation point in a graph as small as that 
of Figure 7.20a, we will develop an algorithm that will detect such points no mat- 
ter how large the graph (and, of course, does not require a visual representation 
of the graph). Articulation points separate the graph into biconnected compo- 
nents, subgraphs that are biconnected and are not subgraphs of larger biconnected 
subgraphs. In Figure 7.20, a—b—d—c and d—e—g—/ are biconnected components. 


The Idea behind the Algorithm 


The key to this algorithm is depth-first search. We know from the previous section 
that a depth-first search determines a nonrooted tree. An arc is added to the tree 
whenever the search progresses to a previously unvisited node. Arcs of the graph 
belonging to this tree are called tree arcs. The remaining arcs in the graph are 
called back arcs. 


In Figure 7.20 a depth-first search from node a visits nodes in the order a, b, d, c, 
e, g, and f. In Figure 7.21 the tree arcs are dark, and the back arcs are light. 


а 
Ь G 
d 
e jJ 
& 
Figure 7.21 e 


The depth-first search tree passes through all nodes. To detect nodes that are 
articulation points, we examine their relative positions in the tree. First consider 
the single node that is the starting point of the depth-first search tree. If only one 
tree arc emanates from the starting node, then as the tree continues, all other 
nodes in the graph can be reached from the node at the other end of that tree arc. 
Therefore, removing the starting node will not disconnect the graph. However, if 
two or more tree arcs emanate from the starting node, then the only way to get 
from one subtree to another is to pass back through the starting node. In this case, 
removing the starting node disconnects the graph. 

Thus in Figure 7.21 node a is the starting node of the depth-first search tree, 
and there is a single tree arc emanating from a. Removing node a (and its two arcs) 
does not disconnect the graph. Had we begun a depth-first search at node d, how- 
ever, the tree would have looked like Figure 7.22. There would be two tree arcs 
coming from node d, showing that d is an articulation point. 
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a 
b c 
d 
e f 
8 
Figure 7.22 


Consider any node л that is a leaf of the depth-first search tree (attached to the 
end ofa single tree arc). Such a leaf may be a leaf of the graph itself, that is, a node 
of degree 1, in which case it is clearly not an articulation point. If not, then the 
other arcs emanating from n were not used in the depth-first search, so the nodes 
adjacent to л are reachable through alternative paths that do not go through л. 
Because n is not needed on a path to any other node, its removal does not discon- 
nect the graph. Therefore no leaves of the depth-first search tree are articulation 
points. In Figure 7.21 node c, for example, is a leaf of the depth-first search tree; 
the arc from c to a is a back arc, so node a is accessible through another route that 
does not require node c. Node c can be removed without disconnecting the graph. 

Now consider a node 7 that is not a leaf in the depth-first search tree and is 
not the starting node. Because n is not a leaf, there are one or more subtrees below 
n. Suppose there is a single subtree; let x be a node on this subtree. If x has a back 
arc to some node that precedes n in the depth-first search (an “ancestor” of n), then 
this arc provides part of an alternative path for x—and all other nodes in the sub- 
tree—to be connected with the rest of the graph without using node л. In this case 
nis not an articulation point. (See Figure 7.23a, where removing n and its attached 
arcs does not disconnect the graph.) If there is more than one subtree below n, then 
n will not be an articulation point if and only if each subtree has such an "escape 
route" allowing it to connect with the rest of the graph—including the other sub- 
trees—without going through n. (See Figure 6.23b; note that the back arc from y 
to z does not help because it does not reach back to an ancestor of л.) 


r 
Escape route 


Ms 


Escape route 


(a) (b) 
Figure 7.23 
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The Algorithm Itself 


The key to the algorithm, as we may guess from the foregoing discussion, is keeping 
track of the destinations of the back arcs. We will assign a tree number to each node 
that corresponds to the order in which that node is visited in the depth-first search. 
Thus the starting node in a depth-first search has tree number 1, the next node visited 
has tree number 2, and so on. In addition, we will maintain a “back number” for each 
node x. The back number will be the minimum tree number of a node (the farthest 
back node) reachable from x using either back arcs from x or from descendants of x in 
the subtree. To incorporate information about back arcs of descendants of x, the back 
number of x is adjusted when the depth-first search backs up to node x from farther 
down in the tree. Node л is an articulation point whenever a subtree of n has no back 
arc to an ancestor of n, and this circumstance is detected when the search backs up to 
n from the subtree. Suppose the search is backing up from x to n. If the back number 
of x at this point is not smaller than the tree number of л, then n is an articulation point. 

The following algorithm carries out the depth-first search and builds the 
depth-first search tree. It correctly handles both leaves and nonstarting-node non- 
leaves of the depth-first search tree, leaving only the starting node as a special 
case. It also assumes that the graph structure itself contains the tree number and 
back number for each node. 


ALGORITHM 


ARTPOINT 


ArtPoint(graph G; node n; integer TreeNumber) 

//detects articulation points in G by depth-first search from n; graph G 
//also maintains a tree number TN and back number BN value for each 
//node; TreeNumber = 0 when first invoked. 


Local variable: 
node x //temporary node 


mark n visited //first encountering n, assign its numbers 
TreeNumber = TreeNumber + 1 
ТМп| = TreeNumber 
BN[n] = TNÍn] 
for each node x adjacent to n by a nontree edge do 
if x not visited then 
make n-x a tree edge 
ArtPoint(G, x, TreeNumber) 


//depth-first search now backing up to n from x 
if BN[x] >= TN[n then /lline 1 
write(“n is an articulation point") 
else 
//adjust back number of n 
BN|n] = min(BN[n], BN(x]) /line 2 
end if 
else 
//arc n-x is a back edge, adjust BN [n] 
BN[n] = min(BN[n], TN(x |) //line 3 
end if 
end for 
end ArtPoint 
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We will trace the articulation point algorithm on the graph of Figure 7.20a, where 
a is the starting node. The tree begins with arcs a—b, b-d, and d—c. Each new node 
is numbered with a consecutive tree number, and its back number is set equal to 
its tree number (Figure 7.24a; in the figure, the numbers in parentheses are the tree 
number and the back number, respectively). While processing node c, the back 
edge to a is discovered, and the back number of c is adjusted to 1, the tree number 
of a (line 3 in the algorithm description). This action completes the processing of 
node c, and the depth-first search backs up to d. The back number of c is less than 
the tree number of d, so the back number of d is adjusted to equal that of c (line 2). 
The situation at this point is shown in Figure 7.24b. 

The depth-first search moves on to nodes e, g, and f (Figure 7.24c). At f the 
back arc to d is found, and the back number of fis set equal to the tree number of d 
(line 3). Backing up from f to g, the back number of g is adjusted to equal the back 
number of f (line 2) and similarly for e (line 2 again). (See Figure 7.244.) 


a (1, 1) a (1, 1) a (1, 1) 
OD) ta д) cL TI д 2 c (4, 1) 
d) d (3, 1) d (3, 1) 
e Ў е f е (5, 5) fO.) 
g g g (6, 6) 
(a) (b) (c) 
clo 10) a (1, 1) 
b (2, 2) EtG, I) 0 b) c (4, 1) 
d (3, 1) d (3, 1) 
e (5, 3) iD) G3) (012 33) 
8 (6, 3) 8 (6, 3) 


d = articulation point 
(d) (e) 
Figure 7.24 


Finally, in backing up from e to d, the back number of e is greater than or equal 
to the tree number of d, so d is declared an articulation point (line 1). The re- 
cursion backs up to node b, adjusting the back number of b, and then node a, 
at which point line 1 would seem to apply (Figure 7.24e). But a is the starting 
node of the search and so is not an articulation point, because there is only one 
tree arc from a. ө 
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PRACTICE 18 | In Figure 7.25, the depth-first search began at node a. Explain why each node is marked as 
it is and how it is concluded that c is an articulation point. 


a (1, 1) b (2, 1) 
CID 
d (4, 4) 
c = articulation point 
Figure 7.25 І 
SECTION 7.5 REVIEW 
TECHNIQUE MAIN IDEA 


( Find articulation points in a simple, connected • Articulation points represent single points of fail- 
graph (using algorithm ArtPoint) ure in a computer network, but an algorithm exists 
to detect their presence. 


EXERCISES 7.5 


For Exercises 1—6, draw the depth-first search trees, where node a is the starting node of the depth-first search. 
Identify the back arcs. 


1. р 4. а 
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For Exercises 7—12, use algorithm ArtPoint to find the articulation points. Label the tree number and back 
number for each node, both as first assigned and as changed. Draw the biconnected components of the graph. 
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CHAPTER 7 REVIEW 


TERMINOLOGY 


adjacency relation (p. 555) 
articulation point(p. 607) 

back arc (p. 608) 

biconnected component(p. 608) 
biconnected graph(p. 607) 
breadth-first search (p. 598) 
connected component (p. 603) 
depth-first search (p. 596) 


dequeue (p. 599) 
enqueue (p. 599) 


SELF-TEST 


Answer the following true—false questions. 


Section 7.1 

1. Any binary relation on a set N has an associated 
adjacency matrix. 

2. Transitive closure is the adjacency relation equiva- 
lent of reachability. 

3. The reachability matrix R for a directed graph G 
is computed by taking the powers of the adjacency 
matrix up to n’. 

4. Warshall's algorithm proceeds by computing, in 
turn, the number of paths of length 1, then length 2, 
and so on, between nodes. 

5. Warshall’s algorithm computes symmetric closure 
in the case of a symmetric adjacency relation. 


Section 7.2 

1. A graph with four odd nodes can still be a connected 
graph. 

2. An Euler path exists in any graph with an even 
number of odd nodes. 

3. An Ө(п?) algorithm exists to test the existence of 
an Euler path in a graph with n nodes. 

4. A Hamiltonian circuit uses each arc and node of the 
graph exactly once except for the starting and end- 
ing node. 

5. No algorithm to solve the Hamiltonian circuit 
problem is known. 


Section 7.3 

1. Dijkstra's algorithm for the shortest path in a graph 
maintains a set /N and adds at each step the node 
closest to a node in JN. 

2. A greedy algorithm is one that recursively divides 
the problem into as many subproblems as possible. 

3. The minimal spanning tree for a graph may not be 
unique. 


Euler path (p. 572) 

even node (p. 572) 

graph traversal (p. 596) 
greedy algorithm (p. 586) 
Hamiltonian circuit (p. 576) 
minimal spanning tree (p. 587) 


odd node (p. 572) 

queue (p. 599) 

reachable node (p. 557) 
reachability matrix (p. 559) 
spanning tree (p. 587) 

tree arc (p. 608) 


4. Using a linked-list representation for nodes not in 
IN does not improve the order of magnitude of the 
worst-case work done by Dijkstra's algorithm. 

5. The collection of all arcs that are not in a minimal 
spanning tree for a graph will also form a spanning 
tree, but it may not be minimal. 


Section 7.4 

1. The depth-first search visits nodes at the bottom of 
the graph first. 

2. In a breadth-first search beginning with node i, all 
the nodes adjacent to į are visited in order. 

3. An analysis of the depth-first search and the 
breadth-first search shows them to be algorithms of 
the same order of magnitude. 

4. Preorder traversal is the tree equivalent of the 
breadth-first search, using the root as the starting node. 

5. Topological sorting can be done by a succession of 
breadth-first searches on a directed graph. 


Section 7.5 

1. If node n is an articulation point in a connected 
graph, then any path between any two nodes in the 
graph must pass through л. 

2. A biconnected graph 15 a simple, connected graph 
with no articulation points. 

3. When a node л is first reached during a depth-first 
search, any other arcs from л to previously visited 
nodes are back arcs. 

4. A node n where every subtree of л in the depth-first 
search tree has a back arc to a predecessor of и is 
not an articulation point. 

5. The root of a depth-first search is always an articula- 
tion point in the graph because any node's back num- 
ber will be greater than or equal to its tree number. 
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ON THE COMPUTER 


For Exercises 1—5, write a computer program that 
produces the desired output from the given input. 


1 


. Input: Adjacency matrix A for a directed graph 


Output: Reachability matrix Rforthe graph, comput- 
ed from the formula R = A V AO V---V AO 


. Input: Adjacency matrix A for a directed graph 


Output: Reachability matrix R for the graph, com- 
puted by using Warshall's algorithm 


. Input: Adjacency matrix A for a graph 


Output: Message indicating whether the graph has 
an Euler path 


. Input: Adjacency matrix A for a simple weighted 


graph or directed graph and two nodes in the graph 
Output: Distance for the shortest path between the 
two nodes or a message that no path exists; verti- 
ces in the shortest path if one exists (Hint: You will 
need to find some way of denoting which vertices 
are currently in JN.) 


. Input: Adjacency matrix A for a simple, weighted, 


connected graph 
Output: Arcs (as ordered pairs) in a minimal 
spanning tree 


For Exercises 6—8, first write a function that collects 
information from the user about a graph and builds an 
adjacency list representation of the graph; incorporate 
this function in the programs requested. 


6. Input: Information about a graph (see instructions 
above) and a node in the graph 
Output: Nodes in a depth-first search of the graph 
beginning with the given node 


7. Input: Information about a graph (see instructions 
above) and a node in the graph 
Output: Nodes in a breadth-first search of the graph 
beginning with the given node 


8. Input: Information about a graph (see instructions 
above) 
Output: Articulation points in the graph 
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Question: 


After studying this chapter, you will be able to: 


* Determine whether a given mathematical structure is a Boolean algebra. 

* Prove properties about Boolean algebras. 

* Understand what it means for an isomorphism function to preserve the effects 
of a binary operation or other property. 

* Draw a logic network to represent a Boolean expression. 

* Write a Boolean expression to represent a logic network. 

* Write the truth function for a Boolean expression or logic network. 

* Write a Boolean expression in canonical sum-of-products form for a given 
truth function. 

* Use NAND and NOR gates as well as AND, OR, and NOT gates to build logic 
networks. 

* Write a truth function from a description of a logical control device. 

* Simplify Boolean expressions and logic networks using Karnaugh maps. 

* Simplify Boolean expressions and logic networks using the Quine-McCluskey 
method. 


You have been hired by Rats R Us to build the control logic for the production facilities 
for a new anticancer chemical compound being tested on rats. The control logic must 
manage the opening and closing of two valves, A and B, downstream of the mixing 
vat. Valve A is to open whenever the pressure in the vat exceeds 50 psi (pounds per 
square inch) and the salinity of the mixture exceeds 45 g/L (grams per liter). Valve B is 
to open whenever valve A is closed and the temperature exceeds 53°С and the acidity 
falls below 7.0 pH (lower pH values mean more acidity). 


How many and what type of logic gates will be needed in the circuit? 


The answer to this electronics problem lies, surprisingly, in a branch of math- 
ematics developed around 1850 by George Boole, an English mathematician. 
Boole was interested in developing rules of "algebra" for logical thinking, 
similar to the rules of algebra for numerical thinking. Derided at the time as 
useless, if harmless, Boole's work is the foundation for the electronics found in 
computers today. 
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In Section 8.1 we define Boolean algebra as a mathematical model of both 
propositional logic and set theory. The definition requires every Boolean algebra 
to have certain properties, from which many additional properties can be derived. 
This section also discusses what it means for two instances of a Boolean algebra 
to be isomorphic. 

Section 8.2 establishes a relationship between the Boolean algebra structure 
and the wiring diagrams for the electronic circuits in computers, calculators, in- 
dustrial control devices, telephone systems, and so forth. Indeed, we will see that 
truth functions, expressions made up of variables and the operations of Boolean 
algebra, and these wiring diagrams are all related. As a result, we can effectively 
pass from one formulation to another and still preserve characteristic behavior 
with respect to truth values. We will also find that we can simplify wiring dia- 
grams by using properties of Boolean algebras. In Section 8.3 we will look at two 
other procedures for simplifying wiring diagrams. 


SECTION 8.1 


BOOLEAN ALGEBRA STRUCTURE 


Let us revisit the wffs of propositional logic and associate with them a certain 
type of function. Suppose a propositional wff P has n statement letters. Then each 
row of the truth table for that wff associates a value of T or F with an n-tuple of 
T-F values. The entire truth table defines a function fsuch that f£: (T, F} — (T, F}. 
The function associated with a tautology maps (T, F}” — {T}, and the function 
associated with a contradiction maps (T, FI" — {F}. 


We've seen this idea before. From Example 31 in Chapter 5, the function 
РАТ, FP = (T, Е} for the wff A V B' is given by the following truth table. 


A B B АУ B’ 
Т Т F Tr 
Т Б Ш T 
Б T F Б 
F [F Ш T 
Here f(T, Р) = T and ЌЕ, T) = Е. e 


Suppose we agree, for any propositional wff P with n statement letters, to let 
the symbol P denote not only the wff but also the corresponding function defined 
by the truth table. If P and О are equivalent wffs, then they have the same truth 
tables and therefore define the same function. Then we can write P — Q rather 
than P < О. This simply confirms that a given function has multiple names, al- 
though a given wff defines a unique function. 

With this agreement, the short list of tautological equivalences from Section 
1.1 can be written as follows, where V and ^ denote disjunction and conjunc- 
tion, respectively, A' denotes the negation of a statement А, 0 stands for any con- 
tradiction, and 1 stands for any tautology: 
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la AVB=BVA lb. A^B—-B^A (commutative properties) 

2а. (AV B) VC = 2b (AA B)AC= (associative properties) 
AV (BV С) AN(BAC) 

3a. AV(BA^AC)- 3b. A^(BVC)- (distributive properties) 
(AV B) ^ (A V C) (A ^ B)V (AAC) 

Да. AV 0 = A 4b. A^1-—4A (identity properties) 

5а. АУА! = 1 5b. A /\ A’ = 0 (complement properties) 


Switching gears a bit, in Section 4.1 we studied set identities among the sub- 
sets of a set S (the elements of ((5)). We found the following list of set identities, 
where U and N denote the union and intersection of sets, respectively, A’ is the 
complement of a set A, and @ is the empty set: 


la AUB=BUA Ib. AN B=BNA (commutative properties) 


2a (AU B)U C = 2b. (АПВ) ПС = (associative properties) 
AU(BUC) AN(BNC) 

За. AU(BNC) = 3b. AN (BUC) = (distributive properties) 
(AUB)N(AUC) (ANB)U(NC) 

4a. АО Ӧд = А 4b. ANS = A (identity properties) 

5а. AUA'=S 5b. ANA' = @ (complement properties) 


These two lists of properties are similar. The disjunction of statements and the 
union of sets seem to play the same roles in their respective environments. So do 
the conjunction of statements and the intersection of sets. A contradiction seems 
to correspond to the empty set and a tautology to S. What should we make of this 
resemblance? 


Models or Abstractions 


We seem to have found two different examples—propositional logic and set 
theory—that share some common properties. One of the hallmarks of scientific 
thought is to look for patterns or similarities among various observed phenomena. 
Are these similarities manifestations of some underlying general principle? Can 
the principle itself be identified and studied? Could this research shed light on the 
behavior of various instances of this principle? Sometimes, as seems to be the 
case with propositional logic and set theory, similar mathematical properties or 
behavior can be seen in different contexts. A mathematical structure is a formal 
model that serves to embody or explain this commonality, just as in physics the 
law of gravity is a formal model of why apples fall, the ocean has tides, and the 
planets revolve around the sun. 

Mathematical principles are models or abstractions intended to capture prop- 
erties that may be common to different instances or manifestations. These princi- 
ples are sometimes expressed as mathematical structures—abstract sets of objects, 
together with operations on or relationships among those objects that obey certain 
rules. (This concept may give you a clue about why this book is titled as it is.) 

We can liken a mathematical structure to a human skeleton. We can think of 
the skeleton as the basic structure of the human body. People may be thin or fat, 
short or tall, black or white, and so on, but stripped down to skeletons they all look 
pretty much alike. Although the outward appearances differ, the inward structure, 
the shape and arrangement of the bones, is the same. Similarly, mathematical 
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structures represent the underlying sameness in situations that may appear out- 
wardly different. 

It appears reasonable to abstract the common properties (tautological equiva- 
lences and set identities) for propositional wffs and set theory. Thus we will soon 
define a mathematical structure called a Boolean algebra that incorporates these 
properties. First, however, we note that modeling or abstracting is not an entirely 
new idea to us. 


1. We used predicate logic to model reasoning and formally defined an inter- 
pretation as a specific instance of predicate logic (Section 1.3). 

2. We defined the abstract ideas of partial ordering and equivalence relation, 
and considered a number of specific instances that could be modeled as 
posets or sets on which an equivalence relation is defined (Section 5.1). 

3. We noted that graph and tree structures can model a great variety of in- 
stances (Sections 6.1 and 6.2). 


Boolean algebra is just another model or abstraction for which we already have 
two instances. 


Definition and Properties 


Let us characterize formally the similarities between propositional logic and set 
theory. In each case we are talking about items from a set: a set of wffs or a set 
of subsets of a set S. In each case we have two binary operations and one unary 
operation on the members of the set: disjunction/conjunction/negation or union/ 
intersection/complementation. In each case there are two distinguished elements 
of the set: 0/1 or @/S. Finally, there are the 10 properties that hold in each case. 
Whenever all these features are present, we say that we have a Boolean algebra. 


BOOLEAN ALGEBRA 

A Boolean algebra is a set B on which are defined two binary operations + and - 
and one unary operation ' and in which there are two distinct elements 0 and 1 
such that the following properties hold for all x, y, z € B: 


lax+y=yt+x Ib.x-y=y-x (commutative properties) 

2a.(x +y)+z= 2b.(x:y):z = (associative properties) 
x+(y + 2) x:(y:z) 

За. x + (y:z) = 3b. х: (у + z) = (distributive properties) 
(x + у): (+ 2) (ху) + (х2) 

4a. х+0 = х 4. х1 =x (identity properties) 

San + x" = 1 5b. хх = 0 (complement properties) 


What, then, is the Boolean algebra structure? It is a formalization that ab- 
stracts, or models, the two cases we have considered (and perhaps others as well). 
There is a subtle philosophical distinction between the formalization itself, the 
idea of the Boolean algebra structure, and any instance of the formalization, such 
as these two cases. Nevertheless, we will often use the term Boolean algebra to 
describe both the idea and its occurrences. This usage should not be confusing. 
We often have a mental idea (“chair,” for example), and whenever we encounter a 


PRACTICE 1 
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Verify property 4b for the Boolean algebra of Example 2. 


concrete example of the idea, we also call it by our word for the idea (this object 
is a "chair"). 

The formalization helps us focus on the essential features common to all 
examples of Boolean algebras, and we can use these features—these facts from 
the definition of a Boolean algebra—to prove other facts about Boolean algebras. 
Then these new facts, once proved in general, hold in any particular instance of 
a Boolean algebra. To use our analogy, if we ascertain that in a typical human 
skeleton, “the thighbone is connected to the kneebone,” then we don't need to 
reconfirm the fact in every person we meet. 

We denote a Boolean algebra by [B, + ,:,',0,1]. 


Let В = (0, 1} (the set of integers 0 and 1) and define binary operations + and - 
on В by x + y = max(x, у), x: y = min(x, y). Then we can illustrate the opera- 
tions of + and - by the following tables. 


A unary operation ' can be defined by means of a table, as follows, instead of by 
a verbal description. 


Thus 0’ = land 1' = 0. Then [B, +,- ,’, 0, 1] is a Boolean algebra. We can 
verify the 10 properties by checking all possible cases. Thus, for property 2b, the 
associativity of - , we show that 


(0:0)-0=0-(0:0) = 0 
(0:0) :-1= 0: (0:1) = 0 
(0:1) :0= 0: (1:0) = 0 
(0:1) :1=0:(1:1) = 0 
(1:0) :0 = 1: (0:0) = 0 
(1:0) :1=1:(0:1) = 0 
(1:-1):0=1:(1:0) = 0 
(le Шей = leqle П) = | 
For property 4a, we show that 
0+0= 
1+0 = ө 
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Bear in mind that Example 2 illustrates a particular instance of the Boolean 
algebra structure. Any Boolean algebra must, by the definition, have at least two 
elements, a 0 element and a 1 element. The specific Boolean algebra in Example 
2 has just those two elements, the integers 0 and 1. But if you want to do a proof 
about Boolean algebras in general, you cannot assume that 0 and 1 are the only el- 
ements. This means that if you know that x and y are two elements in an arbitrary 
Boolean algebra and that x = y, that does not mean that у = x’. 


| EXAMPLES | Let S = (a, b, c}. Then (S) has eight elements: 
2, {a}, (b), {с}, (a. b), (a, с}, (b, c), (a, b, c) 


so using those eight elements together with the operations of union, intersec- 
tion, and complementation gives an eight-element Boolean algebra. The empty 
set is the 0 element, and {1,2,3} is the 1 element. We could show the tables 
for union and intersection, but let's just look at complementation. Remember that 
x Iac. 


@ Дар ©, cel 
{а} {b, с} 
{b} {a, c} 
{c} {a. b} 
{a. b} {c} 
{a, c} {b} 
{b, c} {a} 
{a, b, c} Ø 


Here {a, c} and {c} are two distinct elements of this Boolean algebra, but {a, c} is 
not {c}’. ө 


There are many other properties that hold in any Boolean algebra. We сап 
prove these additional properties by using the properties in the definition. 


|. EXAMPEEA | The idempotent (pronounced eye' -dem-po-tent) property 


X sx 


holds in any Boolean algebra because 


gear oe (eue) (4b) 
= (Ge sr хл) (хх) (5а) 
eee (eese) (3a) 
=x чь (0 (5b) 


x (4a) ө 


REMINDER 


A Boolean algebra 
property may be applied 
only when your expres- 
sion exactly matches the 
pattern of one side of the 
property. 
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Although ordinary arithmetic of integers has many of the properties of a Bool- 
ean algebra, the idempotent property should convince you that arithmetic is not a 
Boolean algebra. The property x + x = x does not hold for ordinary numbers and 
ordinary addition unless x is zero. 

In the proof of Example 4, we used property 5a to replace 1 with x + x’. The 
properties of Boolean algebra are equalities, and either side of an equal sign can 
be replaced with the other side. The Boolean algebra properties (rules) are like the 
equivalence rules in logic; to apply the rule, your situation must match exactly the 
pattern of the rule. For example, it is legal to replace 


Qz) tx 
with 
x + (y:z) 
using property la, because (y: 2) + x matches the right side of la where y is the 


Boolean algebra element y : z, and x + (у · z) matches the left side of la under the 
same interpretation of y. We cannot say 


xct(Qez-Gy + (х2) 


using either property 3a ог 3b because we have mixed up the two properties. And, 
strictly speaking, we cannot replace 


Qz +x 
with 
(y * x):(G t x) 


and claim that we are using property 3a because in property 3a the addition is to 
the left of the multiplication. We must reason as follows: 


(уа) t+x=x+ (уча) (Та) 
= (x+y): (х + 2) (За) 
= (у +): (z + х) (Та twice) 


However, we will sometimes make implicit use of the associative property and 
write 


Xr yr Zs 


with no parentheses. 

Each property in the definition of a Boolean algebra has its dual as part of 
the definition, where the dual is obtained by interchanging + and -, and 1 and 
0. For example, x + 0 = x and x: 1 = x are duals of each other. Therefore, every 
time a new property P about Boolean algebras is proved, each step in that proof can 
be replaced by the dual of that step. The result is a proof of the dual of P. Thus, once 
we have proved P, we know that the dual of P also holds. It’s a two-for-one deal! 
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| EXAMPLES | The dual of the idempotent property (Example 4), x -x = x, is true in any Boolean 


algebra. 2 


PRACTICE 2 


a. What does the idempotent property of Example 4 become in the context of propositional logic? 
b. What does it become in the context of set theory? н 


Once a property about Boolean algebra is proved, we can use it to prove new 
properties. 


PRACTICE 3 


a. Prove that the universal bound property x + 1 = 1 holds in any Boolean algebra. Give a 
reason for each step. 
b. What is the dual property? и 


More properties of Boolean algebras appear in the exercises at the end of 
this section. The most important of these properties are double negation and De 
Morgan’s laws: 


(x) =x (double negation—Exercise 7) 
(x ty) = ху (ху) =x’ +y (De Morgan's laws—Exercise 8) 


Table 8.1 suggests hints that may help when trying to prove a Boolean algebra 
property of the form 


some expression — some other expression 


TABLE 8.1 


Hints for Proving Boolean Algebra Equalities 


Usually the best approach is to start with the more complicated expression and try to show that it reduces to the 
simpler expression. 


Think of adding some form of 0 (like x - x') or multiplying by some form of 1 (like x + x’). 


Remember property За, the distributive property of addition over multiplication —it is easy to forget because it 
doesn't look like arithmetic. 


Remember the idempotent property x + x = x and its dual x: x = x. 


Remember the universal bound property x + 1 = 1 and its dual x- 0 = 0. 


Prove that x’: y = x' ‘у + x':y:z in any Boolean algebra. 
Following the suggestion in Table 8.1, we start with the more complicated 
expression, x’: y + x':y-:z. There is a “common factor" of x’ · y, so we should 


REMINDER 


To prove that something 
is unique, assume that 
there are two of them and 
prove that they must be 
the same. 


REMINDER 


If it walks like a duck and 
it quacks like a duck, it 
must be a duck. 

If it has the two properties 
of the complement, then 
by uniqueness it must be 
the complement. 


Ф THEOREM 
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use a distributive property, although it’s not clear at this point how the z is going 
to disappear. 


Ио (4b) 
= IUE (3b) 
= se ape (ce JD (1a) 
= x':y-(1) (universal bound—and that’s 
how z disappears) 
= х':у (4b) e 


For x an element of a Boolean algebra B, the element x' is called the comple- 
ment of x (picking up the terminology from set theory). The complement of x 
satisfies 

xtx-l and x:x' =0 
Indeed, x' is the unique element with these two properties. To prove it, suppose x, 
is an element of B with these same properties, 


x+x,=1 and x:x,—0 
Then 

x) =x,'1 (4b) 
=x; x + x’) (x + x’ = 1) 
= (ux) + Qux) (3b) 
= (xx) + x) (1b) 
= 0 + (x'- x) (хх = 0) 
= (xix) + (xx) (x:x' = 0) 
= (x' x) + (о) (1b) 
=x'-(x + x) (3b) 
= x':1 (x + x; = 1) 
= х' (4b) 


Thus x, = x’, and x’ is unique. (Uniqueness in the context of propositional logic 
means that the truth table is unique, but there can be many different wffs associ- 
ated with any particular truth table.) 

The following theorem summarizes our observations. 


ON THE UNIQUENESS OF COMPLEMENTS 
For any x in a Boolean algebra, if an element x, exists such that 


x+x,= 1 and xx, =0 


then x, = x’. 
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PRACTICE 4 | Prove that 0’ = 1 and 1’ = 0. (Hint: 1’ = 0 will follow by duality from 0’ = 1. To show 
0' = 1, use the theorem on the uniqueness of complements.) 


There are many ways to define a Boolean algebra. Indeed, in our definition 
of Boolean algebra, we could have omitted the associative properties, since these 
can be derived from the remaining properties of the definition. It is much more 
convenient, however, to include them. 


Isomorphic Boolean Algebras 


What Is Isomorphism? 

Two instances of a structure are isomorphic if there is a bijection (called an iso- 
morphism) that maps the elements of one instance onto the elements of the other 
so that important properties are preserved. (Isomorphic graphs were discussed in 
Section 6.1.) If two instances of a structure are isomorphic, each is a mirror image 
of the other, with the elements simply relabeled. The two instances are essentially 
the same. Therefore, we can use the idea of isomorphism to classify instances of a 
structure, lumping together those that are isomorphic. 


Consider the two partially ordered sets 


S, = {1, 2,3,5, 6, 10, 15, 30}; x py €» x divides у 


The Hasse diagram of each partially ordered set appears in Figure 8. 1. These two 
diagrams certainly appear to be mirror images of each other; just by looking at the 
diagrams, an obvious relabeling of the nodes, as shown in Figure 8.2, suggests itself. 
The important properties of a partially ordered set are which elements are related, 
and the Hasse diagram displays this information. For example, Figure 8.1a shows 
that 1, because of its position at the bottom of the graph, is related to every element 
in S,. Is this property preserved under the relabeling of Figure 8.2? Yes, because 
© is the image of 1 under that relabeling, and © is related to every element in S}. 
Similarly, all the other “is related to” properties are preserved under the relabeling. 


30 
6 15 
> 
2 5 
1 
(а) (b) 


Figure 8.1 
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Figure 8.2 


More formally, the relabeling is accomplished by the following bijection f 
from the set of nodes in Figure 8.1a onto the set of nodes in Figure 8.1b. 


=g FQ) = Ш JOE p 
йе? JMs M=; SA S12 


The bijection fis an isomorphism from poset (S1, p) to poset (S5, с) Because this 
isomorphism exists, the posets (5, p) and (S5, с) are isomorphic. (The function 
f | would be an isomorphism from (S,, с) to (Sj, р).) e 


In Example 7 it was relatively easy to find an isomorphism because of the 
visual representation that captured the important properties (which elements 
are related). Suppose that instead of a partially ordered set, we have a struc- 
ture (like a Boolean algebra) where binary or unary operations are defined on 
a set. Then the important properties pertain to how these operations act. An 
isomorphism must preserve the effects of performing these operations. Each 
instance of two such structures that are isomorphic must be the mirror image 
of the other in the sense that “operate and then map” must equal “map and then 
operate." 

Figure 8.3 illustrates this general idea for a binary operation. In Figure 8.3a, 
the binary operation is performed on a and 5, resulting in c, then c is mapped to d. 
In Figure 8.3b, a and 5 are mapped to e and f; on which a binary operation is 
performed, resulting in the same element d as before. Remember, 


operate and map — map and operate 


Still another view of this little equation appears in the commutative diagram of 
Figure 8.4. 


PRACTICE 5 
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Map 
f 
Lii 
Operate 
Figure 8.3 (b) 
Operate 


a, b iO 
Map Map 


e, f Operate d 
Figure 8.4 


Isomorphism as Applied to Boolean Algebra 

Now let's determine specifically what is involved when two instances of a Boolean 
algebra are isomorphic. Suppose we have two Boolean algebras, [B, +, ·, ', 0, 1] 
and [b, &, *, ", ф, Y]. This notation means that, for example, if x is in B, x’ is 
the result of performing on x the unary operation defined in В, and if z is an ele- 
ment of b, z" is the result of performing on 2 the unary operation defined in 5. 
How would we define isomorphism between these two Boolean algebras? First, 
we would need a bijection f from B onto b. Then f must preserve in b the effects of 
the various operations in В. There are three operations, so we use three equations 
to express these preservations. To preserve the operation +, we want to be able to 
operate using + on two elements in В and then map the result to 5, or to map the 
two elements to b and operate using the corresponding operation & on the results 
there. (Think “operate and map = map and operate") Thus, for x and y in В, we 
require 


f(x y) = Ло) & f(y) 


a. Write the equation requiring f to preserve the effect of the binary operation -. 
b. Write the equation requiring f to preserve the effect of the unary operation '. п 
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Here is the definition of an isomorphism for Boolean algebras. 


* DEFINITION ISOMORPHISM FOR BOOLEAN ALGEBRAS 
Let [B, +, :, ', 0, 1] and [5, &, *, ", ф, Y] be Boolean algebras. A function В > b 
is an isomorphism from [B, +, ·, ', 0, 1] to [b, &, *, ", ф, Y] if 


1. fis a bijection 

2. f(x + y) = Јо) & fy) 
3. Fæ y) = Јо) * fO) 
4. fŒ) = S) 


PRACTICE 6 | Illustrate properties 2, 3, and 4 in the definition by commutative diagrams. 


We already know (it was one of our original inspirations) that for any set 
S, (S) under the operations of union, intersection, and complementation consti- 
tutes a Boolean algebra. Example 3 talks about this type of Boolean algebra where 
S = (a, b, c}. If we pick S = (1, 2}, then the elements of (S) are Ø, {1}, {2}, 
and {1, 2}. The operations are given by the following tables. 


U Ø {1,2} {Ih {2} 
Ø Ø {1,2} {1} {2} 
{1, 2} {1,2} {1,2} {1,2} {1,2} 
{1} {1} {1,2} {1} {1, 2} 
{2} {2} (2 11,2) {2} 


N 2 {1,2} i {2} ' 

@ о © о Ø @ {1,2} 
{1,2}|@ {1,2} {1} {2} (1,2119 
{i} |@ {1} UR S {1} | 42} 
{2} |Ø {2} Ø 1} {2} | {1} 


A Boolean algebra can be defined on the set В = {0, 1, a, a'} where the opera- 
tions of + , -, and ' are defined by the following tables (see Exercise 1). 


+ | 0 1 a a 0 1 a a : 

0 1 a а olo 0 0 0 EAE 
1 1 1 1 1 1 0 1 a a 1 0 
а а 1 а 1 а 0 а а 0 а | a’ 
a ja 1 1 а aa |o а 0 a а | a 
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We claim that the mapping f: B — g(S) given by 


fO = @ 
FQ) = {1,2} 
Да) = {1} 
Fla’) = {2} 


is an isomorphism. Certainly it is a bijection. For x, y € В, we can verify each of 
the equations 


fe + y) = fe) Ufo» 
Лоту) = feo Vf» 
fo) = (700) 


by examining all possible cases. Thus, for example, 


fa: D = fa) = {1} = 100,2) = Да) ne» 


PRACTICE 7 | Verify the following equations. 


a. fO + a) = f(0) U f(a) 
b. fla + а) = f(a) U fla’) 
c. faa’) = f(a) а) 


СТОЛА ООЙ 


* THEOREM 


The remaining cases also hold. Even without testing all cases, it is pretty clear 
here that fis going to work because it merely relabels the entries in the tables for 
B so that they resemble the tables for (S). In general, however, it may not be so 
easy to decide whether a given f is an isomorphism between two instances of a 
structure. Even harder to answer is the question of whether two given instances of 
a structure are isomorphic; we must either think up a function that works or show 
that no such function exists. One case where no such function exists is when the 
sets involved are not the same size; we cannot have a four-element Boolean alge- 
bra isomorphic to an eight-element Boolean algebra. 

We just showed that a particular four-element Boolean algebra is isomor- 
phic to g({1, 2}). It turns out that any finite Boolean algebra is isomorphic to 
the Boolean algebra of a power set. Although we state this as a theorem, we 
will not prove it. 


ON FINITE BOOLEAN ALGEBRAS 
Let B be any Boolean algebra with л elements. Then n = 2" for some т, and В is 
isomorphic to e((1, 2, ... , m}). 


This theorem gives us two pieces of information. The number of elements in 
a finite Boolean algebra must be a power of 2. Also we learn that finite Boolean 
algebras that are power sets are—in our lumping together of isomorphic things— 
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really the only kinds of finite Boolean algebras. In a sense we have come full 
circle. We defined a Boolean algebra to represent many kinds of situations; now 
we find that (for the finite case) the situations, except for the labels of objects, are 
the same anyway! 


SECTION 8.1 REVIEW 


TECHNIQUES MAIN IDEAS 


* Mathematical structures serve as models or ab- 
stractions of common properties found in diverse 


* Decide whether something is a Boolean algebra. 
2 Prove properties about Boolean algebras. 


Write the equation meaning that a function f pre- 
serves an operation from one instance of a struc- 
ture to another, and verify or disprove such an 


situations. 
If there is an isomorphism (a bijection that pre- 
serves properties) from А to B, where А and В are 


instances of a structure, then except for labels, 4 
and B are the same. 

* All finite Boolean algebras are isomorphic to 
Boolean algebras that are power sets. 


equation. 


EXERCISES 8.1 
1. Let B = (0,1, а, а'}, and let + and · be binary operations on В. The unary operation ' is defined by 


the table 
Ex 
1 0 
a a' 
a' a 


Suppose you know that [B, + , -,’, 0, 1] is a Boolean algebra. Making use of the properties that must 
hold in any Boolean algebra, fill in the following tables defining the binary operations + апа · : 


+ 0 1 a a’ : 0 1 а a’ 
0 0 
1 1 
a a 
a’ a’ 


2. a. What does the universal bound property (Practice 3) become in the context of propositional logic? 
b. What does it become in the context of set theory? 
3. Define two binary operations + and - on the set Z of integers by x + у = max(x, y) and x : y = min(x, y). 
a. Show that the commutative, associative, and distributive properties of a Boolean algebra hold for these 
two operations on Z. 
b. Show that no matter what element of Z is chosen to be 0, the property x + 0 = x ofa Boolean alge- 
bra fails to hold. 
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4. Let M;(Z) denote the set of 2 X 2 matrices with integer entries, and let + denote matrix addition апа. 
denote matrix multiplication. Given 


0 0 1 0 
Using |а o and [ | as the 0 element and the 1 element, respectively, either prove that 


[M,(Z), + ,-,', 0, 1] is a Boolean algebra or give a reason why it is not. 


5. Let Sbe the set (0, 1}. Then S? is the set of all ordered pairs of Os and 15; S? = {(0, 0), (0, 1), (1, 0), (1, 1}. 
Consider the set B of all functions mapping S? to S. For example, one such function, f(x,y), is given by 


0,0) = 0 
f(0, 1) = 1 
fd, 0) = 1 
fd, =1 


a. How many elements are in B? 
b. For f, and f; members of В and (х, y) є S?, define 


Cfi + ANG y) = max( fix, у), A у)) 
Cfi AO, y) = min( fi, у), fox, y) 


, _ jliffi@,y) = 0 
Paci 
Suppose 
Л(0, 0) = 1 f£(0,0) = 1 
ЛО, 0 =0 50121 
лер fol, 0) = 0 
Ad,yv=0 f0,1=0 


What аге the functions f, + fj, fi: f;, and fi? 
c. Prove that [B, +, ·, ', 0, 1] is a Boolean algebra where the functions 0 and 1 are defined by 


0(0,0)=0 10,0) =1 
000,1) = 0 10,1) =1 
00,0 = 0 11,0) = 1 
001,1) 20 1,0) =1 


6. Let п be а positive integer whose decomposition into prime factors has по repeated prime. Let 
В = {x|x is a divisor of n}. For example, if n = 21 = 3-7, then В = (1,3, 7, 21). Let the following 
operations be defined on B: 


x + у = lem(x, y) x: y = ged(x, y) x' =n/x 


10. 
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Then + and · are binary operations on В and ' is a unary operation on В. 
a. For n = 21, find 
(3-7 
(1) 7:21 
(iii) 1 + 3 
(iv) 3 + 21 
(v) 3' 
b. Prove that the commutative, associative, and distributive properties hold for both + and -. 
c. Find the value of the “0” element and the “1” element, then prove properties 4 and 5 for both + and -. 
d. Consider a value for п whose decomposition has repeated primes. In particular, let n = 12 = 2-2-3. 
Prove that, using the above definitions for + and -, it’s not possible to define a complement for 6 in 
the set (1, 2, 3, 4, 6, 12}. Therefore a Boolean algebra cannot be constructed with п = 12 using the 
process described. 


. Prove the following property of Boolean algebras. Give a reason for each step. (Hint: Remember the 


uniqueness of the complement.) 


(x) =x (double negation) 


. Prove the following property of Boolean algebras. Give a reason for each step. (Hint: Remember the 


uniqueness of the complement.) 


(x+y) = ху (xy) =x + y (De Morgan’s laws) 


. Prove the following properties of Boolean algebras. Give a reason for each step. 


ax+(x-y) =x (absorption properties) 
x-(x+y)=x 
b. x [y + œD] = (ху) + 6:2 (modular properties) 


x + [y (х + 2)] = (x + у): (х + 2) 
с. (х+у) (к +у) = у 
Gy) + (ку) = у 
а. (х + (у 2) =x y кх + 
otz = +y) +2) 
e (х+у): (х +1) = х + (ху) +у 
(ку) + (6-00) Sx ayy 
Prove the following properties of Boolean algebras. Give a reason for each step. 
а. х+у)+(у<х)=х+у 
Ы. (у + х): (2+ у) +х:2:(2 +2) = у +х-2 
c. (ух) - xt (у + х) у= х + (ух) 
d. (x + у): = (х +z) (у +z] 
е. (x- y) + (х2) + (cy 2’) =y + (х' 2) 


П. 


12. 


13. 
14. 
15. 


16. 


17: 


18. 


19. 


20. 
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Prove the following properties of Boolean algebras. Give a reason for each step. 
а х+у =xt+(x'-yt+x-y)' 

b. [с 2] + (уз) = у= 

с. ху ty x = ху ty 

а. (х + у) zt y= z 

е. Gy) + (yz) + z) = (у) + (у 2) + (х2) 

Prove the following properties of Boolean algebras. Give a reason for each step. 


e 


‚ к+у-зх)'\ =x 
x:(ztyt-(x +y) =x 
cry! t ха нусаа х + у 


oc 


spre ya vy 


Or @ 


X'yctyszex' = уа + ухх 

Prove that in any Boolean algebra, x: y' + x': у = y ifand only if x = 0. 
Prove that in any Boolean algebra, x: y' = 0 if and only if x: y = x. 

А new binary operation @ in a Boolean algebra (exclusive OR) is defined by 


xQycxytyx 


Prove that 

a xQy-yQx 
5. х©х=0 

с. 0x-x 

а. 1@х= х 


Prove that for any Boolean algebra: 

a. If x + y = 0, then x = 0 and y = 0. 

b. x = y ifand only if x - y' + y: x' = 0. 

Prove that the 0 element in any Boolean algebra is unique; prove that the 1 element in any Boolean algebra 
is unique. 


a. Find an example of a Boolean algebra with elements x, y, and z for which x + y = x + z but y 7 2. 
(Here is further evidence that ordinary arithmetic of integers is not a Boolean algebra.) 


b. Prove that in any Boolean algebra, if x + y = х + zand x' + y = x' + z, then y = 2. 

Let (S, <) and (S’, <’) be two partially ordered sets. (S, X) is isomorphic to (S', <’) ifthere is a bijection 

f: S — S' such that for x, y in S, x < y > f(x) <'f(y) and f(x) «' f(y) > x < y. 

a. Show that there are exactly two nonisomorphic, partially ordered sets with two elements (use 
diagrams). 

b. Show that there are exactly five nonisomorphic, partially ordered sets with three elements. 

c. How many nonisomorphic, partially ordered sets with four elements are there? 


Find an example of two partially ordered sets (5, <) апа (S’, <’) and a bijection f: 5 — S' where, for x, у 
in S, x < y > fŒ) «' f(y) but f(x) «'f(y) P x < y. 


21. 


22. 


23. 


24. 


25. 
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Let S = (0, 1} and let a binary operation · be defined on S by 


+ 5 7 
5 7 5 
7 5 7 


Consider [S,-] and [7; +] as mathematical structures. 
a. If a function fis an isomorphism from [S, -] to [7; +], what two properties must f satisfy? 
b. Define a function f: 5 — Т and prove that it is an isomorphism from [S, :] to [Z +]. 


Consider the four-element Boolean algebra defined in Exercise 6 with n = 21. Find an isomorphism 
from this Boolean algebra to the four-element Boolean algebra with set g({1, 2}) that was defined in this 
section. 


Let R denote the real numbers and R* the positive real numbers. Addition is a binary operation on R, and 
multiplication is a binary operation on К”. Consider [R, +] and[R *, -] as mathematical structures. 


a. Prove that the function f defined by f(x) = 2" is a bijection from R to К“. 

b. Write the equation that an isomorphism from [R, +] to [R*, -] must satisfy. 

c. Prove that the function f of part (a) is an isomorphism from [R, + ] to [R*,-]. 

d. What is f~! for this function? 

e. Prove that / ! is an isomorphism from [R*,-] to [R, +]. 

An isomorphism from the Boolean algebra with set B = (0, 1, a, a') to the Boolean algebra with set 


e (11, 2}) was defined in this section. Because the two Boolean algebras are essentially the same, an opera- 
tion in one can be simulated by mapping to the other, operating there, and mapping back. 


a. Use the Boolean algebra on (11, 2}) to simulate the computation 1 · a’ in the Boolean algebra on B. 

b. Use the Boolean algebra on @({1, 2}) to simulate the computation (a)' in the Boolean algebra on B. 

c. Use the Boolean algebra on В to simulate the computation {1} U {2} in the Boolean algebra on 
e, 2). 

Consider the set B of all functions mapping (0, 1} to (0, 1}. We can define operations of +, +, and ' on B by 


Cfi + AG y) = max(fiGo, y). h, у)) 
Cfi: AG y) = min( fiGo у), A y)) 

1 if fio, y) = 0 

0 if fi, y) = 1 


Then [B, —,:,', 0, 1] is a Boolean algebra of 16 elements (see Exercise 5). The following table assigns 
names to these 16 functions. 


лоу) = { 


26. 


27. 


28. 
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(x, У ) 0 1 f, f 2 f. 3 fa fs fe f, fs fs f 10 f, fi; fig fia 
0010 1 1 1 1 1 1 1 0 0 O 1 0 0 0 о 
0.1710 1 0 1 1 0 1 O 1 1 1 0 O 1 BD 0 
Golo 1 1 0 1 0 @ Yd 1 1 0 O Yd 9 1 0 
19. 1 0 0 .0 O 1 1 1 O 1 ^ 1 0 O0. 1 


According to the theorem on finite Boolean algebras, this Boolean algebra is isomorphic to 
[0(11, 2, 3, 4), U, П, ’, Ø, (1, 2, 3, 4}]. Complete the following definition of an isomorphism from B to 
odl, 2, 3, 4}). 

06 

1 — {1, 2,3, 4} 

А =] 

Л > {2} 

fis > {3} 

fia {4} 
Let Р, О, and А be three statements in propositional logic with statement letters А and В. P, О, and А 


define the following three functions from (T, FP? to (T, F}. Also shown are the contradiction 0 and the 
tautology 1. 


nm т oH A/D. 
т Am Alo 
4 т m dv 
т т ач” mio 
т oA m т/ж 
тт т njo 
чач ч 


a. Let B = (P,P', О, О’, R, К', 0, 1}. Then [B, V, Л, ', 0, 1] is a Boolean algebra. Write the 8 X 8 
tables for the V and ^ operations and the 8 X 1 table for the ' operation. 

b. [0(11, 2,3}, О, П, ', Ø, (1, 2, 3}] is a Boolean algebra. Write the 8 X 8 tables for the U and N op- 
erations and the 8 X 1 table for the ' operation. 

c. Find an isomorphism from the Boolean algebra of part (a) to the Boolean algebra of part (b). 

Suppose that [B, +,-,’, 0, 1] and [b, &, *, ", ф, Y] are isomorphic Boolean algebras and that fis an iso- 

morphism from B to b. 

a. Prove that A0) = ф. 

b. Prove that A1) = Y. 

According to the theorem on finite Boolean algebras, which we did not prove, any finite Boolean algebra 

must have 2" elements for some т. Prove the weaker statement that no Boolean algebra can have an odd 

number of elements. (Note that in the definition of a Boolean algebra, 0 and 1 are distinct elements of B, 


so B has at least two elements. Arrange the remaining elements of B so that each element is paired with its 
complement.) 
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29. A Boolean algebra may also be defined as a partially ordered set with certain additional properties. Let 
(B, <) be a partially ordered set. For any x, y є В, we define the /east upper bound of x and y as an 
element z such that x < 2, y < z, and if there is any element z» with x < z* and y < zs, then z < z«. 
The greatest lower bound of x and y is an element у such that у < x, у < y, and if there is any element 
w* with w* < x and w* < y, then w* < w. A lattice is a partially ordered set in which every two 
elements x and у have a least upper bound, denoted by x + у, and a greatest lower bound, denoted by x · y. 
a. Prove that in any lattice 

(1) x:y = x if and only if x X y 
(1) x + y = y ifand only if x < y 
b. Prove that in any lattice 
(О х+у=у+х 
(х'у= ух 
(ш) (x Ку) +2 = х + (у + 2) 
(iv) (x-y)'z =x: Qy:z) 
с. А lattice / is complemented if there exists a least element 0 and a greatest element 1, and for every 
x € L there exists х’ є L such that x + x’ = 1 and x: x' = 0. Prove that in a complemented lattice L, 


x+0=x and x'l=x 


for all x € L. 
d. A lattice L is distributive if 


xt(y:z)(xty)(x-z 
and 

xQ +2) = Gy) + (х2) 
for every x, y, z € L. By parts (b) and (c), a complemented, distributive lattice is a Boolean algebra. 
Which of the following Hasse diagrams of partially ordered sets do not represent Boolean algebras? 


Why? (Hint: In a Boolean algebra, the complement of an element is unique.) 


7 


(a) (b) (c) (d) 
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30. a. Let n be a positive integer and consider В to be the set of all positive integer divisors of n. Prove that 
(B, <) is a partially ordered set where x < y means x |y. 


In the terminology of Exercise 29, the least upper bound of x and y is the least common multiple of x and 
у, and the greatest lower bound is the greatest common divisor. (В, <) is a distributive lattice. 


b. Prove that for n = 6, (B, <) is a Boolean algebra. (Hint: 1 is the least element and 6 is the greatest 


element). 


c. Forn = 8, (В, <) is not a Boolean algebra. 


(i) Show that this is true by using the definition of a Boolean algebra. 


(1) Show that this is true by using Exercise 6. 


SECTION 8.2 


LOGIC NETWORKS 


Combinational Networks 


Basic Logic Elements 

In 1938 the American mathematician Claude Shannon perceived the parallel be- 
tween propositional logic and circuit logic and realized that Boolean algebra could 
play a part in systematizing this new realm of electronics. 

Let us imagine that the electrical voltages carried along wires fall into one 
of two ranges, high or low, which we represent by 1 and 0, respectively. Voltage 
fluctuations within these ranges are ignored, so we are forcing a discrete, indeed 
binary, mask on an analog phenomenon. We also suppose that switches can be 
wired so that a signal of 1 causes the switch to be closed and a signal of 0 causes 
the switch to be open (Figure 8.5). Now we combine two such switches, controlled 
by lines x, and x, in parallel. If either or both lines carry а 1 value, one or both 
of the switches will be closed, and the output line will have a value of 1. However, 
values of x, = 0 and x, = 0 will cause both switches to be open and thus break the 
circuit, so that the voltage level on the output line will be 0. Figure 8.6 illustrates 
the various cases. 


x=0 P udi 
+ | > ew 


x=1 
Closed switch Open switch 
Figure 8.5 
x,=1 x,=0 
=== = 
1 1 
x21 x21 
x,=1 x,=0 
- t m 
1 0 


L—— 


Figure 8.6 
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TABLE 8.2 Table 8.2 summarizes the behavior of the circuit. Substituting T for 1 and 


F for 0 in the table results in the truth table for the logical connective of disjunc- 


к | ОВ tion. Disjunction is an example of the Boolean algebra operation + in the realm of 


propositional logic. Thus we may think of the circuit more abstractly as an elec- 


tronic device that performs the Boolean operation +. Similarly, conjunction and 


о о| ~ | ~ 
оре о| ~ 


1 
1 negation are examples of the Boolean algebra operations · and ', respectively, in 
0 the realm of propositional logic. Other devices perform these Boolean operations. 


For example, switches connected in series would serve to implement the - operation; 
both switches must be closed (x, = 1 and x, = 1) in order to have an output of 1. 
However, we’ll ignore the details of implementing the devices; suffice it to say that 
technology has progressed from mechanical switches through vacuum tubes and 
then transistors to integrated circuits, and now even bacteria and DNA. We will 
simply represent these devices by their standard symbols. 

The OR gate, Figure 8.7a, behaves like the Boolean operation +. The AND 
gate, Figure 8.7b, represents the Boolean operation:. Figure 8.7с shows an 
inverter, corresponding to the unary Boolean operation '. Because of the associativ- 
ity property for + and · , ће OR and AND gates can have more than two inputs. 


Xx х' 
x X, +x. x, 1 72 1 
1 1 2 X; 
х X 


X, X3 Xj X, X, X» Xp X, xy x 
1 1 1 1 1 1 1 0 
1 0 1 1 0 0 0 1 
0 1 1 0 1 0 (с) Inverter 
0 0 0 0 0 0 
(a) OR gate (b) AND gate 
Figure 8.7 


Boolean Expressions 


* DEFINITION BOOLEAN EXPRESSION 
A Boolean expression in n variables, хү, x», ... , x,, is any finite string of 
symbols formed by applying the following rules: 


l. x1, X», ... , x,, are Boolean expressions. 
2. If P and О are Boolean expressions, so are (Р + О), (Р. О), and (P’). 


(The definition of a Boolean expression is another example of a recursive 
definition; rule 1 1s the basis step and rule 2 the inductive step.) When there 
18 no chance of confusion, we can omit the parentheses introduced by rule 2. 
In addition, we define · to take precedence over + and ' to take precedence 
over + ог", SOX, + x,-x4 stands for x, + (x;:x4) and x, + xj stands for 
x, + (x5); this convention also allows us to remove some parentheses. Finally, we 
will generally omit the symbol · and use juxtaposition, so x, : x; is written xx». 
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ЭСТ x, (х. + x2) х3, (хз + ха) х, and (xi х›)'х are all Boolean expressions. e 


Truth Functions 


DEFINITION TRUTH FUNCTION 
А truth function is a function f such that f: (0, 1)" > 10, 1} for some integer 
п 2 1. 
The notation (0, 1)" denotes the set of all n-tuples of Os and 15. А truth 
function thus associates a value of 0 or 1 with each such n-tuple. 


The truth table for the Boolean operation + describes a truth function 
f with n=2. The domain of f is {(1, 1), (1, 0), (0, 1), ,0)}, and 
fC, 1) = 1, fC, 0) = 1,/(0,1) = 1, and f(0, 0) = 0. Similarly, the Boolean 
operation : describes a different truth function with л = 2, and the Boolean 
operation ' describes a truth function for n = 1. e 


PRACTICE 8 


a. If we are writing a truth function f: (0, 1)" — {0,1} in tabular form (like a truth table), 
how many rows will the table have? 

b. How many different truth functions are there that take (0, 1)? — (0, 1}? 

c. How many different truth functions are there that take (0, 1)" > (0, 1}? и 


Any Boolean expression defines а unique truth function, just as do the sim- 
ple Boolean expressions x, + xz, хух›, and xj. 


The Boolean expression xx, + хз defines the truth function given in Table 8.3. 
(This is just like doing the truth tables of Section 1.1.) 


TABLE 8.3 

Xi X2 | Xs X4X2 + X3 

1 1 1 1 

1 1 0 0 

1 0 1 1 

1 0 0 1 

0 1 1 1 

0 1 0 0 

0 0 1 1 

0 0 0 0 с 
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Networks and Expressions 

It’s time to see how these ideas of logic gates, Boolean expressions, and truth 
functions are related. By combining AND gates, OR gates, and inverters, we can 
construct a logic network representing a given Boolean expression that produces 
the same truth function as that expression. 


| EXAMPLE 11 | The logic network for the Boolean expression хх + х; is shown in Figure 8.8. 


X, 


X3 


X3 


Figure 8.8 ө 


PRACTICE 9 | Design the logic network for the following Boolean expressions. 


24 
Оя am ey н 


Conversely, if we have a logic network, we can write a Boolean expression 
with the same truth function. 


| EXANPLE?2 | A Boolean expression for the logic network in Figure 8.9 is 


(хр + xs)! + x; 


Figure 8.9 e 


PRACTICE 10 


a. Write a Boolean expression for the logic network in Figure 8.10. 


Figure 8.10 
b. Write the truth function (in table form) for the network (and expression) of part (a). и 
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Logic networks constructed of AND gates, OR gates, and inverters are also 
called combinational networks. They have several features that we should note. 
First, input or output lines are not tied together except by passing through gates. 
Lines can be split, however, to serve as input to more than one device. There are 
no loops where the output of an element is part of the input to that same element. 
Finally, the output of a network is an instantaneous function of the input; there are 
no delay elements that capture and remember input signals. Notice also that the 
picture of any network is, in effect, a directed graph. 


Canonical Form 
Here is the situation so far (arrows indicate a procedure that we can carry out): 


truth function < Boolean expression <> logic network 


We can write a unique truth function from either a network or an expression. 
Given an expression, we can find a network with the same truth function, and 
conversely. The last part of the puzzle concerns how to get from an arbitrary truth 
function to an expression (and hence a network) having that truth function. An 
algorithm to solve this problem is explained in the next example. 


Suppose we want to find a Boolean expression for the truth function f of Table 8.4. 
There are four rows in the table (rows 1, 3, 4, and 7) for which fis 1. The basic 
form of our expression will be a sum of four terms 


ОООО 


such that the first term has the value 1 for the input values of row 1 and for по oth- 
ers, the second term has the value 1 for the input values of row 3 and for no others, 
and so on. Thus, the entire expression has the value 1 for these inputs and for no 
others—precisely what we want. (Other inputs cause each term in the sum, and 
hence the sum itself, to be 0.) 


TABLE 8.4 
X X2 | Хз | f(Xy X2, Хз) 
1 1 1 1 
1 1 0 0 
1 0 i i 
1 0 0 1 
0 1 1 0 
0 1 0 0 
0 0 1 1 
0 0 0 0 
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Each term in the sum will be a product of the form aBy where a is ei- 
ther x, or xj, B is either x, or x5, and y is either х; or x3. If the input value of 
x; i = 1, 2, 3, in the row we are working on is 1, then х, itself is used; if the input 
value of x; in the row we are working on is 0, then x; is used. These values will 
force aBy to be 1 for that row and 0 for all other rows. Thus, we have 


row 1: X4X3X3 
TOW 3: Xqx5xa 
row 4: X4X5X5 


TOW 7: X|X5X4 
The final expression is 


(х\хухз) + (x1x2x3) + (хүхух) + (Ххх) ө 


The procedure described in Example 13 always leads to an expression that is 
a sum of products, called the canonical sum-of-products form, or the disjunc- 
tive normal form, for the given truth function. The only case not covered by 
this procedure is when the function has a value of 0 everywhere. Then we use an 
expression such as 


XQ 


which is also a sum (one term) of products. Therefore, we can find a sum-of- 
products expression to represent any truth function. A pseudocode description 
of the algorithm is given in the accompanying box. For this algorithm, the input 
is a truth table representing a truth function on n variables xj, x5, ... , x,; the 
output is a Boolean expression in disjunctive normal form with the same truth 
function. 


ALGORITHM 


SUM-OF-PRODUCTS 


Sum-Of-Products (truth table; integer n) 

//the truth table represents a truth function with n arguments; 

//result is the canonical sum-of-products expression for this truth function 
Local variables: 


sum //sum-of-products expression 
product //single term in sum, a product 

i /hndex for the columns of the table 
TOW //index for the rows of the table 


sum = empty 
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for row = 1 to 2" do 
if truth value for row is 1 then 
initialize product; 
for i = 1 ton do 
if x; — 1 then 
put x; in product 
else 
put x; in product 
end if 
end for 
sum = sum + product 
end if 
end for 
if sum is empty then 
sum = xxi 
end if 
write (“The canonical sum-of-products expression for this truth function is”, sum) 
end Sum-Of-Products 


Figure 8.11 


Because any expression has a corresponding network, any truth function has 
a logic network representation. Furthermore, the AND gate, OR gate, and inverter 
are the only devices needed to construct the network. Thus, we can build a net- 
work for any truth function with only three kinds of parts—and lots of wire! Later 
we will see that it is necessary to stock only one kind of part. 

Given a truth function, the canonical sum-of-products form just described 
is one expression that has this truth function, but it is not the only possible one. 
A method for obtaining a different expression for any truth function is given in 
Exercise 25 at the end of this section. 


The network for the canonical sum-of-products form of Example 13 is shown in 
Figure 8.11. We have drawn the inputs to each AND gate separately because it 
looks neater, but actually a single х, x», or хз input can be split as needed. 


> XiX2X3 "FXQXSXS КУО ENAN 
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PRACTICE 11 


a. Find the canonical sum-of-products form for the truth function | TABLE 8.5 
of Table 8.5. E X x, | foc, Xp, Xa) 
b. Draw the network for the expression of part (a). i = s a 
i 1 1 1 
1 1 0 0 
1 0 1 1 
1 0 0 1 
0 1 1 0 
0 1 0 0 
0 1 1 
0 0 1 
m 


Minimization 

As already noted, a given truth function may be represented by more than one 
Boolean expression and hence by more than one logic network composed of AND 
gates, OR gates, and inverters. 


|. EXAMPLE S | The Boolean expression 


Sk qr d$ 


has the truth function of Table 8.5. The logic network corresponding to this expres- 
sion is given by Figure 8.12. Compare this with your network in Practice 11(b)! 


Xi 


X3 


X5 


Figure 8.12 e 


DEFINITION EQUIVALENT BOOLEAN EXPRESSIONS 
Two Boolean expressions are equivalent if they have the same truth functions. 


We know that 


XjX3X4 + ХХХ ОХХХ, + xika + XXX 
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and 
хуз + x} 


for example, are equivalent Boolean expressions. 

Clearly, equivalence of Boolean expressions is an equivalence relation on the 
set of all Boolean expressions in п variables. Each equivalence class is associated 
with a distinct truth function. Given a truth function, algorithm Sum-Of-Products 
produces one particular member of the class associated with that function, name- 
ly, the canonical sum-of-products form. However, if we are trying to design the 
logic network for that function, we want to find a member of the class that is as 
simple as possible. We would rather build the network of Figure 8.12 than the one 
for Practice 11(b). 

How can we reduce a Boolean expression to an equivalent, simpler expres- 
sion? We can use the properties of a Boolean algebra because they express the 
equivalence of Boolean expressions. If P is a Boolean expression containing the 
subexpression (x, + x;)(x, + x3), for example, and О is the expression obtained 
from P by replacing (x, + x )(x, + x3) with the equivalent expression x, + (x3), 
then P and О are equivalent and Q is simpler than Р. 


ИШЕТ Using the properties of Boolean algebra, we can reduce 


Piles ТОЖУ dr Ж: Т ОХ Ооуу ТОЛУ, 
to 
Sees aP 35s 
as follows: 


КО ЛААЛ UP TERRES "P SORS "P IOTER 
1X2X3 1X2X3 1X2X3 1X2X3 1X2X3 


= А F ОХХХ, ОХХХ, T ОХХХ, F ХХХ F ОХОО (idempotent) 
= Wii ОХХХ IP ОХХХ TF XOU T сарон Тох (1b) 
= Оо ох) Fa t a) F xri t a) (3b) 
= lean Il eraa 1 (5a) 
= хх, ОХО ЧР Јар (4b) 
Rea ar dese, F 33 (1b) 
Xa Fx T X) (3b) 
= xx tx Il (5a) 
= xx tx (4b) e 


Unfortunately, one must be fairly clever to apply Boolean algebra properties to 
simplify an expression. In Section 8.3 we will discuss more systematic approaches 
to this minimization problem that require less ingenuity. For now, we should say 
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a bit more about why we want to minimize. When logic networks were built from 
separate gates and inverters, the cost of these elements was a considerable factor 
in the design, and it was desirable to have as few elements as possible. Now, how- 
ever, most networks are built using integrated circuit technology, a development 
that began in the early 1960s. An integrated circuit is itself a logic network rep- 
resenting a certain truth function or functions, just as if some gates and invert- 
ers had been combined in the appropriate arrangement inside a package. These 
integrated circuits are then combined as needed to produce the desired result. 
Because the integrated circuits are extremely small and relatively inexpensive, it 
might seem pointless to bother minimizing a network. However, minimization is 
still important because the reliability of the final network is inversely related to the 
number of connections between the integrated circuit packages. 

Moreover, the designers of integrated circuits are highly interested in the 
minimization problem. Integrated circuits are embedded into a substrate of silicon 
or other semiconductor material. The resulting chips may be tiny, yet they can 
contain the equivalent of 3 billion transistors for implementing truth functions. 
The distance between two gates may be as small as 45 nanometers (about 1/1000 
of the width of a human hair). Minimizing the number of components and the 
amount of wiring required to realize a desired truth function makes it possible to 
embed more functions in a single chip. 


Programmable Logic Devices 

Instead of designing a custom chip to implement particular truth functions, a PLD 
(programmable /ogic device) can be used. A PLD is a chip that is already im- 
planted with an array of AND gates and an array of OR gates, together with a rect- 
angular grid of wiring channels and some inverters. Once Boolean expressions in 
sum-of-products form have been determined for the truth functions, the required 
components in the PLD are activated. Although this chip is not very efficient and is 
practical only for smaller-scale circuit logic, on the order of hundreds of gates, the 
PLD can be mass-produced, and only a small amount of time (i.e., money) is then 
required to “program” it for the desired functions. A FPGA (field-programmable 
gate array) is a big brother to the PLD. The term “field-programmable” suggests 
that, like a PLD, the user can configure the chip for a specific purpose. An FPGA 
basically connects a number of PLDs in a reconfigurable way, and it can support 
thousands of gates. Often the FPGA also contains hardwired components such as 
multipliers or even processors and memory, thus producing a small reconfigurable 
computer that is “programmed” in hardware rather than software. 


Figure 8.13a shows a PLD for the three inputs x;, x2, and хз. There are four output 
lines, so four functions can be programmed in this PLD. When the PLD is pro- 
grammed, the horizontal line going into an AND gate will pick up certain inputs, 
and the AND gate will form the product of these inputs. The vertical line going 
into an OR gate will, when programmed, allow the OR gate to form the sum of 
certain inputs. Figure 8.13b shows the same PLD programmed to produce the truth 
functions / from Example 13 (xjxjx4 + xjxox4 + xxx} + ххх) and f from 
Practice 11 (xjxox4 + xxix; + xxix} + xpoxs + xjxox3). The dots represent ac- 
tivation points. 
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Input Output 


Figure 8.13 


A Useful Network 


We can design a network that adds binary numbers, a basic operation that a com- 
puter must be able to perform. The rules for adding two one-bit numbers are sum- 


marized in Table 8.6. 
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TABLE 8.6 TABLE 8.7 TABLE 8.8 
X X, | Sum X хә S X х2 с 
1 1 10 1 1 0 1 1 1 
1 0 1 1 0 1 1 0 0 
0 1 1 0 1 1 0 1 0 
0 0 0 0 0 0 0 0 0 


We can express the sum аз a single sum bit s (the right-hand bit of the actual sum) to- 
gether with a single carry bit c; doing this gives us the two truth functions of Tables 8.7 
and 8.8, respectively. The canonical sum-of-products form for each truth function is 


S = XiX + ху) 
C = ху 
An equivalent Boolean expression for s is 
= [à 
s = (x, + xj)(xix;) 


Figure 8.14a shows a network with inputs x, and x; and outputs s and c. This de- 
vice, for reasons that will be clear shortly, is called a half-adder. 


„== 


x, ———— 


хо-—————— 


(a) Half-adder 


Half-adder 
c;-1 
Half-adder 
X, 5 
X, ———- 
Д 
x, el аге Ci 


(b) Full-adder 
Figure 8.14 


| 650 Boolean Algebra and Computer Logic 


To add two n-bit binary numbers, we add column by column from the low- 
order to the high-order bits. The ith column (except for the very first column) has 
as input its two bits x, and x, plus the carry bit from the addition of column i — 1 
to its right. Thus we need a device incorporating the previous carry bit as input. 
Such a device can be accomplished by adding x, and x, with a half-adder and then 
adding the previous carry bit c;_ | (using another half-adder) to the result. Again, a 
sum bit s and final carry bit c; are output, where с; is 1 if either half-adder produces 
a 1 as its carry bit. The full-adder is shown in Figure 8.14b. The full-adder is thus 
composed of two half-adders and an additional OR gate. 

To add two n-bit binary numbers, the two low-order bits, where there is no input 
carry bit, can be added with a half-adder. The remaining bits are added with full- 
adders. АП are chained together. Figure 8.15 shows the modules required to add two 
3-bit binary numbers z,y,x, and z; уух), resulting in the answer a3a5a,a9, where the 
leading bit а; is the final carry bit and could be 0 (leading 05 are usually not written) 


or 1. 
[ | 
С] 
— r- 
Full-adder Full-adder Half-adder 
БИЙ 
Figure 8.15 


The adder circuit shown in Figure 8.15 is called a “ripple-carry adder” because 
the carry bits have to propagate right to left through each adder in turn. Although 
we have assumed that gates output instantaneously, there is in fact a small time 
delay in an n-bit adder due to this ripple effect that can be appreciable for large л. 
Variations on the basic circuitry that speed up the addition process depend on 
anticipating the higher-order carry bits. 


PRACTICE 12 | Trace the operation of the circuit in Figure 8.15 as it adds 101 and 111. 


Other Logic Elements 


The basic elements used in integrated circuits are not really AND and OR gates 
and inverters, but NAND and NOR gates. Figure 8.16 shows the standard sym- 
bol for the NAND gate (the NOT AND gate) and its truth function. The NAND 
gate alone is sufficient to realize any truth function because networks using only 
NAND gates can do the job of inverters, OR gates, and AND gates. Figure 8.17 
shows these networks. 
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221 X (x, + х) 

1 1 0 

1 0 1 

0 1 1 

0 0 1 
Figure 8.16 


' 
X1 
Xx, 
| А еу 
(х'\х')'= xj * X 
' 
x2 


(b) 


(xx) xx) = хх, + хх, = хх» 


(с) 


Figure 8.17 


The NOR gate (the NOT OR gate) and its truth function appear in Figure 8.18. An 
exercise at the end of this section asks you to construct networks using only NOR 
gates for inverters, OR gates, and AND gates.’ 


'Exercises 51 and 52 of Section 1.1 give truth tables for binary connectives that agree with the NAND and 
NOR truth functions. There you were asked to prove that either of these two connectives is sufficient to write 
any propositional wff; that is, you can write V, /\, and ' in terms of one of these connectives. 


PRACTICE 13 
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X X; | (+ 2%)’ 

1 1 0 

1 0 0 

0 1 0 

0 0 1 
Figure 8.18 


Although we can construct a NAND network for a truth function by replacing 
AND gates, OR gates, and inverters in the canonical form or a minimized form 
with the appropriate NAND networks, we can often obtain a simpler network by 
using the properties of NAND elements directly. 


a. Rewrite the network of Figure 8.12 with NAND elements by directly replacing the AND gate, 
OR gate, and inverter, as in Figure 8.17. 

b. Rewrite the Boolean expression xx, + x; for Figure 8.12 using De Morgan’s laws, and then 
construct a network using only two NAND elements. = 


EXAMPLE 18 


Constructing Truth Functions 


We know how to write a Boolean expression and construct a network from a 
given truth function. Often the truth function itself must first be deduced from the 
description of the actual problem. 


At a mail-order cosmetics firm, an automatic control device is used to supervise 
the packaging of orders. The firm sells lipstick, perfume, makeup, and nail polish. 
As a bonus item, shampoo is included with any order that includes perfume or any 
order that includes lipstick, makeup, and nail polish. How can we design the logic 
network that controls whether shampoo is packaged with an order? 

The inputs to the network will represent the four items that can be ordered. 
We label the items 


xı = lipstick 
X, = perfume 
x3 = makeup 
x4 = nail polish 


The value of x; will be 1 when that item is included in the order and 0 otherwise. 
The output from the network should be 1 if shampoo is to be packaged with the 
order and 0 otherwise. The truth table for the circuit appears in Table 8.9. The ca- 
nonical sum-of-products form for this truth function 15 lengthy, but the expression 
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X1X3X4 + x, also represents the function. Figure 8.19 shows the logic network for 
this expression. 


TABLE 8.9 
X | X2 | X3 | X, | f(x Xo Хз, Ха) 
1 1 1 1 1 
1 1 1 0 1] 
1 1 0 1 1 
1 1 0 0 1 
1 0 1 1 1 
1 0 1 0 0 
1 0 0 1 0 
1 0 0 0 0 
0 1 1 1 1i 
0 1 1 0 {l 
0 1 0 1 1 
0 1 0 0 1 
0 0 1 1 0 
0 0 1 0 0 
0 0 0 1 0 
0 0 0 0 0 


XiX3X4 + X2 


Figure 8.19 ө 


PRACTICE 14 | A hall light is controlled by two light switches, one at each end of the hall. Find (a) a 
truth function, (b) a Boolean expression, and (c) a logic network that allows the light to be 
switched on or off by either switch. 


In some problems the corresponding truth functions have certain undefined 
values because certain combinations of input cannot occur (see Exercise 35 at 
the end of this section). Under these “don’t-care” conditions, any value may be 
assigned to the output. 

In a programming language where the Boolean operators AND, OR, and 
NOT are available, designing the logic of a computer program may consist in part 
of choosing appropriate truth functions and their corresponding Boolean expres- 
sions (see Exercise 36 of Section 1.1). 


SPECIAL INTEREST PAGE 


Pruning Chips and Programs 


Computer computations (barring coding errors) are 
generally viewed as the height of accuracy. Translat- 
ing truth functions into circuits that produce correct 
outputs for given inputs has been the theme of this 
chapter. But some computer applications do not re- 
quire 100% accuracy and can tolerate a certain amount 
of error. Image processing is one such application be- 
cause small variations in an image are imperceptible 
to the human eye. (This feature of human eyesight is 
used to advantage in JPEG lossy image compression, 
as discussed in Section 6.4.) 

Recently, researchers looking for increased ef- 
ficiency in chip design have “pruned” traditional 
circuits, essentially cutting away the parts that are sel- 
dom used. Pruned chips consume less energy and are 
both smaller and faster than the complete chip. We can 
lump these three factors—energy, space, and time— 
together under the general term “efficiency.” Experi- 
mental pruning of a chip’s addition circuit produced a 
7.5% gain in overall efficiency. But of course there is 
a tradeoff; some percentage of error is introduced in 
the chip’s operation. The 7.5% efficiency gain came at 
the cost of a 0.25% error rate. For specialized applica- 
tions such as image processing, where small amounts 
of error can be tolerated, the tradeoff seems to be a 
good thing. Developers of this technology are looking 
toward applications such as hearing aids, cameras, or 
even tablet computers that could run on solar power. 


ө Chapter 


The same approach is also being tried with soft- 
ware. For example, “loop perforation” skips some of 
the iterations in a loop. This is a particularly evocative 
name—“perforating a loop" sounds very much like 
“pruning a chip.” Other approaches may skip entire 
tasks or randomly discard some input values. “Re- 
laxed program" is a general title for software that has 
built-in nondeterminism that can dynamically prune 
(skip) some of its instructions or data. The benefit is 
increased runtime efficiency. The penalty is some per- 
centage of incorrect results. And the trick, of course, is 
to ensure—and formally verify—that such techniques, 
while increasing efficiency, keep the output within an 
acceptable error range. 


“Inexact Design— Beyond Fault Tolerance," Anthes, G., 
Communications of the ACM, April, 2013. 

http://news.rice.edu/2012/05/17/computing-experts-unveil- 
superefficient-inexact-chip/ 

http://web.mit.edu/newsoffice/2010/fuzzy-logic-0103.html 

“Proving Acceptability Properties of Relaxed Nondeter- 
ministic Approximate Programs," Carbin, M., Kim, 
D., Misailovic, S., Rinard, M., ACM Conference on 
Programming Language Design and Implementation, 
June 11—16, 2012, Beijing, China. 

http://web.mit.edu/newsoffice/2012/loop-perforation-0522. 
html 
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SECTION 8.2 REVIEW 


TECHNIQUES MAIN IDEAS 

* Find the truth function corresponding to a given ° We can effectively convert information from any of 
Boolean expression or logic network. the following three forms to any other form: 

W) Construct a logic network with the same truth func- 
tion as a given Boolean expression. truth function <> Boolean expression <> 

* Write a Boolean expression with the same truth logic network 


function as a given logic network. 
W) Write the Boolean expression in canonical sum-of- ° А Boolean expression can sometimes be converted 


products form for a given truth function. to a simpler, equivalent expression using the prop- 
* Findanetwork composed only of NAND gates that erties of Boolean algebra, thus producing a simpler 
has the same truth function as a given network with network for a given truth function. 


AND gates, OR gates, and inverters. 
* Finda truth function that satisfies the description of 
a particular problem. 


EXERCISES 8.2 


For Exercises 1—4, write a truth function and construct a logic network using AND gates, OR gates, and inverters 
for each of the given Boolean expressions. 


1. (x + X2)x3 


2. (x, + xj) + хуз 
3. хо + (д)! 
4. (x, + х,)' х3 + xi 
For Exercises 5—8, write a Boolean expression and a truth function for each of the logic networks shown. 


5. Ж 


No 
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a. Write the truth function for the Boolean operation x (B y = xy’ + ух’. 
b. Draw the logic network for x © y. 
с 


. Show that the network of the accompanying figure also represents x ® y. Explain why the network 
illustrates that ® is the exclusive OR operation. (Recall that a bitwise exclusive-OR operation is used 
in DES encoding, as discussed in Section 5.6.) 


— D 
Е 


10. a. Write the truth function for ће Boolean expression 
(ху) Ox) 


b. Draw ће logic network for this expression. 


c. By looking at either the truth function or the logic network, what propositional logic connective does 
this Boolean expression represent? 


For Exercises 11—20, find the canonical sum-of-products form for the truth functions in the given tables. 


П. ху | X2 | fp, х) 12. Xi | X2 | f(x, х) 13; х, | х, | х, | fos xs xy) 
1 1 0 1 1 1 1 1 1 1 
1 0 0 1 0 0 1 1 0 0 
0 1 0 0 1 1 1 0 1 0 
0 0 1 0 0 0 1 0 0 0 
0 1 1 1 
0 1 0 0 
0 0 1 0 
0 0 0 0 


14. 


16. 


18. 
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Ху | X2 | X3 | f(xi, X2, X3) 

1 1 1 0 

1 1 0 1 

1 0 | 1 1 

1 0 0 0 

O0 | 1 1 1 

0 | 1 0 0 

0 |0 1 0 
ого о 1 
Ху | X2 | Хз | F(X4, X2, X3) 

1 1 1 0 

1 1 0 1 

1 0 | 1 1 

1 0 |0 0 

0 | 1 1 0 

0 | 1 0 1 

0 |0 1 0 
оо |0 0 
Ху | X2 | Xa | X4 | f(x, X2, Хз, Ха) 
1 1 1 1 1 
1 1 1 0 0 
1 1 0 1 1 
1 1 оо 0 
1 0 1 1 1 
1 0 1 0 1 
1 о1о |1 0 
1 о1о о 0 
0 1 1 1 1 
0 1 1 0 0 
0 1 0 | 1 1 
0 1 0 |0 0 
о1о 1 1 0 
010 1 0 0 
оо o 1 0 
о1о о о 0 


15, 


17. 


Xi | X2 | хз | f(x4, X2, X3) 
түл 1 0 
11110 0 
1101 1 
1/0] 0 1 
0 | 1]| 1 0 
о1о 1 
0 |0 |1 0 
оо о 0 

Xi | х | хз | ха | ха, х2, Хз, X4) 
лл + 1 
| |1 0 0 
1|1|0 |1 1 
1|1|0 о 0 
1|0 1) 1 1 
1 | 0} 1) 0 0 
110 о 1 
1 то го о 0 
отт 0 
0.11.0 0 
ото |1 0 
ото о 0 
оо |1 |1 1 
0.0.1.0 1 
ооо |1 0 
оо о о 0 
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1 1 1 1 0 1 1 1 1 1 
1 1 1 0 0 1 1 1 0 1 
1 1 0 1 0 1 1 0 1 0 
1 1 0 0 0 1 1 0 0 1 
1 0 1 1 0 1 0 1 1 1 
1 0 1 0 1 1 0 1 0 0 
1 0 0 1 0 1 0 0 1 0 
1 0 0 0 0 1 0 0 0 0 
0 1 1 1 1 0 1 1 1 0 
0 1 1 0 0 0 1 1 0 1 
0 1 0 1 1 0 1 0 1 0 
0 1 0 0 0 0 1 0 | O 1 
0 | O 1 1 1 0 0 1 1 0 
0 | O 1 0 1 0 0 1 0 0 
0 | O 0 1 1 0 0 0 1 0 
0 0 0 0 0 0 0 0 0 0 


. Find the canonical sum-of-products form for the truth function in the accompanying table. 
. Draw the logic network for the expression of part (a). 


c. Use properties of a Boolean algebra to reduce the expression of part (a) to an equivalent expression 


whose network requires only two logic elements. Draw the network. 


Ху | х | Хз | f(x, X2, Хз) 
1 1 1 0 
1 1 0 1 
1 0 | 1 0 
1 0 0 1 
0 | 1 1 0 
0 | 1 0 0 
0 |0 1 0 
оо о 0 


. Find the canonical sum-of-products form for the truth function in ће accompanying table. 
. Draw the logic network for the expression of part (a). 


c. Use properties of a Boolean algebra to reduce the expression of part (a) to an equivalent expression 


whose network requires only three logic elements. Draw the network. 
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ху | X2 | X3 | f(x4, X2, Хз) 
1 1 1 1 

1 1 0 0 

1 0 1 0 

1 0 |0 0 

0 1 1 1 

0 1 0 1 

0 |0 |1 0 
оо 0 0 


23. а. Show that the two Boolean expressions 


(x, + ху)(ху + xa) + хз) 
and 
(1x3) + (ухо) 


are equivalent by writing the truth function for each. 
b. Write the canonical sum-of-products form equivalent to the two expressions of part (a). 
c. Use properties of a Boolean algebra to reduce one of the expressions of part (a) to the other. 


24. The accompanying figure shows an unprogrammed PLD for three inputs, x;, x», and x4. Program this PLD 
to generate the truth functions f, and f; represented by 


А ! ! 1 Ж, 
ЛА: ХХХ; + xqpxya + xix, 


Pc 


РЧ y ri EN : $ E 
Л: X1X2X3 Xq1X5X3 T X1X5X3 


23. 


2 


a 


27. 
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There is also a canonical product-of-sums form (conjunctive normal form) for any truth function. This 
expression has the form 


OOO 


with each factor a sum of the form 
ät Bp Ruro 


where а = x, or xj, В = x; or x5, and so on. Each factor is constructed to have a value of 0 for the input 
values of exactly one of the rows of the truth function having value 0. Thus, the entire expression has 
value 0 for these inputs and no others. Find the canonical product-of-sums form for the truth functions of 
Exercises 11—15. 


. Consider the truth function given here. Because there are many more 15 than Os. the canonical sum-of- 


products form might seem tedious to compute. Instead, create a Boolean sum-of-products expression us- 
ing the same formula as before but on the 0 rows instead ofthe 1 rows. This expression will give outputs of 
15 for those rows and no others, exactly the opposite of what you want. Then complement the expression 
(equivalent to sticking an inverter on the end of the network). 


X | X2 | Хз | fa, X2, X3) 
1 1 1 1 

1 1 0 1 

1 0 | 1 1 

1 0 |0 0 

0 | 1 1 1 

O0 | 1 0 1 

0 |0 |1 0 
оо 0 1 


a. Use this approach to create a Boolean expression for this truth function. 


b. Prove that the resulting expression is equivalent to the canonical product-of-sums form described in 
Exercise 25. 


The 25 complement of an n-bit binary number p is an n-bit binary number q such that p + q equals an 
n-bit representation of zero (any carry bit to column + 1 is ignored). Thus 01110 is the 2’s complement 
of 10010 because 


10010 
+ 01110 


(1)00000 


The 2's complement idea can be used to represent negative integers in binary form. After all, the negative 
of p is by definition a number that, when added to p, results in zero. 

Given a binary number p, the 2's complement of p is found by scanning p from low-order to high- 
order bits (right to left). As long as bit i of p is 0, bit i of q is 0. When the first 1 of p is encountered, say at 
bit j, then bit j of q is 1, but for the remaining bits, j < i = n, q; = pi. For p = 10010, for instance, the 


28. 


29. 


30. 


3 


n 


32. 
33. 
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rightmost 0 bit of p stays a 0 bit in q, and the first 1 bit stays a 1 bit. The remaining bits of q, however, are 
the reverse of the bits in p. 


First 1 

< 
p=100 i10 
q-011 ! 10 
qdi^p ' qi= р 


For each binary number p, find the 2° complement of p, namely д, and then calculate р + q. 
a. 1100 b. 1001 c. 001 


For any bit x; in a binary number p, let r; be the corresponding bit in q, the 2's complement of p (see 
Exercise 27). The value of r; depends on the value of x; and also on the position of x; relative to the first 
1 bit in p. For the ith bit, let c;..; denote a 0 if the bits р, 1 5j =i — 1, are0 anda 1 otherwise. A value 
c; must be computed to move on to the next bit. 


a. Give a truth function for r; with inputs x; and c;_,. Give a truth function for c; with inputs x; and c;. ,. 
b. Write Boolean expressions for the truth functions of part (a). Simplify as much as possible. 

c. Design a circuit module to output r; and c; from inputs x; and с; ;. 
d 


. Using the modules of part (c), design a circuit to find the 2's complement of a three-bit binary number 
zyx. Trace the operation of the circuit in computing the 2’s complement of 110. 


a. Construct a network for the following expression using only NAND elements. Replace the AND and 
OR gates and inverters with the appropriate NAND networks. 


xix; + xix, + x4 


b. Use the properties of a Boolean algebra to reduce the expression of part (a) to one whose network 
would require only three NAND gates. Draw the network. 


Replace the following network with an equivalent network using one AND gate, one OR gate, and one inverter. 


x ee 


. Using only NOR elements, construct networks that can replace (a) an inverter, (b) an OR gate, and (c) an 


AND gate. 
Find an equivalent network for the half-adder module that uses exactly five NAND gates. Draw the network. 


A thermostat controls a heating system that should raise the temperature above 67°F during working 
hours (between 7:00 AM and 6:00 рм). The input values are 


x, = 1 when the temperature is less than 67°F 

x, = 0 otherwise 

x, = 1 when the time is less than 7:00 AM or greater than 6:00 PM 
X, = 0 otherwise 


Find a truth function, a Boolean expression, and a logic network for when the heating system should be 
on (value 1) or off (value 0). 


34. 


35. 


36. 
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The space shuttle was controlled by three on-board computers; binary output from these three computers 
was compared and a majority vote was required for certain actions to take place. Find a truth function, a 
Boolean expression, and a logic network that outputs the majority vote of the three input values. 


You have just been hired at Mercenary Motors. Your job is to design a logic network so that a car can be 
started only when the automatic transmission is in neutral or park and the driver’s seat belt is fastened. 
Find a truth function, a Boolean expression, and a logic network. (There is a don’t-care condition to the 
truth function, since the car cannot be in both neutral and park.) 


Mercenary Motors has expanded into the calculator business. You need to design the circuitry for the 
display readout on a new calculator. This design involves a two-step process. 


а. Any digit 0, 1, ... , 9 put into the calculator is first converted to binary form. Part (a) of the accompany- 
ing figure illustrates this conversion, which involves four separate networks, one each for x, to x4. Each 
network has 10 inputs, but only 1 input can be on at any given moment. Write a Boolean expression and 
then draw a network for x. 


b. The binary form of the digit is then converted into a visual display by activating a pattern of seven out- 
puts arranged as shown in figure (b). To display the digit 3, for example, уу, y», уз, ys, and y; must be 
on, as in figure (c). Thus, the second step of the process can be represented by figure (d), which involves 
seven separate networks, one each for y, to y;, each with four inputs, x, to x4. Write a truth function, a 
Boolean expression, and a network for у; and for ye. 


Logic 
for 
binary 


encoder 


© бо М с л Ь оо э но 


y 
i Logic 
Уз Ys for 
y display 
с — readout 
Y6 Ут 
Уз 
(b) (c) 


(d) 
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. A multiplexor is a control circuit with 2" input lines (numbered 0 through 2" — 1), n selector lines, and 
exactly one output line. The signal on the output line is to match the signal on one of the input lines; the 
selector lines determine which of the input line signals will be propagated to the output line. Signals on 
the n selector lines determine an n-bit binary number, which can range in value from 0 to 2" — 1; hence 
the numeric value on the selector lines identifies exactly one input line. The arithmetic-logic unit of a com- 
puter processor contains circuits for various operations (addition, subtraction, comparison, and so forth), 
and when an arithmetic or comparison operation is to be done, the ALU may activate all these circuit. A 
multiplexor then picks out the one desired result. 


a. Write a truth function for a multiplexor where n = 1; that is, there are 2 input lines, 1 selector line and, 
of course, one output line. 


b. Draw the logic network (as simple as possible). 


A decoder is a control circuit with n input lines and 2" output lines (numbered 0 through 2" — 1). The pat- 
tern of n bits on the input lines represents a binary number between 0 and 2" — 1. A decoder activates the 
output line with the corresponding identification number by putting a 1 output on that line and 0 outputs 
on all other output lines. A decoder in a computer can, for example, read input lines that represent a binary 
memory address and then activate the line to that memory cell for a read operation. 


a. Write truth functions for a decoder where n — 2, that is, there are 2 input lines and 2? — 4 output lines 
(hence four truth functions). 


b. Draw the logic network that incorporates all four truth functions. 
. At the beginning of this chapter, you were 


... hired by Rats К Us to build the control logic for the production facilities for a new anticancer 
chemical compound being tested on rats. The control logic must manage the opening and closing 
of two valves, А and B, downstream of the mixing vat. Valve A 15 to open whenever the pressure 
in the vat exceeds 50 psi (pounds per square inch) and the salinity of the mixture exceeds 45 g/L 
(grams per liter). Valve B is to open whenever valve A is closed and the temperature exceeds 53?C 
and the acidity falls below 7.0 pH (lower pH values mean more acidity). 

How many and what type of logic gates will be needed in the circuit? 


Answer this question by finding the canonical sum-of-products form for the logic circuits to control А 
and B. 


SECTION 8.3 | MINIMIZATION 


Minimization Process 


Remember from Section 8.2 that a given truth function is associated with an 
equivalence class of Boolean expressions. If we want to design a logic network 
for the function, the ideal would be to have a procedure that chooses the simplest 
Boolean expression from the class. What we consider simple will depend on the 
technology employed in building the network, what kind of logic elements are 
available, and so on. At any rate, we probably want to minimize the total number 
of connections that must be made and the total number of logic elements used. (As 
we discuss minimization procedures, keep in mind that other factors may influ- 
ence the economics of the situation. If a network is to be built only once, the time 
spent on minimization is costlier than building the network. But if the network 
is to be mass-produced, then the cost of minimization time may be worthwhile.) 


664 Boolean Algebra and Computer Logic 


We have had some experience in simplifying Boolean expressions by apply- 
ing the properties of Boolean algebra. However, we had no procedure to use. We 
simply had to guess, attacking each problem individually. What we want now is 
a mechanical procedure that we can use without having to be clever or insightful. 
Unfortunately, we won’t develop the ideal procedure. However, we already know 
how to select the canonical sum-of-products form from the equivalence class of 
expressions for a given truth function. In this section we will discuss two proce- 
dures to reduce a canonical sum-of-products form to a minimal sum-of-products 
form. Therefore, we can minimize within the framework of a sum-of-products 
form and reduce, if not completely minimize, the number of elements and con- 
nections required. 


| EXAMPLE 19 | The Boolean expression 


Х\ХоХз + XIXX3 + Xqxoxá 

is in sum-of-products form. An equivalent minimal sum-of-products form is 

XoX4 + XX1 
Implementing a network for this form would require two AND gates, one OR gate, 
and an inverter. Using one of the distributive laws of Boolean algebra, this expres- 
sion reduces to 

X(x + xi) 
which requires only one AND gate, one OR gate, and an inverter, but it is no lon- 


ger in sum-of-products form. Thus, a minimal sum-of-products form may not be 
minimal in an absolute sense. ө 


There are two extremely useful equivalences in minimizing a sum-of-products 
form. They are 
хүр + ХХ) = X; 
and 


Xx; + xix = x, + X 


PRACTICE 15 | Use properties of Boolean algebra to reduce the following expressions as indicated: 


ашуу + Xj Xo, t0 X; 
БРЕ у о E ш 


The equivalence xjx; + xix) = x) means, for example, that the expression 
ХХХ + xiX5X4x4 reduces to xjx4x4. Thus, when we have a sum of two products 
that differ in only one factor, we can eliminate that factor. However, the canonical 
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sum-of-products form for a truth function of, say, four variables might be quite 
long and require some searching to locate two product terms differing by only one 
factor. To help us in this search, we can use the Karnaugh map. The Karnaugh 
map 15 a visual representation of the truth function so that terms in the canonical 
sum-of-products form that differ by only one factor can be matched quickly. 


Karnaugh Map 


In the canonical sum-of-products form for a truth function, we are interested in 
values of the input variables that produce outputs of 1. The Karnaugh map records 
the 1s of the function in an array that forces products of inputs 

differing by only one factor to be adjacent. The array form for xy, xi 

a two-variable function is given in Figure 8.20. Notice that the 

square corresponding to xx», the upper left-hand square, is ad- 2 

jacent to squares xix, and хүхә, which differ in one factor from _, 
ху; however, it is not adjacent to the xix, square, which differs 
in two factors from xx». Figure 8.20 


The truth function of Table 8.10 is represented by the Karnaugh map of Figure 
8.21. At once we can observe 15 in two adjacent squares, so there are two terms in 
the canonical sum-of-products form differing by one variable; again from the map, 
we see that the variable that changes is x,. It can be eliminated. We conclude that 
the function can be represented by x,. Indeed, the canonical sum-of-products form 
for the function is xix; + xix, which, by our basic reduction rule, reduces to x. 
However, we did not have to write the canonical form—we only had to look at the 


map. 
TABLE 8.10 
Xi | X2 f(x, x2) 
1 1 1 
1 0 0 
0 1 1 
d А Š Figure 8.21 e 


PRACTICE 16 | Draw the Karnaugh map and use it to find a reduced expression for the function in 


Table 8.11. 
TABLE 8.11 
x X2 |Ңх,х›) 
1 1 0 
1 0 0 
0 1 1 
0 0 1 c 


REMINDER 


It is crucial to label the 
Karnaugh map so that 
adjacent squares differ by 
one variable. 
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Maps for Three and Four Variables 

The array forms for functions of three and four variables are shown in Figure 8.22. 
In these arrays, adjacent squares also differ by only one variable. However, in 
Figure 8.22a, the leftmost and rightmost squares in a row also differ by one vari- 
able, so we consider them adjacent. (They would in fact be adjacent if we wrapped 
the map around a cylinder and glued the left and right edges together.) In Figure 
8.22b, the leftmost and rightmost squares in a row are adjacent (differ by exactly 
one variable), and also the top and bottom squares in a column are adjacent. 


XX2 XX} ХХ) XO 


X3X4 
, Vt , " 
Мо АХ Хх XM X3X4 
X3 XX4 
, А 
Хз X3X4 
(a) (b) 


Figure 8.22 


In three-variable maps, when two adjacent squares are marked with 1, one vari- 
able can be eliminated; when four adjacent squares are marked with 1 (either in a 
single row or arranged in a square), two variables can be eliminated. 

The labelings for two-, three-, and four-variable maps can be done in various 
ways, but they must be done so that adjacent squares differ by one variable. It's 
probably best to just memorize the labeling schemes we've used here. 


In the map of Figure 8.23, the squares that combine for a reduction are shown as a 
block. These four adjacent squares reduce to х; (eliminate the changing variables 
x, and х). The reduction uses our basic reduction rule more than once: 


аро ОХХХ, T BOBO IF SONS = Std oo IP oes) "P epee) тео) 
= ХХ + XiX 
ER 1 
= 0 


= х; 


But, again, you don’t have to go through this process; you can just look at the 
Karnaugh map in Figure 8.23. 


Xi Ойу KIX х 


Figure 8.23 e 


| Section 8.3 Minimization 667 


In four-variable maps, when two adjacent squares are marked with 1, one 
variable can be eliminated; when four adjacent squares are marked with 1, 
two variables can be eliminated; when eight adjacent squares are marked with 
1, three variables can be eliminated. 

Figure 8.24 illustrates some instances of two adjacent marked squares, Figure 8.25 
illustrates some instances of four adjacent marked squares, and Figure 8.26 shows 
instances of eight. 


XU ОХХ) XX} X1X» 


Figure 8.24 


хо XX) Хх) кю XX ХХ» Хх) XIX. 


Figure 8.25 


XX. Xxx XIX кю хо XK Хх») кю 
X3X4 
1 
X3X4 
KG 
X3X4 


ХУ 


Figure 8.26 
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| EXANPLE22 | In the map of Figure 8.27, the four outside corners reduce to х›х and the inside 


square reduces to хохд. 


X3X4 
П 
X3X4 
De 
X3X4 
! 
X3X4 
Figure 8.27 ө 


PRACTICE 17 | Find the two terms represented by the map in Figure 8.28. 


XX2 XX3 Хх) XN 
X3X4 
D 
X3X4 
NE) 
X3X4 


X3X4 


Figure 8.28 L| 


Using the Karnaugh Map 

How do we find a minimal sum-of-products form from a Karnaugh map (or from 
a truth function or a canonical sum-of-products form)? We must use every marked 
square ofthe map, and we want to include every marked square in the largest com- 
bination of marked squares possible, since doing so will reduce the expression as 
much as possible. Surprisingly, we cannot begin by simply looking for the largest 
blocks of marked squares on the map. 


In the Karnaugh map of Figure 8.29, if we simply looked for the largest block of 
marked squares, we would use the column of 15 and reduce it to x;x;. However, 
we would still have four marked squares unaccounted for. Each of these marked 
squares can be combined into a two-square block in only one way (Figure 8.30), 
and each of these blocks has to be included. But when this adjustment is made, 
every square in the column of 15 is used, and the term xx; is superfluous. The 
minimal sum-of-products form for this map becomes 


КОО d- Lapeer шша P А, 
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Xo Ойл sar ОР) imo Ойл ce BO 


Figure 8.29 Figure 8.30 e 


To avoid the redundancy illustrated by Example 23, we analyze the map as 
follows. First, we form terms for those marked squares that cannot be combined 
with anything. Then we use the remaining marked squares to find those that can be 
combined only into two-square blocks and in only one way. Then among the un- 
used marked squares—that is, those not already assigned to a block—we find those 
that can be combined only into four-square blocks and in only one way; then we 
look for any unused squares that go uniquely into eight-square blocks. At each step, 
if an unused marked square can go into more than one block, we do nothing with 
it. Finally, we take any unused marked squares that are left (for which there was a 
choice of blocks) and select blocks that include them in the most efficient manner. 

Table 8.12 shows the steps involved. Note, however, that this procedure for 
handling Karnaugh maps is not, strictly speaking, an algorithm because it doesn't 
always produce the correct result. If there are many 15 in the map, thus allowing 
many different blockings, even this procedure may not lead to a minimal form 
(see Example 28). 


TABLE 8.12 


Steps in Using Karnaugh Maps 


1. Set up the grid, using correct labeling for the number of Boolean variables. 


2. Insert 1s in the table for the terms in the canonical sum-of-products expression. 


3. Form terms for any isolated marked squares. 


4. Combine squares uniquely into two-square blocks, if possible. 


5. Combine squares uniquely into four-square blocks, if possible. 


6. Combine squares uniquely into eight-square blocks, if possible. 


7. Combine any remaining unused marked squares into blocks as efficiently as possible. 


In Figure 8.31a we show the only square that cannot be combined into a larger 
block. In Figure 8.31b, we have formed the unique two-square block for the ххх; 
square and the unique two-square block for the x,x5x4 square. All marked squares 
are covered. The minimal sum-of-products expression is 


ХХ, Ж 305303 охх) 
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(ЖА Д6 1 


Formally, the last two terms are obtained by expanding xx2x5 into xjx3x3 + xix5xà 
and then combining it with each of its neighbors. 


Spey Avy asain р Xm AE Bobi) depen 


Figure 8.31 


Figure 8.32a shows the unique two-square blocks for the x(x;x4x4 square and the 
xyx5x3x4 square. In Figure 8.32b the two unused squares have been combined into 
a unique four-square block. The minimal sum-of-products expression is 


Жүз F ora Т ОРОВ 


х хх кх к х х0 KIX ШЖ 

X3X4 X3X4 

ГА , 

X3X4 X3X4 

LE "arri 

X3X4 X3X4 

, , 

X3X4 X3X4 
Figure 8.32 


Figure 8.33a shows the unique two-square blocks. We can assign the remaining 
unused marked square to either of two different two-square blocks; these blocks 
are shown in Figure 8.33b. There are two minimal sum-of-products forms, 

XQjXoX4 + XqX3Xa + X3X3X4 
and 


XjX4X4 ЧЕ X]XoXa + XqX3X4 


Either can be used, as they are equally efficient. 
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Ju Teh ЛО» — S4, 


X3X4 
j 
X3X4 
Ted 
X3X4 


X5X4 


Figure 8.33 e 


Figure 8.34a shows the unique two-square and four-square blocks. The remaining 
two unused marked squares can be assigned to two-square blocks in two different 
ways, as shown in parts (b) and (c). Assigning them together to a single two-square 
block is more efficient because it produces a sum-of-products form with three 
terms rather than four. The minimal sum-of-products expression is 


а аР IP ЖОШО, 
, үл, J , T A y, 
Хә XX2 XX2 XX2 XX» XU» Хх х 
X3X4 
y 
X3X4 
X3X4 


X53X4 


Figure 8.34 e 


ЭДШ ЛШ Consider the map of Figure 8.35a. Here the two unique four-square blocks deter- 
mined by the squares with * have been chosen. In Figure 8.35b, the remaining un- 
marked squares, for which there was a choice of blocks, are combined into blocks 
as efficiently as possible. The resulting sum-of-products form is 


Жүз SP ЖҮЛ, IP asa ОХХ» Олд, 
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Yet in Figure 8.35c, choosing a different four-square block at the top leads to a 
simpler sum-of-products form, 


Жой T Tepes AP Xa T SORA 


Imm 3p iuo dm 


Figure 8.35 ө 


PRACTICE 18 | Write the minimal sum-of-products expression for the map shown in Figure 8.36. 


X3X4 
! 
X3X4 


XX4 


X4X4 


Figure 8.36 L| 


If the Karnaugh map corresponds to a function with don't-care conditions, 
then the don't-care squares on the map can be left blank or assigned the value 1, 
whichever aids the minimization process. 

We have used Karnaugh maps for functions of two, three, and four variables. 
By using three-dimensional drawings or overlapping transparency sheets, Kar- 
naugh maps for functions of five, six, or even more variables can be constructed, 
but the visualization gets too complicated to be worthwhile. The next procedure 
works for any number of variables. 
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Quine-McCluskey Procedure 


Remember that the key to reducing the canonical sum-of-products form for a truth 
function lies in recognizing terms of the sum that differ in only one factor. In the 
Karnaugh map, we see where such terms occur. A second method of reduction, 
the Quine—McCluskey procedure, organizes information from the canonical sum- 
of-products form into a table to simplify the search for terms differing by only 
one factor. 

The procedure is a two-step process paralleling the use of the Karnaugh map. 
First we find groupings of terms (just as we looped together marked squares in 
the Karnaugh map); then we eliminate redundant groupings and make choices for 
terms that can belong to several groups. 


Let's illustrate the Quine-McCluskey procedure by using the truth function for 
Example 23. We did not write the actual truth function there, but the information 
is contained in the Karnaugh map. The truth function is shown in Table 8.13. The 
eight 4-tuples of 0s and 1s producing a function value of 1 are listed in Table 8.14, 
which is separated into four groupings according to the number of 15. Note that 
terms of the canonical sum-of-products form differing by only one factor must be 
in adjacent groupings, which simplifies the search for such terms. 


TABLE 8.13 

ху || 2H || 55 x ep хә хх) 

1 1 1 1 0 

1 1 1 0 0 

1] 1 0 | 1 0 

1 1 0 |0 0 

{l 0) | 4 1 1 

1 @ | 4 0 0 

ШАО КО 0 ТАВГЕ 8.14 

Т 0 0 0 i Number of 15 | х; | X2 | X3 | X4 
0 1 1 1 0 Three 1 0 1 1 
0 1 1 0 1 Two 0 1 1 0 
0 | 1 0 1 1 0 1 0 | 1 
@ | 3] оо 0 @ || @ || q 1 
0 | @ || 4 1 1 One 1 0 | © | @ 
@ | @ | 3 0 1 © |o)! 0 
0 0 1 1 оо 0 1 
оо 0 1 Мопе 0 | @ | e | @ 
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We compare the first term, 1011, with each of the three terms of the second 
group, 0110, 0101, and 0011, to locate terms differing by only one factor. Such a 
term is 0011. The combination 1011 and 0011 reduces to —011 when the chang- 
ing variable x, is eliminated. We will write this reduced term with a dash in the 
x, position in the first row of a new table. The new table is Table 8.15b, where 
we've just seen how the first row was obtained. We've rewritten the original 
Table 8.14 as Table 8.15a, but we have also marked the two terms 1011 and 0011 
in this table with a superscript 1. This superscript 1 is a pointer that indicates 
the row number of the reduced term in Table 8.15b that is formed from these 
two terms (numbering terms corresponds to putting loops in the Karnaugh map). 


TABLE 8.15 

Number of 15 | х; | X2 | хз | X4 Number of 15 | х; | X2 | X3 | X4 

Three 110 1111 |" Two - || т | 1 

Two One 0|-|11]|0 
3 01-10 |1 
iu 0 | @ | 3 - 

One s 0|0|- |1 
си None = | @ | © | © 
SPESE о1о - |0 

None ОЕ ооо | - 


(а) (b) 


We continue this process with all the terms in Table 8.15a. A numbered term 
may still be used in other combinations, just as a marked square in a Karnaugh 
map can be in more than one loop. When we are done, the result is the completed 
Table 8.15b shown, where the terms in this table are again grouped by the num- 
ber of 15. 

We now build still another table by processing the terms in Table 8.15b. 
Here not only the groupings but also the dashes help organize the search pro- 
cess, since terms differing by only one variable must have dashes in the same 
location. Tables 8.16a and 8.16b are the same as Tables 8.15a and 8.15b, and 
Table 8.16c is the new table. Again, numbers on terms in Table 8.16b that combine 
serve as pointers to the reduced terms in Table 8.16c. When we have processed 
all the terms in Table 8.16b, the reduction process cannot be continued. The un- 
numbered terms are irreducible, so they represent the possible maximum-sized 
loops on a Karnaugh map. 
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TABLE 8.16 
Number of 15 | х; | х, | X3 | X4 Number of 1s | х; | X2 | X3 | X4 
Three ї|@|т | |" Two ЕСО Е 
Two 0111110 | Опе @|=(|( | 
0111017 | 01-101 
© | @ | ad | a |74 О | @ ЕЕЕ Е 
Опе 11910 |0 | 0101-1" 
010127 | o |2 None = 01010 
0101010 | ОВ КОЯ 25 605 |: 
Мопе © | © | @ | © |2 ОКО О ЕУ Е 


(а) (b) 


Number of 1s | х; | X2 | X3 | X4 


None 0 0 - - 


(С) 


For the second step of the process, we compare the original terms with the 
irreducible terms. We form a table with the original terms as column headers 
and the irreducible terms (the unnumbered terms in the reduction tables just con- 
structed) as row labels. A check in the comparison table (Table 8.17) indicates that 
the original term in that column eventually led to the irreducible term in that row, 
which can be determined by following the pointers. 


TABLE 8.17 
1011 | 0110 | 0101 | 0011 | 1000 | 0010 | 0001 | 0000 
-011 e "4 
0-10 7 / 
0-01 "4 4 
-000 / / 
00— v 7 / / 


If а column in the comparison table has a check in only one row, the irreduc- 
ible term for that row is the only one covering the original term, so it is an essen- 
tial term and must appear in the final sum-of-products form. Thus, we see from 
Table 8.17 that the terms —011, 0—10, 0—01, and —000 are essential and must be 
in the final expression. We also note that all columns with a check in row 5 also 
have checks in another row and so are covered by an essential reduced term al- 
ready in the expression. Thus, 00—— is redundant. As in Example 23, the minimal 
sum-of-products form is 


309630. ЧЕ др рЫ, AK denen IE Te e 
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In situations where there is more than one minimal sum-of-products form, the 
comparison table will have nonessential, nonredundant reduced terms. A selec- 
tion must be made from these reduced terms to cover all columns not covered by 
essential terms. 


We will use the Quine-McCluskey procedure on the problem presented in Example 
26. The reduction tables are given in Table 8.18, and the comparison table appears 
in Table 8.19. 


TABLE 8.18 
Number of 15 | х; | X2 | X3 | X4 Number of 1s | х; | х, | X3 | X4 
Three ОЕ Е Е Two ola 1|- 
Two 1 | @ | ү © | One 2 СОЕД 0 

olada ol 1 @ | = || @ 
One оо a | © |? 0 | = | 1 0 

110440 |10 |“ 

(а) (р) 

ТАВГЕ 8.19 

0111 |1010 |0110 |0010 | 1000 

011- "A 4 

-010 J V 

10-0 J V 

0-10 v 4 


We see from the comparison table that 011— and 10—0 are essential reduced terms 
and that there are no redundant terms. The only original term not covered by es- 
sential terms is 0010, column 4, and the choice of the reduced term for row 2 or for 
row 4 will cover it. Thus, the minimal sum-of-products form, as before, is 


ХОЛУ ШАГАЛА ШЕЙЛА А, 
ог 


X|X3X3 P ДОЛЛА А 304 
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PRACTICE 19 | Use the Quine-McCluskey procedure to find a minimal sum-of-products form for the truth 


function in Table 8.20. 


TABLE 8.20 


X4 | X2 | X3 f(x1, X2, X3) 


While the Quine-McCluskey procedure applies to truth functions with any 
number of input variables, for a large number of variables, the procedure is extremely 
tedious to do by hand. However, it is exactly the kind of systematic, mechanical pro- 
cess that lends itself to a computerized solution. In contrast, Karnaugh maps make 
use of the human ability to quickly recognize visual patterns. 

If the truth function f has few 0 values and a large number of 1 values, it may 
be simpler to implement the Quine-McCluskey procedure for the complement of 
the function, f", which will have 1 values where f has 0 values, and vice versa. 
Once a minimal sum-of-products expression is obtained for f", it can be comple- 
mented to obtain an expression for f, although the new expression will not be in 
sum-of-products form. (In fact, by De Morgan's laws, it will be equivalent to a 
product-of-sums form.) We can obtain the network for f from the sum-of-products 
network for f" by tacking an inverter on the end. 

The whole object of minimizing a network is to simplify the internal configu- 
ration while preserving the external behavior. In Chapter 9 we will attempt the 
same sort of minimization on finite-state machine structures. 


SECTION 8.3 REVIEW 


TECHNIQUES MAIN IDEA 
W) Minimize the canonical sum-of-products form for — * Algorithms exist for reducing a canonical sum- 
a truth function by using a Karnaugh map. of-products form to a minimized sum-of-products 
W Minimize the canonical sum-of-products form for form. 


a truth function by using the Quine-McCluskey 


procedure. 


EXERCISES 8.3 


For Exercises 1—8, write the minimal sum-of-products form for the Karnaugh maps of the given figures. 
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l. Xi, ХХ» xix к 
X3 1 1 
X5 1 1 1 
2. Xi, ХХ» xixi к 
X3 1 1 
X3 1 
3. ХХ á XX Хх XID 
Хз 1 1 1 1 
x3 1 1 
4. XX. ХХ» xix) к 
X3 
X3 1 1 1 
5. Хх\хә ХХ» xix) X|X5 
X3X4 1 
X3XÀ 1 1 1 
X3X4 1 1 1 
X3X4 1 


For Exercises 9 and 10, use a Karnaugh map to find the minimal sum-of-products form for the truth functions 


shown. 
9. ху | X2 | Хз | f(xi, Xo, Хз) 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 0 0 0 
0 1 1 1 
0 1 0 0 
0 0 1 0 
0 0 0 0 


6. 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


X3X4 


ХАХА 


X3X4 


хх хр — Xp) хо 
1 1 1 
1 1 
XX х1Х» х\х» X|X2 
1 
1 1 1 


1 
XX х1Х» х\х» XiX 
1 
1 1 
1 1 
1 
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10. X4 | X2 | X3 | X4 f(x1, Хо, Хз, X4) 
1 1 1 1 1 
1 1 1 0 1 
1 1 0 1 1 
1 1 0 |0 1 
1 0 1 1 0 
1 0 1 0 1 
1 о1о 1 0 
1 оо о 1 
0 1 1 1 1 
0 1 1 0 1 
0 1 0 | 1 1 
0 1 0 |0 1 
о |0 1 1 0 
0|0 |1 0 0 
оо о 1 0 
оо о о 0 


11. Use а Karnaugh тар to find the minimal sum-of-products form for the truth function of Exercise 17, 
Section 8.2. 


12. Use a Karnaugh map to find the minimal sum-of-products form for the truth function of Exercise 18, 
Section 8.2. 


13. a. Use a Karnaugh map to find the minimal sum-of-products form for the truth function of Exercise 19, 
Section 8.2. 


b. Draw the logic network for the reduced expression of part (a). 


14. a. Use a Karnaugh map to find the minimal sum-of-products form for the truth function of Exercise 20, 
Section 8.2. 


b. Draw the logic network for the reduced expression of part (a). 


15. Use a Karnaugh map to find the minimal sum-of-products form for the following Boolean expression. 
X|X3X34X4 o xjX3x4x4 + xqx4x4x4 + xQx3Xax4 + XqXoxqx4 + XIXX3X4 + X1X3X3X4 
16. Use a Karnaugh map to find the minimal sum-of-products form for the following Boolean expression. 


X[X4X4X4  XQX3X4X4 + XqX2X4X4 A+ XQxax4x4 F iora + XKX 
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17. Use a Karnaugh map to find — S 
a minimal sum-of-products к —— 1] 
expression for the network of ^s 


three variables shown in the 
figure. Sketch the new net- 
work. 


+ 


18. At Rats R Us, you found a standard sum-of-products form for the logic to control valves А and B 
(Exercise 39, Section 8.2). Now earn yourself a raise by using Karnaugh maps to minimize these 
expressions. 


19. Use a Karnaugh map to find a minimal sum-of-products form for the truth function in the table. Don't- 
care conditions are shown by dashes. 


X х2 Хз X4 f(X1, X2, Хз, Ха) 
1 1 1 1 0 
1 1 1 0 1 
1 1 0 1 0 
1 1 0 0 - 
1 0 1 1 0 
1 0 1 0 - 
1 0 0 1 0 
1 0 0 0 0 
0 1 1 1 0 
0 1 1 0 1 
0 1 0 1 0 
0 1 0 0 1 
0 0 1 1 1 
0 0 1 0 0 
0 0 0 1 - 
0 0 0 0 0 
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20. Use a Karnaugh map to find a minimal sum-of-products form for the truth function in the table. Don’t- 
care conditions are shown by dashes. 


X X» Xs X4 f(x1, X2, Хз, Ха) 
1 1 1 1 0 
1 1 1 0 1 
1 1 0 1 0 
1 1 0 0 - 
1 0 1 1 - 
1 0 1 0 0 
1 0 0 1 0 
1 0 0 0 0 
0 1 1 1 1 
0 1 1 0 0 
0 1 0 1 1 
0 1 0 0 0 
0 0 1 1 1 
0 0 1 0 0 
0 0 0 1 - 
0 0 0 0 0 


21. Use the Quine-McCluskey procedure to find a minimal sum-of-products form for the truth function il- 
lustrated by the map for Exercise 3. 


22. Use the Quine- x 
McCluskey pro- 5 
cedure to find a 
minimal sum-of- 
products form for 
the network in x 
the figure. Sketch 
the new network. х 


d DEN 
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For Exercises 23 and 24, use the Quine-McCluskey procedure to find the minimal sum-of-products form for 
the truth functions in the given tables. 


23. 24. 


Xi X2 X3 X4 f(X1, Хә, Xa, Ха) Хі X2 Хз х4 f(X1, X2, Хз, Ха) 
1 1 1 1 0 1 1 1 1 1 
1 1 1 0 1 1 1 1 0 0 
1 1 0 | 1 0 1 1 0 | 1 1 
1 1 о | о 0 1 1 O | 0 В 
1 0 | 1 1 0 1 о | 1 1 1 
1 0 | 1 0 1 1 0 | 1 0 0 
т | о | 6 | 4 1 1 | 0 [о | 14 1 
1 о! 0 | о 1 1 0 | 0 | о 1 
0 | 1 1 1 0 0 | 1 1 1 1 
0 | 1 1 [о 0 0 | 1 1 | 0 d 
0 | 1 0 | 1 0 0 | 1 0 | 1 1 
0 | 1 o | о 1 o | 1 0 | 0 1 
o/ofi|4 1 оо [11 1 
о! о | 1 0 1 0 | 0 [1 0 0 
о | о | о [1 0 оо [о |1 1 
о | 0 | о | о 1 оо | o | о 1 


In Exercises 25—28, use the Quine—McCluskey procedure to find the minimal sum-of-products form for the 
Boolean expressions. 

25. xyxixaxa F xpxixaxí t xqixax3x4 + xixixixa + ХХХ + XU X3x3x4 

26. хух›хзха + XQx)xax4 t XQxqxax4 ОХХХ + XiXX3X4 + 


XXX Xa F HANNA F XXNX + ХХХ 


о, EPRE auf 


27. XyXx3X34X4 T raa F Aaaa t raaa T XXX + 


X[X3X4X4 T AXXa + Xq[X4X4X4 F ANXN 


28. x[xax3xaxz + хрхохахах5 T XKAN: + xq[x4xax4xs + 


хүхэ F XqX4Xi1X4Xs HAAS WINS + XqX3XaX4Xxi + 


XjX3X4X4Xs + XqXX3X4X5 
29. Use the Quine-McCluskey procedure to find a minimal sum-of-products form for the truth function 
illustrated by the map in Figure 8.34. 
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TERMINOLOGY 


AND gate (p. 639) 

Boolean algebra (p. 620) 

Boolean expression 
(p. 639) 

canonical sum-of-products form 
(p. 643) 

combinational network (p. 642) 

complement (of a Boolean algebra 
element) (p. 625) 

De Morgan’s laws (for a Boolean 
algebra) (p. 624) 

disjunctive normal form (p. 643) 


SELF-TEST 


double negation (for a Boolean 
algebra) (p. 624) 

dual (of a Boolean algebra 
property) (p. 623) 

equivalent Boolean expressions 
(p. 645) 

FPGA (p. 647) 

full-adder (p. 650) 

half-adder (p. 649) 

idempotent property (of a Boolean 
algebra) (p. 622) 

inverter (p. 639) 


Answer the following true-false questions. 


Section 8.1 


1. In any Boolean algebra, х + x’ = 0. 

2. Set theory is an instance of a Boolean algebra in 
is set intersection. 
3. In any Boolean algebra, x + (y + х:2) = х + у. 2. 
4. The dual of the equation in the previous statement 


which + is set union and · 


is x*[y- (x + 2] = x: y. 


5. Any two Boolean algebras with 16 elements are 


isomorphic. 


Section 8.2 


isomorphic instances of a structure 
(p. 626) 

isomorphism (p. 626) 

isomorphism for Boolean algebras 
(p. 629) 

NAND gate (p. 650) 

NOR gate (p. 651) 

OR gate (p. 639) 

PLD (p. 647) 

truth function (p. 640) 

universal bound property (of a 
Boolean algebra) (p. 624) 


1. A logic network for the Boolean expression 

(x + y)' could be built using one AND gate and 
two inverters. 

The canonical sum-of-products form for a truth 
function f: (0, 1)" = {0, 1} has и terms. 


3. Two single-bit binary numbers can be added using 


a network consisting of two half-adders. 


4. The following two logic networks represent the 
same truth function: 


x 
y 


EA e o 


5. The most efficient way to construct a logic net- 


work for a given truth function using only NAND 
gates is to construct the logic network using AND, 
OR, and NOT gates and then replace each of these 
elements with its equivalent form in NAND gates. 
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Section 8.3 

1. A Karnaugh map is a device to help change a 
canonical sum-of-products form to a reduced sum- 
of-products form. 

2. The 1s in a Karnaugh map correspond to the 1 
values of the truth function. 

3. When using a Karnaugh map to reduce a Boolean 
expression, the largest possible blocks should be 
determined first because they provide the greatest 
reduction. 


ON THE COMPUTER 


For Exercises 1—4, write a computer program that 
produces the desired output from the given input. 


1. Input: n and tables defining two binary operations 
and one unary operation on a set of n objects 
Output: Indication of whether the structure is a 
Boolean algebra 
Algorithm: Testing the 10 properties for all cases 


2. Input: n, tables defining a binary operation on each 
of two sets of n elements, and a table defining a 
bijection from one set to the other 
Output: Indication of whether the function is an 
isomorphism 
Algorithm: Testing all possible cases 


4. 


S. 


In the Quine-McCluskey procedure, terms that 
combine must have dashes in the same locations. 
In the Quine-McCluskey procedure, a check in 
some row of the comparison table indicates that 
the term for that row is an essential term that must 
appear in the reduced expression. 


. Input: n and a table representing a truth function 


with п arguments 
Output: Canonical sum-of-products Boolean ex- 
pression for the truth function 


. Input: n and a table representing a truth function 


with п arguments 

Output: Minimal sum-of-products Boolean expres- 
sion for the truth function 

Algorithm: Using the Quine-McCluskey procedure 


Modeling Arithmetic, Computation, 
and Languages 


| CHAPTER OBJECTIVES |— _@ 


Question: 


Chapter 


After studying this chapter, you will be able to: 


See how algebraic structures, finite-state machines, and Turing machines are 
all models of various kinds of computation, and how formal languages attempt 
to model natural languages. 

Recognize certain well-known group structures. 

Prove some properties about groups. 

Understand what it means for groups to be isomorphic. 

Be able to construct group codes for single-error correction of binary m-tuples. 
Be able to decode received n-tuples for a single-error correcting perfect code. 
Trace the operation of a given finite-state machine on an input string. 
Construct finite-state machines to recognize certain sets. 

For a given finite-state machine, find an equivalent machine with fewer states 
if one exists. 

Build a circuit for a finite-state machine. 

Trace the operation of a given Turing machine on an input tape. 

Construct Turing machines to perform certain recognition or computation tasks. 
Understand the Church- Turing thesis and what it implies for the Turing ma- 
chine as a model of computation. 

Be aware of the P = NP question regarding computational complexity. 
Given a grammar G, construct the derivation of strings in L(G). 

Understand the relationship between different classes of formal languages and 
different computational devices. 


Your team at Babel, Inc., is writing a compiler for a new programming language, cur- 
rently code-named ScrubOak after a tree outside your office window. During the first 
phase of compilation (called the lexical analysis phase) the compiler must break down 
statements into individual units called tokens. In particular, the compiler must be able 
to recognize identifiers in the language, which are strings of letters, and also recog- 
nize the two keywords in the language, which are if and in. 


How can the compiler recognize the individual tokens in a statement? 
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A mathematical structure, as discussed in Chapter 8, is a formal model intended 
to capture common properties or behavior found in different contexts. A structure 
consists of an abstract set of objects, together with operations on or relationships 
among those objects that obey certain rules. The Boolean algebra structure of 
Chapter 8 is a model of the properties and behavior common to both propositional 
logic and set theory. As a formal model, it is an abstract entity, an idea; proposi- 
tional logic and set theory are two instances, or realizations, of this idea. 

In this chapter we study other structures. In Section 9.1, algebraic structures 
are defined that model various types of arithmetic such as addition of integers and 
multiplication of positive real numbers. As an aside, section 9.2 looks at coding 
theory, an important application of one of these algebraic structures. This type of 
encoding (and decoding) is not about secrecy but about detecting and perhaps cor- 
recting bit errors in data transmission or storage. 

The “arithmetics” of Section 9.1 represent a limited form of computation, but 
we will see models of much broader forms of computation in Sections 9.3 and 9.4. 
Our initial choice for such a model, the finite-state machine, is a useful device for 
certain tasks, such as the lexical analysis task facing your team at Babel. But the 
finite-state machine is ultimately too limited to model computation in the general 
sense. For a model that captures the notion of computation in all its generality, we 
turn to the Turing machine. Using the Turing machine as a model of computation 
will reveal that some well-defined tasks are not computable at all. 

Finally, Section 9.5 discusses formal grammars and languages, which were 
developed as attempts to model natural languages such as English. While less 
than completely successful in this regard, formal grammars and languages do 
serve to model many constructs in programming languages and play an important 
role in compiler theory. 


SECTION 9.1 


ALGEBRAIC STRUCTURES 


Definitions and Examples 


We begin by analyzing a simple form of arithmetic, namely the addition of in- 
tegers. There is a set Z of objects (the integers) and a binary operation on those 
objects (addition). Recall from Section 4.1 that a binary operation on a set must be 
well defined (giving a unique answer whenever it is applied to any two members 
of the set) and that the set must be closed under the operation (the answer must 
be a member of the set). The notation [Z, + ] will denote the set together with the 
binary operation on that set. 
In [Z, + |, an equation such as 


2+(3+5)=(2+3)+5 


is true. On each side of the equation the integers remain in the same order, but the 
grouping of those integers, which indicates the order in which the additions are 
performed, changes. Changing the grouping has no effect on the answer. Another 
type of equation that holds in [Z, + ] is 


2+3=3+2 


| Section 9.1 Algebraic Structures 687 


Changing the order of the integers being added has no effect on the answer. 
Equations such as 


2+0=2 
0+3=3 
—125 + 0 = —125 


are also true. Adding zero to any integer does not change the value of that integer. 
Finally, equations such as 


are true; adding the negative of an integer to the integer gives 0 as a result. 
These equations represent four properties that occur so often they each have 
a name. 


* DEFINITIONS PROPERTIES OF BINARY OPERATIONS 
Let S be a set and let · denote a binary operation on S. (Here · does not necessarily 
denote multiplication but simply any binary operation.) 


1. The operation · is associative if 


VAW: Qi z) = (х) z] 


Associativity allows us to write x: y: z without using parentheses because 
grouping does not matter. 
2. The operation · is commutative if 


(Vx)(Vy)Gx * y = ух) 
3. [S, >] has an identity element if 
(3)(Vx)(x:i = i:x = х) 


4. If [S,-] has an identity element i, then each element in S has an inverse 
with respect to · if 


з ЕЗ x45 


In е statements of the properties, the universal quantifiers range over the 
set S; if the associative property holds, the equation x (y: z) = (x-y): z is true 
for any x, y, z € S, and similarly for the commutative property. The existential 
quantifier also applies to the set 5, so an identity element i, if it exists, must be an 
element of S, and an inverse element х !, if it exists, must be an element of S. Note 
the order of the quantifiers: In the definition of an identity, the existential quanti- 
fier comes first—there must be one identity element і that satisfies the equation 
x:i— іх = x for every x in S, just like the integer 0 in [Z, + ]. In the definition 
of the inverse element, the existential quantifier comes second—for each x, there 
is an x !, and if x is changed, then х ' can change, just like the inverse of 2 in 
[Z, + ]is —2 and the inverse of 5 is —5. If there is no identity element, then it does 
not make sense to talk about inverse elements. 
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* DEFINITIONS GROUP, COMMUTATIVE GROUP 
[S,-] is a group if S is a nonempty set and · is a binary operation on S such that 


1. + is associative. 
2. an identity element exists (in S). 
3. each element in S has an inverse (in S) with respect to - . 


A group in which the operation : is commutative is called a commutative 
group. 


Once again, the dot in the definitions is a generic symbol representing a binary 
operation. In any specific case, the particular binary operation has to be defined. 
If the operation is addition, for example, then the + sign replaces the generic 
symbol, as in [Z, + ]. As an analogy with programming, we can think of the generic 
symbol as a formal parameter to be replaced by an actual argument—the specific 
operation—when its value becomes known. If it is clear what the binary operation 
is, we may refer to “the group S" rather than “the group [S, -]." 

From the discussion, it should be clear that [Z, + ] is a commutative group, 
with an identity element of 0. The idea of a group would not be useful if there 
were not a number of other instances. (Again as an analogy with programming, 
one can think ofa group as an abstract data type—a pattern—with many possible 
instances of that data type.) 


Let R^ denote the positive real numbers, and let - denote real-number multipli- 
cation, which is a binary operation on R*. Then [R *,-] is a commutative group. 
Multiplication is associative and commutative. The positive real number 1 serves 
as an identity because 

ее [| = По =x 


for every positive real number x. Every positive real number x has an inverse with 
respect to multiplication, namely the positive real number 1/x, because 


= eee — 1 ө 


The set in Example 1 is limited to the positive real numbers. Is [R, :] a commutative group? 
Why or why not? 


PRACTICE 1 


Let M,(Z) denote the set of 2 X 2 matrices with integer entries, and let + denote 
matrix addition. Then + is a binary operation on МЬ() (note that closure holds). 
This is a commutative group because the integers are a commutative group, so 
each corner of the matrix behaves properly. For example, matrix addition is com- 
mutative because 
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bii T ау b g ^F i 
Lb», w 054 by» ap 220 


The matrix 


is an identity. The matrix 


is an inverse of the matrix 


=|| М 

— =5 ө 
REMINDER 
It's important to under- A structure called a monoid results from dropping the inverse property in the 
stand all this new termi- definition of a group; thus a monoid has an associative operation and an identity 
nology. You can't prove element, but in a monoid that is not also a group, at least one element has no in- 


that P — О if you don't 
know what you're starting 
with or where you want 
to go. 


verse. A semigroup results from dropping the identity property and the inverse 
property in the definition of a group; thus a semigroup has an associative opera- 
tion, but in a semigroup that is not also a monoid, no identity element exists. Many 
familiar forms of arithmetic are instances of semigroups, monoids, and groups. 


Consider [Mx(Z), :] where - denotes matrix multiplication. Closure holds. It can be 
shown (Exercise 2) that matrix multiplication is associative. The matrix 


0 1 
Serves as an identity because 


ЕЕЕ 


Thus [M,(Z), +] is at least a monoid. e 


PRACTICE 2 | Prove that [M;(Z), :] is not a commutative monoid. 


PRACTICE 3 | Prove that [M;(Z), -] is not a group. 
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Although the requirements for a structure to be a semigroup are relatively 
modest, not every arithmetic structure qualifies. 


PRACTICE 4 | Prove that [Z, — |15 not a semigroup, where — denotes integer subtraction. - 
PRACTICE 5 | Let 5 be the set of noninteger rational numbers, and let denote multiplication. Is [S, -] 
a semigroup? = 


Each of the following is an instance of a commutative semigroup. You should be 
able to verify closure, associativity, and commutativity for each: 


[N, +], [N,-], [Q,-] [R*, +], [R, +] ә 


For any Boolean algebra [B, +, ', 0, 1, [B, +] and [B,-] are commutative 
semigroups. Therefore for any set S, [p(S), U] and [0(5), П] are commutative 
semigroups. e 


Because the requirements that must be satisfied in going from semigroup to 
monoid to group keep getting stiffer, we expect some examples to drop out, but 
those remaining should have richer and more interesting personalities. 


PRACTICE 6 | Which of the following semigroups are monoids? Name the identities. 


IN, +] [N,-] [Q,-] [R*, +] (К, +] [ots), U 11069), A] и 


РВАСТІСЕ 7 | Which of the monoids from the list in Practice 6 are groups? 


Next we look at a selection of other examples of semigroups, monoids, and 
groups where the elements are not just simple numbers or where the operations 
are less familiar. 


| EXAMPLE | An expression of the form 


ад. Ар Bg ағ SENE do 


where a; € R, = 0,1, ..., n, and n є N is a polynomial in x with real-number 
coefficients (or a polynomial in x over R.) For each i, a; is the coefficient of x’. 
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If i is the largest integer greater than 0 for which a; # 0, the polynomial is of 
degree i; if no such i exists, the polynomial is of zero degree. Terms with zero 
coefficients are generally not written. Thus, zx'—2/3x? + 5 is a polynomial of 
degree 4, and the constant polynomial 6 is of zero degree. The set ofall polynomials 
in x over R is denoted by R[x]. 

We define binary operations of + and · in R[x] to be the familiar operations 
of polynomial addition and multiplication. For polynomials f(x) and g(x) members 
of R[x], the products f(x) · g(x) and g(x) · х) are equal because the coefficients are 
real numbers, and we can use all the properties of real numbers under multiplica- 
tion and addition (properties such as commutativity and associativity). Similarly, 
for f(x), g(x), and h(x) members of R[x], ( (x) : g(x): A(x) = f(x) - (g(x) : A(X). 
The constant polynomial 1 is an identity because 1 - f(x) = f(x): 1 = f(x) for ev- 
ery f(x) є R[x]. Thus, [R[x],-] is a commutative monoid. It fails to be a group 
because only the nonzero constant polynomials have inverses. For example, there 
is no polynomial g(x) such that g(x) -x = x: g(x) = 1, so the polynomial x has no 
inverse. (Note that while x: (1/x) = 1, 1/х = х! is not a polynomial.) However, 
[R[x], +] is a commutative group. e 


PRACTICE 8 


а. For f(x), g(x), h(x) € R[x], write the equations saying that R[x] under + is commutative 
and associative. 
b. What is an identity element in [R[x], + ]? 
c. What is an inverse of 7x* — 2x? + 4 in [R[x], +]? a 


Polynomials play a special part in the history of group theory (the study of 
groups) because much research in group theory was prompted by the very practi- 
cal problem of solving polynomial equations of the form f(x) = 0, f(x) є R[x]. 
The quadratic formula provides an algorithm for finding solutions for every f(x) 
of degree 2, and the algorithm uses only the algebraic operations of addition, sub- 
traction, multiplication, division, and taking roots. Other such algorithms exist 
for polynomials of degrees 3 and 4. One of the highlights of abstract algebra is 
the proof that no algorithm using only these operations exists for every f(x) of 
degree 5. (Notice that this statement is much stronger than simply saying that no 
algorithm has yet been found; it says to stop looking for one.) 

The next example uses modular arithmetic. You may recall from Section 5.1 
(Example 15 and the subsequent discussion) that each computer has some limit 
on the size of the integers that it can store. Although we would like a computer 
to be able to exhibit the behavior of [Z, + |, the best we can obtain is some finite 
approximation. The approximation is achieved by performing addition modulo л. 
The “answer” to the computation x + y for x, y € Z is then either the actual value 
x + y if this value falls within the limit that can be stored or a remainder value 
obtained by doing modular arithmetic, which is equivalent to x + y under the 
equivalence relation of congruence modulo л. 


PRACTICE 9 
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Let Z; = {0, 1, 2, 3, 4} and define addition modulo 5, denoted by +5, on Z; by 
x +;y =r, where r is the remainder when x + y is divided by 5. In other 
words, x +5y = (x + y) mod 5. For example, 1 +;2 = 3 and 3 +;4 = 2. 
Multiplication modulo 5 is defined by ху = (х · y) mod5. Thus, 2%3 = 1 and 
344 = 2. Then [Z;, +] is a commutative group, апа [2;, -;] is a commutative 
monoid. ө 


a. Complete the following tables defining +; and-; on Zs. 


hol d I 3934 eb: We 
0 0 

1 3 1 

2 2 1 

3 3 2 
4 4 


b. What is an identity in [Z5, +5]? In [Z,, :;]? 
c. What is an inverse of 2 in [Z;, +5]? 
d. Which elements in [Z;, +; | have inverses? н 


PRACTICE 10 


As we did on Z;, we can define operations of addition modulo п and mul- 
tiplication modulo п on the set Z, = (0, 1, ..., n — 1} where n is any positive 
integer. Again [Z,, +,| is a commutative group and [Z,,*,] is a commutative 
monoid. (See Exercise 47 for the relationship between the group [Z,, +,,] and the 
set of equivalence classes under the binary relation of congruence modulo л.) 


a. Give the table for '; on Ze. 
b. Which elements in [Z,, | have inverses? B 


Notice that when we use a table to define an operation on a finite set, it 
is easy to check for commutativity by looking for symmetry around the main 
diagonal. It is also easy to find an identity element because its row looks like 
the top of the table and its column looks like the side. And it is easy to locate 
an inverse of an element. Look along the row until you find a column where the 
identity appears; then check to see that changing the order of the elements still 
gives the identity. However, associativity (or the lack of it) is not immediately 
apparent from the table. 
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ЭТТЕ Let Z5 be the set of all binary n-tuples with n = 1. We'll usually use a little short- 
hand and write, for example, 1101 instead of (1, 1, 0, 1). Using componentwise 


REMINDER addition modulo 2, +,. it's easy to see that the addition operation is associative 
[йиз аә Zette and commutative and that the n-tuple of all Os is an identity. Each n-tuple has an 
setis 10: 1) ы бе © inverse. Therefore [Z7, +,] is a commutative group. ө 


eration is addition modulo 
2. In the group [Z3, +2] for 
any n, the set is all binary 
n-tuples and the operation 
is componentwise addi- 
tion modulo 2. 


PRACTICE 11 | In the group [Z3, +,], what is 


а. 01101 +, 11011? 
b. —10100? и 


The next two examples give us algebraic structures where the elements аге 
functions, mappings from a domain to a codomain. 


Let A be a set and consider the set S of all functions f such that f: A > A. The 
binary operation is function composition, denoted by ° . Note that S is closed under 
о and that function composition is associative (see Practice 12). Thus [5, °] is a 
semigroup, called the semigroup of transformations on A. Actually [S, °] is a 
monoid because the identity function i, that takes each member of A to itself has 
the property that for any fe S, 


еме ее ү ө 


PRACTICE 12 | Prove that function composition on the set S just defined is associative. 


Again let A be a set and consider the set Sy of all bijections f such that f: A — A 
(permutations of A). Bijectiveness is preserved under function composition, func- 
tion composition is associative, the identity function i, is a permutation, and for 
any f € S4, the inverse function f^^! exists and is a permutation. Furthermore, 


UM Е 


Thus, [S,, °] is a group, called the group of permutations on A. ө 


If A = (1,2, ... , п} for some positive integer n, then S, is called the symmet- 
ric group of degree п and denoted by S,. Thus, 5;, for example, is the set of all 
permutations on (1, 2, 3). There are 3! — 6 such permutations, which we name 
as follows (using the cycle notation of Section 5.4): 


a =i œ, = (1, 2) оз = (1,3) 
Q4 = (2, 3) Qs = (1, 2, 3) Qs = (1, 3, 2) 


PRACTICE 13 
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Recall that the notation (1, 2), for example, means that 1 maps to 2, 2 maps to 1, 
and unnamed elements map to themselves. The composition (1, 2) ° (1, 3) is done 
from right to left, so 


By (1, 3) By (1,2) 


1233 
22521 
3512 


resulting in (1, 3, 2). Thus a, ° a; = (1, 2) ° (1, 3) = (1,3,2) = о. 


a. Complete the group table for [S,, °]. 


o O4 Ao ag Од as ag 


2 % 


b. Is [53, °] a commutative group? " 


[S,, °] is our first example of a noncommutative group (although [M;(Z), :] was a 
noncommutative monoid). 

The next example is very simple but particularly appropriate because it ap- 
pears in several areas of computer science, including formal language theory and 
automata theory. 


Let А be a finite set; its elements are called symbols and А itself is called an 
alphabet. A* denotes the set ofall finite-length strings, or words, over A. A* can be 
defined recursively (as in Example 6 in Chapter 3), where · denotes concatenation 
of strings: 

a. The empty string A (the string with no symbols) belongs to A*. 

b. Any single member of A belongs to A*. 

c. If x and у are strings in A*, so is x · y. 


Thus, if А = (a, b}, then abbaa, bbbbba, and a are all strings over A, and abbaa + a 
gives the string abbaaa. From the recursive definition, any string over А contains 
only a finite number of symbols. The number of symbols in a string is called its 
length. The empty string A is the only zero-length string. 
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The empty string A should not be confused with the empty set (Z; even if A 
itself is (Z, then А* = {A}. If A is nonempty, then whatever the size of A, A* is a 
denumerable (countably infinite) set. If A contains only one element, say А = {a}, 
then A, a, aa, aaa, ... , is an enumeration of A*. If A contains more than one ele- 
ment, then a lexicographical (alphabetical) ordering can be imposed on the ele- 
ments of A. An enumeration of A* is then obtained by counting the empty string 
first, then lexicographically ordering all strings of length 1 (there is a finite number 
of these), then lexicographically ordering all strings of length 2 (there is a finite 
number of these), and so forth. Note also that if 4 is nonempty, strings of arbitrary 
length can be found in A*. 

Concatenation is a binary operation on A*, and it is associative. The empty 
string А is an identity because for any string x € A*, 


xX°A=A-+x=x 
Therefore, [4*, -] is a monoid, called the free monoid generated by A. e 
PRACTICE 14 | For А = {а,Ь} 
a. Is [A*,-] a commutative monoid? 
b. Is [A*, -] a group? и 


REMINDER 


To prove that something is 
unique ... 


PRACTICE 15 | Justify the foregoing equality signs. 


Basic Results about Groups 


We will now prove some basic theorems about groups. There are hundreds of 
theorems about groups and many books devoted exclusively to group theory, so 
we are barely scratching the surface here. The results we will prove follow almost 
immediately from the definitions involved. 

By definition, a group [G,:] (or a monoid) has an identity element, and we 
have tried to be careful to refer to an identity element rather than the identity ele- 
ment. However, it is legal to say the identity because there is only one. To prove 
that the identity element is unique, suppose that 7, and i, are both identity ele- 
ments. Then 


Ф THEOREM 


Because i, = i», the identity element is unique. Thus, we have proved the fol- 
lowing theorem. 


ON THE UNIQUENESS OF THE IDENTITY IN A GROUP 
In any group (ог monoid) [G, >], the identity element i is unique. 


Each element x in a group [G, -] has an inverse element x '. Therefore, С contains 
many different inverse elements, but for each x, the inverse 1s unique. 
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THEOREM ON THE UNIQUENESS OF INVERSES IN A GROUP 
For each x in a group [С, ·], x ' is unique. 


PRACTICE 16 | Prove the preceding theorem. (Hint: Assume two inverses for x, namely y and z, and let 


i be the identity. Then y = y: i = у: (х:2) —---.) 


If x and y belong to a group [G, ·], then x · y belongs to С and must have an 
inverse element in G. Naturally, we expect that inverse to have some connection 
with x~! and у !, which we know exist in С. Wecanshowthat(x:y) = y !:x ^; 


thus the inverse of a product is the product of the inverses in reverse order. 


THEOREM ON THE INVERSE OF A PRODUCT 
For x and y members of a group [G,-], (x: y) = ух. 


Proof: We will show that y ! : x ! has the two properties required of (x: y) !. 
REMINDER | . - c4 Li 
Then, because inverses are unique, (y :x )mustbe(x:y) . 
If it walks like а duck ... 
QD ts exin uw 
ur аи 
Exe 


=i 
Similarly, (y ! :x !) + (x: y) = i. Notice how associativity and the meaning of i 
and inverses all come into play in this proof. End of Proof. 


PRACTICE 17 | Write 10 as 7 +,,3 and use the theorem on the inverse of a product to find (10)! in the 
group [Ziz +12]. 


We know that many familiar number systems such as [Z, +] and [R, +] are 
groups. We make use of group properties when we do arithmetic or algebra in 
these systems. In [Z, +], for example, if we see the equation x + 5 = y + 5, we 
conclude that x — y. We are making use of the right cancellation law, which, we 
will soon see, holds in any group. 


Ф DEFINITION CANCELLATION LAWS 
A set S with a binary operation - satisfies the right cancellation law if for 
X,y»zeSxz-y':zimplis x = y. It satisfies the left cancellation law if 
z:x —z-yimplies x = y. 
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Suppose that x, у, and z are members of a group |G, +] and that x : z = y» z. To 
conclude that x = y, we take advantage of z '. Thus, 


X'z—y'z 
implies 
боа eda 
xez’) =y rir) 
x'i-—y'i 
х=у 


Hence, G satisfies the right cancellation law. 


PRACTICE 18 | Show that any group [G, +] satisfies the left cancellation law. 


We have proved the following theorem. 


THEOREM ОМ CANCELLATION IN A GROUP 
Any group |G, :] satisfies the left and right cancellation laws. 


We know that | Ze, ';] is not a group. Here the equation 
4 © 2) = || © 2) 
holds, but of course 4 = 1. ө 


Again, working in [Z, + ], we would solve the equation 6 + x = 13 by add- 
ing —6 to both sides, producing a unique answer of x = (—6) + 13 = 7. The 
property of being able to solve linear equations for unique solutions holds in all 
groups. Consider the equation a: x = b in the group [G,-] where a and b belong 
to Gand x is to be found. Then x = a`! · b is an element of G satisfying the equa- 
tion. Should x, and x, both be solutions to the equation ax = b, then a x, = а · x; 
and, by left cancellation, x, = x;. Similarly, the unique solution to x: a = b is 
x-b-a. 


THEOREM ON SOLVING LINEAR EQUATIONS IN A GROUP 
Let a and b be any members of a group [G, ·]. Then the linear equations a: x = b 
and x: a = b have unique solutions in С. 


PRACTICE 19 | Solve the equation x +&3 = 1 in [Z;, +g]. 


The theorem on solving linear equations tells us something about tables for 
finite groups. As we look along row a of the group operation table, does element b 
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appear twice? If so, then the table says that there are two distinct elements x, and 
x; of the group such that a-x, = b and a-x, = b. But by the theorem on solving 
linear equations, this double occurrence cannot happen. Thus, a given element of 
a finite group appears at most once in a given row of the group table. However, 
to complete the row, each element must appear at least once. A similar result 
holds for columns. Therefore, in a group table, each element appears exactly once 
in each row and each column. This property alone, however, is not sufficient to 
insure that a table represents a group; the operation must also be associative (see 
Exercise 31). 


PRACTICE 20 | Assume that ° is an associative binary operation on 11, a, b, c, d ). Complete the fol- 


lowing table to define a group with identity 1, 


PRACTICE 21 


o 1 a b С а 

1 1 

а C d 1 

b С а 

С, а а 

а b c п 


If [G, -] is a group where С is finite with n elements, then л is said to be the order 
of the group, denoted by |G]. If G is an infinite set, the group is of infinite order. 


a. Name a commutative group of order 18. 
b. Name a noncommutative group of order 6. п 


More properties of groups appear in the exercises at the end of this section. 


Subgroups 


We know what groups are and we know what subsets are, so it should not be hard 
to guess what a subgroup is. However, we will look at an example before we give 
the definition. We know that [Z, + ] is a group. Now let A be any nonempty subset 
of Z. For any x and y in A, x and y are also in Z, so x + y exists and is unique. The 
set A “inherits” a well-defined operation, +, from [Z, + ]. The associativity prop- 
erty is also inherited, because for any х, y, z € A, it is also true that x, y,z є Z 
and the equation 


(x+y) +2= х + (у + 2) 


holds. Perhaps А under the inherited operation has all the structure of [Z, + | and 
is itself a group. Whether this is true depends on A. 
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Suppose that A = E, the set of even integers. E is closed under addition, E 
contains 0 (the identity element), and the inverse of every even integer (its nega- 
tive) is an even integer. [E, + ] is thus a group. But suppose that А = О, the set of 
odd integers. [O, + ] fails to be a group for several reasons. For one thing, it is not 
closed—adding two odd integers produces an even integer. (Closure depends on 
the set as well as the operation, so it is not an inherited property). For another, a 
subgroup must have an identity with respect to addition; 0 is the only integer that 
will serve, and 0 is not an odd integer. 


. DEFINITION SUBGROUP 
Let [ С, -] bea group and А C С. Then [4,-] is a subgroup of [G, -] if | 4, -] is 
itself a group. 


For [A,:] to be a group, it must have an identity element, which we'll denote 
by i,. Of course G also has an identity element, which we'll denote by ig. It turns 
out that і, = ig, but this equation does not follow from the uniqueness of a group 
identity because the element i,, as far as we know, may not be an identity for all of 
С, and we cannot yet say that ig is an element of A. However, i, = i4: i, because 
i, is the identity for [A,-], and i, = i, · ig because ig is the identity for [G, -]. Be- 
cause of the left cancellation law holding in the group [G, -], it follows that i, = ig. 

To test whether [4,-] is a subgroup of [G,-], we can assume the inherited 
properties of a well-defined operation and associativity, and we check for the three 
remaining properties required. 


* THEOREM ON SUBGROUPS 
For [С, -Ја group with identity i and A С С, [А, -] isa subgroup of [G, -] if it 
meets the following three tests: 


1. Ais closed under: . 
2. i € А. 
3. Every x € A has an inverse element in A. 


PRACTICE 22 | The definition of a group requires that the set be nonempty. In the theorem on sub- 
groups, why isn't there a specific test that A A (2? 


a. [Z, +] is a subgroup of the group [R, +]. Z is closed under addition, 0 є 7, 
and the negative of every integer is an integer. 
b. [{1,4}, 5] is a subgroup of the group [11, 2, 3, 4}, :;]. Closure holds: 


sii 4 
1]|1 4 
4 |4 1 


The identity 1 є (1,4), and 17! = 1,47! = 4). e 


PRACTICE 23 
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a. Show that [{0, 2, 4, 6}, +s] is a subgroup of the group [Z;, +s]. 
b. Show that [{1, 2, 4},-7] is a subgroup of the group [{1, 2, 3, 4, 5, 6}, -7]. и 


If [G, -] is a group with identity 7, then it is true that [ {i}, -] and [С, -] are sub- 
groups of [G,-]. These somewhat trivial subgroups of [G, +] are called improper 
subgroups. Any other subgroups of [С, +] are proper subgroups. 


PRACTICE 24 | Find all the proper subgroups of S5, the symmetric group of degree 3. (You can find 


them by looking at the group table; see Practice 13.) 


One point of confusing terminology: The set of a// bijections on a set A into it- 
self under function composition (like $5) is called the group of permutations on A, 
and any subgroup of this set (such as those in Practice 24) is called a permutation 
group. The distinction is that the group of permutations on a set A includes all bi- 
jections on A into itself, but a permutation group may not. Permutation groups are 
of particular importance, not only because they were the first groups to be studied, 
but also because they are the only groups if we consider isomorphic structures to 
be the same. We will see this result shortly. 

There is an interesting subgroup we can always find in the symmet- 
ric group S, for n — 1. We know that every member of S, can be written as a 
composition of cycles, but it is also true that each cycle can be written as the 
composition of cycles of length 2, called transpositions. In S}, for example, 
(5, 1, 7, 2, 3, 6) = (5,6) ° (5, 3) ° (5, 2) ° (5, 7) ° (5, 1). We can verify this by 
computing (5, 6) ° (5, 3) ° (5, 2) ° (5, 7) ° (5, 1). Working from right to left, 


125777! 
so | maps to 7. Similarly, 
7575552222 


so 7 maps to 2, and so on, resulting in (5, 1, 7, 2, 3, 6). It is also true that 
(5, 1, 7, 2, 3, 6) = (1, 5) ° (1, 6) ° (1,3) ° (1, 2) ° 2, 4) ° (1, 7) ° (4, 2). 

For any n> 1, the identity permutation i in 5, can be written as 
i = (a, b) ° (a, b) for any two elements a and b in the set {1,2,...,n}. This 
equation also shows that the inverse of the transposition (a, b) in S, is (a, Б). Now 
we borrow (without proof) one more fact: Even though there are various ways to 
write a cycle as the composition of transpositions, for a given cycle the number 
of transpositions will either always be even or always be odd. Consequently, we 
classify any permutation in S,, n > 1, as even or odd according to the number 
of transpositions in any representation of that permutation. For example, in 
$5, (5, 1, 7, 2, 3, 6) is odd. If we denote by A, the set of all even permutations in 
S, then A, determines a subgroup of [S, °]. The composition of even permutations 
produces an even permutation, and i € A,. If œ € A,, and о as a product of 
transpositions is а = œ; о 0° · · оо, then о! = ог оао · · о ат!. Each 
inverse of a transposition is a transposition, so о! is also even. 


PRACTICE 25 | Prove that f: A, — O,, given by f(a) = a ° (1, 2) is one-to-one and onto. 
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The order of the group [S,, °] (the number of elements) is n! What is the order 
ofthe subgroup [А, ° |? We might expect half the permutations in S, to be even and 
half to be odd. Indeed, this is the case. If we let O, denote the set of odd permuta- 
tions in 5, (which is not closed under function composition), then the mapping 
f: A, 2 О, defined by f(a) = a ° (1, 2) is a bijection. 


® THEOREM 


© THEOREM 


Because there is a bijection from A, onto O,, each set has the same number of 
elements. But 4, О, = Ø and A, ЈО, = S, so |A,| = |S,|/2 = n!/2. 


ON ALTERNATING GROUPS 
For n € N, n > 1, the set A, of even permutations determines a subgroup, called 
the alternating group, of [5,, °] of order п! /2. 


We have now seen several examples of subgroups of finite groups. In Example 
13b and Practice 23, there were three such examples, and the orders of the groups 
and subgroups were 


Group of order 4, subgroup of order 2 
Group of order 8, subgroup of order 4 
Group of order 6, subgroup of order 3 


The theorem on alternating groups says that a particular group of order п! has a 
subgroup of order п! /2. 

Based on these examples, one might conclude that subgroups are always half 
the size of the parent group. This conclusion is not always true, but there is a 
relationship between the size of a group and the size of a subgroup. This relation- 
ship is stated in Lagrange's theorem, proved by the great French mathematician 
Joseph-Louis Lagrange in 1771 (we will omit the proof here). 


LAGRANGE'S THEOREM 
The order of a subgroup of a finite group divides the order of the group. 


Lagrange's theorem helps us narrow down the possibilities for subgroups of 
a finite group. If |G| — 12, for example, we would not look for any subgroups of 
order 7 because 7 does not divide 12. Also, the fact that 6 divides 12 does not 
imply the existence of a subgroup of G of order 6. In fact, 4, is a group of order 
41/2 — 12, but it can be shown that 44 has no subgroups of order 6. Therefore the 
converse to Lagrange's theorem does not always hold. In certain cases the con- 
verse can be shown to be true—for example, in finite commutative groups (note 
that A, is not commutative). 

Finally, we consider subgroups of the group |Z, +]. For п any fixed element 
of N, the set nZ is defined as the set of all integral multiples ofn; nZ = {nz|z є Z}. 
Thus, for example, 37 = (0, + 3, + 6, + 9,... }. 


PRACTICE 26 | Show that for any n € №, [nZ, +] is a subgroup of [Z, +]. 
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* THEOREM 


* DEFINITION 


Not only is [nZ, + | a subgroup of [Z, + | for any fixed n, but sets of the form 
nZ are the only subgroups of [Z, + | To illustrate, let [S, +] be any subgroup of 
[Z, +]. If S = {0}, then S = OZ. If S # {0}, let m be a member of S, m = 0. 
Either m is positive or, ifm is negative, —m € S and —m is positive. The subgroup 
S, therefore, contains at least one positive integer. Let п be the smallest positive 
integer in S (which exists by the principle of well-ordering). We will now see that 
5 = nZ. 

First, since 0, n, and —n аге members of S and S is closed under +, nZ С S. 
To obtain inclusion in the other direction, let s є 5. Now we divide the integer 
s by the integer л to get an integer quotient q and an integer remainder r with 
0 = к< n. Thus, = ng + r. Solving for r,r = s + (—nq). But nq € S; there- 
fore —nq € S, and s € S, so by closure of S under +, r € S. If r is positive, we 
have a contradiction of the definition of n as the smallest positive number in S. 
Therefore, r = 0 and s = nq + r = nq. We now have 5 C nZ,and thus 5 = nZ, 
which completes the proof of the following theorem. 


ON SUBGROUPS OF [Z, +] 
Subgroups of the form [nZ, + | for n € № are the only subgroups of [Z, +]. 


Isomorphic Groups 


Suppose that [5, :] and [ T, + | are isomorphic groups; what would this mean? From 
the discussion of isomorphism in Section 8.1, isomorphic structures are the same 
except for relabeling. There must be a bijection from S to T that accomplishes the 
relabeling. This bijection must also preserve the effects of the binary operation; 
that 1s, it must be true that *operate and map" yields the same result as “тар and 
operate." The following definition is more precise. 


GROUP ISOMORPHISM 
Let [S,-] and [7, +] be groups. A mapping f: S > T is an isomorphism from 
[S,-] to [T, +] if 


1. the function fis a bijection. 
2. forall х,у € S, f(x: у) = f(x) + у). 


Property (2) is expressed by saying that f is a homomorphism. 


PRACTICE 27 | Illustrate the homomorphism property of the definition of group isomorphism by a 


commutative diagram. 


If isomorphic groups are really the same except for the relabeling accom- 
plished by the bijection, then we would expect that the identity of one group maps 


PRACTICE 28 
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to the identity of the other, that inverses map to inverses, and that if one group 
is commutative, so is the other. Indeed, we can prove that these expectations are 
correct. (The proofs do not make use of the one-to-one property of the isomor- 
phism, so an onto homomorphism also maps the identity to the identity, inverses 
to inverses, and preserves commutativity.) 

Suppose, then, that fis an isomorphism from the group [S,:] to the group 
[Т, +] and that i; and i; are the identities in the respective groups. Under the 
function f, i; maps to an element f(i;) in T. Let t be any element in T. Then, because 
fis an onto function, t = f(s) for some s Є S. It follows that 


füs) + t = fis) + f(s) 


= f(is:s) (because fis a homomorphism) 
= f(s) (because 1 is the identity in S) 
=f 
Therefore 
Similarly, 
t+ flis) —t 


The element f(is) acts like an identity element in |T, + ], and because the identity 
is unique, f(is) = ir. 


Prove that if fis an isomorphism from the group [S,:] to the group T, +], then for any 
s € S,f(s_') = —f(s) (іпуегѕеѕ map to inverses). (Hint: Show that f(s !) acts like the 
inverse of f(s).) 


PRACTICE 29 


Prove that if fis an isomorphism from the commutative group [5, +] to the group [7, +], 
then [T, + ] is a commutative group. 


[R*,-] and [R, + | are both groups. Let b be a positive real number, b # 1, and let 
f be the function from R* to R defined by 


f(x) = log,x 


Then f is an isomorphism. To prove it, we must show that f is a bijection 
(one-to-one and onto) and that fis a homomorphism (preserves the operation). 
We can show that fis onto: For r € R, b” e R* and f(b’) = log,b" = r. Also, f 
is one-to-one: If f(x) = f(x;), then log,x, = log,x,. Let p = log,x, = log, x. 
Then b" = x, and b? =x), so x, =X. Finally f is a homomorphism: For 
x, x, € К, Д x2) = logiG x2) = log, x, + 10вьх = f(x) + Д). Мое 
that 108,1 = 0, so f maps 1, the identity of [R*,-] to 0, the identity of [R, +]. 
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Also note that 
log,(1/x) = 108,1 — log,x = 0 — log,x = —log,x = —f(x) 
so f maps the inverse of x іп [R*,-] to the inverse of f(x) in [R, +]. Finally, both 


groups are commutative. ө 


Because the two groups in Example 14 are isomorphic, each is the mirror 
image of the other, and each can be used to simulate a computation in the other. 
Suppose, for example, that b = 2. Then [R, +] can be used to simulate the com- 
putation 64 · 512 in [R*,-]. First, map from R* to К: 


f(64) = log, 64 = 6 
f(512) = log, 512 = 9 


Now in [R, +] perform the computation 
6+9 = 15 
Finally, use f^! to map back to R*: 
f 15) = 2" = 32,768 
(In the age Bc—before calculators and computers—large numbers were multi- 
plied by using tables of common logarithms, where b = 10, to convert a multipli- 
cation problem to an addition problem, as addition is less prone to human error.) 


Either of two isomorphic groups can always simulate computations in the other, 
just as in Example 14. 


Consider the two groups [5, :] and [7, + ] as defined by the following tables: 


O1mwmÍdo!|nm 
о | ел | № ao 
о | о (ол Ко 
is llea |] se 
S| |ә 
= as. | 25 = 
= || S|] rs || os 


Both are groups of order 3, so an isomorphism is certainly possible. If fis to be 
an isomorphism, it must map i, to iz. Looking at the operation tables, is = 5 and 
іт = 0, so let f(5) = 0. As a guess, let /(2) = 1 and f(9) = 4. Now let's reorga- 
nize the [ 7, + | table: 


ој = || 4 


ор- |+ 
ь|о|-|о 
зо | 
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This table contains exactly the same data that were in the original T table; the rows 
and columns have just been shuffled. Written in this form, it’s clear that the T table 
is just a relabeling (using f) of the S table, so fis indeed an isomorphism. Note that 
inverses map to inverses: 


f27') = 9) = 4and -f(2) = -1=4 
f(O) = f2) = 1 апа -f(9) = —4 = 1 


And we can simulate the computation 9-2 = 5 in 5 by mapping to T, applying 
the + operation, and mapping back to S: 


O= 402) = 1 
4+1=0 
f^W0) = 5, which is 9-2 ә 


| EXAMPLE 16 | Let f: M,(Z) — М2) be given by 


Then 


EE 


soa—cec-gb-f,andd-h,or 


To show that fis onto, let 


ei d 
E 4 є M,Z) 


[aem m д 4) 1а a 


Then 
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Also, fis a homomorphism from [M,(Z), + ] to [M;(Z), + | because 
СЕ end 
c d g Ah (arg ear la Dari @ ar ip 
Dae e gl a b e f 
EE 


The function fis therefore an isomorphism from [M,(Z), + ] to [M,(Z), +]. e 


PRACTICE 30 | Let 5Z = {5z|z є Z}. Then [5Z, + ]isa group. Showthat f: Z — 52 givenby f(x) = 5x 


is an isomorphism from [Z, + ] to [5Z, + ]. 


If f is an isomorphism from [S,-] to [T, +], then f~' exists and is a bijec- 
tion. Further, f~! is also a homomorphism, this time from Т to S. To see this, let 
t, and h belong to T and consider / (t, + t). Because t;, t; € T and f is onto, 
ti = f(s,) and t, = f(s;) for some s, and s; in S. Thus, 


f (t 5)-f QG) + Д) 
= f (Fsi 52)) 
= (f ef (8 - s2) 
= 51°$2 


=f") Sb) 


Therefore we can speak of S and T аз being simply isomorphic, denoted by S = Т, 
without having to specify that the isomorphism is from S to 7 or vice versa. 

Checking whether a given function is an isomorphism from S to T, as in 
Practice 30, is not hard. Deciding whether S and 7 are isomorphic may be harder. 
To prove that they are isomorphic, we must produce a function. To prove that they 
are not isomorphic, we must show that no such function exists. Since we can’t try 
all possible functions, we use ideas such as the following: There is no one-to-one 
correspondence between S and T, S is commutative but T is not, and so on. 

We have noted that isomorphic groups are alike except for relabeling and 
that each can be used to simulate the computations in the other. Isomorphism of 
groups is really an equivalence relation, as Practice 31 shows; thus isomorphic 
groups belong to the same equivalence class. Thinking of isomorphic groups as 
"alike except for labeling" is consistent with the idea that elements in an equiva- 
lence class represent different names for the same thing. 


PRACTICE 31 


a. Let f: $ — T be an isomorphism from the group [S,-] to the group [7, +] and g: T — U be an 
isomorphism from [7, + | to the group [U, *]. Show that g ° f is an isomorphism from S to U. 

b. Let J bea collection of groups and define a binary relation p on J by Sp T < S = T. Show that 
p is an equivalence relation on J. и 
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We will finish this section by looking at some equivalence classes of groups 
under isomorphism. Often we pick out one member of an equivalence class and 
note that it is the typical member of that class and that all other groups in the class 
look just like it (with different names). 

A result concerning the nature of very small groups follows immediately 
from Exercise 24 at the end of this section. 


* THEOREM ON SMALL GROUPS 
Every group of order 2 is isomorphic to the group whose group table is 


1 a 
1 1 a 
ala 1 


Every group of order 3 is isomorphic to the group whose group table is 


1 
а 
b 


ao; |= | -= 
-—|С|% | 
$|—|o|c 


Every group of order 4 is isomorphic to one of the two groups whose group 
tables are 


о сә |= 

о СО |ә |=| -= 
+ јо | со 
о |а јо [о еы 
боа су | су 


We can also prove that any group is essentially a permutation group. Suppose 
[G, :] is a group. We want to establish an isomorphism from С to a permutation 
group; each element g of G must be associated with a permutation a, on some 
set. In fact, the set will be С itself; for any x € С, we define a,(x) to be gx. We 
must show that {a,| g € С} forms a permutation group and that this permutation 
group is isomorphic to С. First we need to show that for any g € С, о, is indeed 
a permutation on С. From the definition a,(x) = g: x, itis clear that а: С > С, 
but it must be shown that a, is a bijection. 


PRACTICE 32 | Show that a, as just defined is a permutation (bijection) on С. 


Now we consider P = {a,| g € G} and show that P is a group under func- 
tion composition. P is nonempty because G is nonempty, and associativity always 
holds for function composition. We must show that P is closed and has an identity 
and that each a, € P has an inverse in Р. To show closure, let a, and a, € P. For 
any x € G,(a,* o4 (x) = ада) = a (hx) = g*(h-x) = (eI) x. Thus, 
Qo ° An = 0 4 Nd à € P. 
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PRACTICE 33 


a. Let 1 denote the identity of G. Show that a, is an identity for P under function composition. 
b. For a, € P, а, € Р; show that a, = (aj). и 


We now know that [P, °] is a permutation group, and it only remains to show 
that the function f: С > P given by f(g) = a, is an isomorphism. Clearly, fis an 
onto function. 


PRACTICE 34 | Show that f: С — P defined by f(g) = a, is 


a. one-to-one. 
b. a homomorphism. " 


We have now proved the following theorem, first stated and proved by the 
English mathematician Arthur Cayley in the mid-1800s. 


* THEOREM CAYLEY’S THEOREM 
Every group is isomorphic to a permutation group. 


SECTION 9.1 REVIEW 


TECHNIQUES * The order of a subgroup of a finite group divides 
the order of the group. 

* The only subgroups of the group [Z, + ] are of the 
or group structure. form [nZ, +], where nZ is the set of all integral 

* Test whether a given subset ofa group is a subgroup. multiples of a fixed n € N. 

* Test whether a given function from one group to ° If is an isomorphism from one group to another, 


Ù Test whether a given set and operation have the 
properties necessary to form a semigroup, monoid, 


another is an isomorphism. f maps the identity to the identity and inverses to 

* Decide whether two groups are isomorphic. inverses, and it preserves commutativity. 
* [f Sand Tare isomorphic groups, they are identical 
MAIN IDEAS except for relabeling, and each simulates any 


computation in the other. 

* [somorphism is an equivalence relation on groups. 

* To within an isomorphism, there is only one group 
of order 2, one group of order 3, and two groups of 
order 4. 

* Every group is essentially a permutation group. 


* Many elementary arithmetic systems are instances 
of a semigroup, monoid, or group structure. 

* [n any group structure, the identity and inverse 
elements are unique, cancellation laws hold, and 
linear equations are solvable; these and other 
properties follow from the definitions involved. 

* Asubset of a group may itself be a group under the 
inherited operation. 


EXERCISES 9.1 
l. a. A binary operation: is defined on the set {a, Б, с, d) by the table on the left. Is - commutative? 
Is · associative? 


b. Let S = (p, q, r, s}. An associative binary operation - is partly defined on S by the table on the right. 
Complete the table to preserve associativity. Is - commutative? 
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a b [o d p q r S 
a a [o d a p p q r S 
b b [o a d q q r S 
C C a b d r p 
d d b a С $ $ q r 


2. Show that matrix multiplication on M,(Z) is associative. 


3. Each of the following cases defines a binary operation, denoted by -, on a given set. Which are associative? 
Which are commutative? 


x ifx is even 
.OnZ: x-y= 
Е be bU 


b. On N: x: y = (x + у) 
c On Rt: x-y = x* 
d. On Q: x-y = xy/2 
e. OnR*: x-y=1/a+y) 
4. Define binary operations on the set N that are 
a. commutative but not associative. 
b. associative but not commutative. 
c. neither associative nor commutative. 
d. both associative and commutative. 


For Exercises 5—7, determine whether the structures [S,:] are semigroups, monoids, groups, or none of these. 
Name the identity element in any monoid or group structure. 
5. а. S=N;x-y = min(x, y) 
b. S= R; x: y = (х + у) 
‚б = {а\2 |a e №; : = multiplication 


с 
d. S = {a + БУ | a,b e Z}: : = multiplication 
e. S = {а + bV2| a,b € Q, a and b not both 0}; -+ = multiplication 
f S= {1,-1,7, —i};- = multiplication (where ? = —1) 
6. d S = {1,2,4}; = -6 
b. S = {1, 2, 3, 5, 6, 10, 15, 30}; x+y = least common multiple of x and y 
c. S= N X N; (х,у) (о, ур) = G0. y») 
d. S = N X № (х,у) (х,у) = GG + x» ур) 
e. S = set of even integers;: = addition 
f. S = set of odd integers; - = addition 
7. a. S = set of all polynomials in R[x] of degree = 3;- = polynomial addition 
b. S = set of all polynomials in R[x] of degree = 3;- = polynomial multiplication 


з=, i 


а. 5 = {1, 2, 3, 45 = 5 


ze z}; : = matrix multiplication 


10. 


— 
— 


12. 


13. 
14. 
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e. S=R-{-l};x-y=xtyt+x 
f£ S= {ДАМ №; = function addition, that is, (f+ D) = f(x) + g(x) 


. Let A = (1,2). 


a. Describe the elements and write the table for the semigroup of transformations on A. 
b. Describe the elements and write the table for the group of permutations on A. 


. Given an equilateral triangle, six permutations can be performed on the triangle that will leave its image in the 


plane unchanged. Three of these permutations are clockwise rotations in the plane of 120°, 240°, and 360° 
about the center of the triangle; these permutations are denoted R,, А, and R5, respectively. The triangle 
can also be flipped about any of the axes 1, 2, and 3 (see the accompanying figure); these permutations are 
denoted F, F», and F}, respectively. During any of these permutations, the axes remain fixed in the plane. 
Composition of permutations is a binary operation on the set D, of all six permutations. For example, 
F, ° R, = F,. The set D, under composition is a group, called the group of symmetries of an equilateral 
triangle. Complete the group table below for [D;, °]. What is an identity element in [D;, °]? What is an 
inverse element for F}? For Ry? 


= R; Р Rs Fy F3 F3 


Fs Fa 


The set $3, the symmetric group of degree 3, is isomorphic to D3, the group of symmetries of an equilat- 
eral triangle (see Exercise 9). Find a bijection from the elements of 5; to the elements of D, that preserves 
the operation. (Hint: К, of D, may be considered a permutation in 5; sending 1 to 2, 2 to 3, and 3 to 1.) 


. In each case, decide whether the structure on the left is a subgroup of the group on the right. If not, why 


not? (Note that here S* denotes 5 — {0}.) 

a. [2;*, E [Z5, +5] 

b. [P, +]; [R[x], +] where P is the set of all polynomials in x over R of degree = 3 
c. [Z*,-]; [Q*,] 

d. [K, + 


In each case, decide whether the structure on the left is a subgroup of the group on the right. If not, 
why not? 


|; [R[x], +] where K is the set of all polynomials in x over R of degree = k for some fixed k 


У 


а. [A, °]; 15, °] where S is the set of all bijections оп № and А is the set of all bijections оп № 
mapping 3 to 3 


b. [Z, +]; (М2), +] 

c. [{0, 3; 6}, +s]; [Zs, +8] 

а. [4, +›];[75, *;] where A = {00000, 01111, 10101, 11010} 
Find all the distinct subgroups of [Z,5, +12]. 

a. Show that the subset 


Gat аз = (1, 4) ° (2, 3) 
a; = (1,2) » (3,4) оц = (1,3) ° (2, 4) 


forms a subgroup of the symmetric group $4. 
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b. Show that the subset 


с! a; = (1, 2) ° (3, 4) 
a, = (1, 2,3, 4) a = (1, 4) ° (2, 3) 
аз = (1, 3) ° (2,4) a, = (2, 4) 
a, = (1, 4, 3, 2) ag = (1,3) 


forms a subgroup of the symmetric group Sy. 
15. Find the elements of the alternating group 44. 
16. Let A = (p, q, r}. Then [A*, -] is the free monoid generated by A. 
a. What is ppqrp · qprr ? 


b. Let В = the set of all strings over А with an even number of q's. Then В С A. Prove that [B, -] is also 
a monoid. 


17. In each case, decide whether the given function is a homomorphism from the group on the left to the one 
on the right. Are any of the homomorphisms also isomorphisms? 


а. [Z, +], ZF fay 2 
b. [R, +], [R, +]; Ло) = Ixl 
c. [R*, -], [R*,-] (where R* denotes the set of nonzero real numbers); f(x) = |х| 


18. In each case, decide whether the given function is a homomorphism from the group on the left to the one 
on the right. Are any of the homomorphisms also isomorphisms? 


a. [R[x], +], [R, +]; fa т üt Tasit ayx + do) = a, Es an-ı Fated ао 


9 


- 1 if æ is an even permutation 
b. S И 9 И 2 , + 3 = . . . 
5,3 12+) Да) P if a is an odd permutation 
с. [Z X Z, +] where + denotes componentwise addition, [Z, + |; f(x, y) = x + 2y 


19. In each case, decide whether the given groups are isomorphic. If they are, produce an isomorphism 
function. If they are not, give a reason why they are not. 


a. [Z, +], 122, +] (where 127 = {12z|z e Z}) 
b. [Z,, +5], [5Z, +] 

c. [5Z, +], 122, +] 

d. [Ss, °], [Zo +6] 


20. In each case, decide whether the given groups are isomorphic. If they are, produce an isomorphism func- 
tion. If they are not, give a reason why they are not. 


a. {ах + aglai, а E R}, +], [C, +] 
b. [ +), DA o] 
c. [Z5, +›], D o] 
d. [Z3, +2], [Zs, +8] 
. Let M3(Z) be the set of all 2 X 2 matrices of the form 


2 


— 


where z e Z. 


a. Show that [М2(2), -] is a group, where · denotes matrix multiplication. 


22. 


23. 


24. 


25. 


26. 


27. 
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c 


. Let a function f. М(2) > Z be defined by 


(lh 1) = 


Prove that fis an isomorphism from [M*(Z), -] to [Z, +] 
. Use [Z, + ] to simulate the computation 


© 


in [M3(2), -]. 
. Use [M9(Z), -] to simulate the computation 2 + 3 in [Z, +]. 
a. Let S = (1, —1}. Show that [S,:] is a group where - denotes ordinary integer multiplication. 


Qa 


c 


. Let f be the function from the group [S,, °] to the group [S,-] given by 


1 ifa is even 


Да) = к 


Prove that fis a homomorphism. 

In any group |G, ·], show that 

ail-i 

b. (x )) ! = x for any x e G 

a. Show that any group of order 2 is commutative by constructing a group table on the set {1, a} with 1 
as the identity. 


b. Show that any group of order 3 is commutative by constructing a group table on the set {1, a, b} with 
1 as the identity. (You may assume associativity.) 

c. Show that any group of order 4 is commutative by constructing a group table on the set {1, a, b, c} with 
1 as the identity. (You may assume associativity.) There will be four such tables, but three of them are 
isomorphic because the elements have simply been relabeled from one to the other. Find these three 
groups and indicate the relabeling. Thus, there are two essentially different groups of order 4, and both 
of these are commutative. 

Let [G, -] be a group and let x, y є С. Define a relation p on G by xpy < g:x:g ! = y for some g € G. 

a. Prove that p is an equivalence relation on G. 

b. Prove that for each x € С, [x] = {x} if and only if G is commutative. 

For x a member of a group [G,-], we can define x" for any positive integer п by x! = x, x? = x : x, and 

x” = x" !- x for n > 2. Prove that in a finite group [G, :], for each x € С there is a positive integer k such 

that x* — i. 

Let [S,-] be a semigroup. An element i, € S is a left identity element if for all x € S, ij^ x = x. An 

element i, € S is a right identity element if for all x є S, x: ig = x. 

a. Prove that if a semigroup [5, -] has both a left identity element and a right identity element, then [5, -] 
is a monoid. 

b. Give an example of a finite semigroup with two left identities and no right identity. 


e 


. Give an example of a finite semigroup with two right identities and no left identity. 
d. Give an example of a semigroup with neither a right nor a left identity. 


28. 


29, 


30. 


3 


- 


32. 


33. 
34. 
35. 


36. 


3T: 


38. 


39. 


40. 


41. 


| Section 9.1 Algebraic Structures 713 


Let [S, -] be a monoid with identity i, and let x є S. An element x; ! in Sis a left inverse of x if xj! +x = i. 


An element xz! in S is a right inverse of x if x- x! = i. 
R S R 


a. Prove that if every element in a monoid [5, :] has both a left inverse and a right inverse, then [S,:] is a 
group. 
b. Let S be the set of all functions f'such that f: N — N. Then S under function composition is a monoid. 
Define a function f € S by f(x) = 2x, x є №. Then define a function g є 5 by 
x/2 ifx € N, x even 
g(x) = 
1 ifx € N, x odd 
Prove that g is a left inverse for f. Also prove that fhas no right inverse. 
Let [5, -] be a semigroup having a left identity i; (see Exercise 27) and the property that for every x € S, 
x has a left inverse y such that y : x = i,. Prove that [S,-] is a group. (Hint: y also has a left inverse in S.) 
An element of a semigroup [5, :] is idempotent if x: x = x. Prove that a group contains exactly one 
idempotent element. 


. Prove that if | S, -] is a semigroup in which the linear equations a: x = b and x: a = bare solvable for any 


a, b € S, then [S,-] is a group. (Hint: Use Exercise 29.) 


Prove that a finite semigroup that satisfies the left and right cancellation laws is a group. (Hint: Use 
Exercise 29.) 


Prove that a group [G,:] is commutative if and only if (x : y = x? : y? for each x, y є G. 
Prove that a group [G,:] in which x - x = i foreach x € G is commutative. 


Let [G,-] bea commutative group with identity 7. For a fixed positive integer k, let B, = (x|x e G, x* = i}. 

Prove that [B,,-] is a subgroup of [G,:]. 

Let [G,:] be a commutative group with subgroups [S,-] and [T,:]. Let ST = (s: t|s є S, є T}. Prove that 

[ST,:] is a subgroup of [G,-]. 

a. Let [G,:] be a group and let [S,:] and [7,-] be subgroups of [G,-]. Prove that [S  T,-] is a subgroup 
of [G,°]. 

b. Will [S U 7,-] be a subgroup of [G,-]? Prove or give a counterexample. 

For any group [G,-], the center of the group is А = {x e G|x: g = gx forall g € С). 

a. Prove that [4,:] is a subgroup of [G,°] 

b. Find the center of the group of symmetries of an equilateral triangle, [D3, °] (see Exercise 9). 

c. Prove that G is commutative if and only if G = A. 

d. Let x and y be members of G with x : y ! € A. Prove that x: y = y x. 

a. Let S, denote the group of permutations on a set A, and let a be a specific element of A. Prove that the 

set H, of all permutations in S, that map a to a forms a subgroup of $4. 

. If A has n elements, what is |H,|? 

a. Let [G,:] be a group and А C С, А =# Ø. Prove that [A,-] is a subgroup of [С,:] if for each 
x,y E A,x:y ! € A. This subgroup test is sometimes more convenient to use than the theorem on 
subgroups. 

b. Use the test of part (a) to work Exercise 35. 

a. Let [G,:] be any group with identity i. For a fixed a є С, а? denotes i and a " means (a") !. Let 
A = {а |с є Z). Prove that [A4,:] is a subgroup of G. (Hint: Use Exercise 40.) 

b. The group [С,:] is a cyclic group if for some а € G,A = (az € Z} is the entire group С. In this 
case, a is a generator of [G,:]. For example, 1 is a generator of the group [Z, + |; remember that 
the operation is addition. Thus, 1 = 0,1! = 1,12 21-122, = 1 +1 +1 = 3... ; 17! = 
(0): = =;12 = (1) = –2;17° = (17)! = —3,.... Every integer can be written as an integral 


c 


42. 
43. 


44. 


45. 


46. 


47 


| 714 Modeling Arithmetic, Computation, and Languages 


“power” of 1, and [Z, +] is cyclic with generator 1. Prove that the group [Z,, +5] is cyclic with 
generator 2. 


c. Prove that 5 is also a generator of the cyclic group [Z;, +7]. 
d. Prove that 3 is a generator of the cyclic group [24, +4]. 
Let [G,:] bea cyclic group with generator a (see Exercise 41). Show that С is commutative. 


a. Let [5, :| be a semigroup. An isomorphism from S to S is called an automorphism on S. Let Aut(S) be 
the set of all automorphisms on S, and prove that Aut(S) is a group under function composition. 

b. For the group [Z,, +4], find the set of automorphisms and show its group table under ° . 

Let [G,:] be а commutative group with identity i. Prove that the function f. С — С given by f(x) = х! 

is an isomorphism. 


Let f be a homomorphism from a group С onto a group Н. Show that fis an isomorphism if and only if 
the only element of G that is mapped to the identity of H 1s the identity of С. 


Let [G,:] be a group and g a fixed element of G. Define f. С — G by Дх) = g:x:g ! for any x e G. 
Prove that fis an isomorphism from С to С. 


. а. Consider the equivalence relation on the integers of congruence modulo n defined in Section 5.1. If 
n = 5, there are 5 equivalence classes: 


[0] = (..., –10, —5, 0, 5, 10,...} 
[1] = {..., 7-9, -4, 1,6, 11,...} 
[2] = {..., —8, -3,2, 7, 12,...} 
[3] = {..., -7, —2, 3, 8, 13,...} 
[4] = {..., 76, 1,4,9,14,...) 


Let E; = {[0], [1], [2], [3], [4]}. An operation + is defined on Е; by 
[x] + [у] = [x + y] 


For example, [2] + [4] = [2 + 4] = [6] = [1] (recall that an equivalence class can be named by any 
of its elements). Prove that [E;, + ] is a commutative group. 

b. [Z;, +5] is a commutative group with elements (0, 1, 2, 3, 4} (see Example 7). Prove that [Z;, +5] is 
isomorphic to the group [Е;, +]. 

c. Results a and b hold for any value of n. In the group £4, what is the inverse of [10]? What is the preim- 
age of [21] under the isomorphism from Z,,4to Е? 


SECTION 9.2 | CODING THEORY 


Introduction 


We talked about cryptographic codes (codes for secrecy) in Chapter 5. The codes 
we will talk about in this section are designed not to keep data secret but to cope 
with degraded data. Data can degrade over time in a storage device or can be 
corrupted over space, that is, during a transmission from one site to another over 
some medium. Bits are changed from 0 to 1 or vice versa through interference 
(“noise”), hardware failures, media damage, and so forth. The goal is to be able to 
detect, perhaps even to correct, such errors. 


Section 9.2 Coding Theory 715 


As an analogy, consider voice transmission over a poor-quality cell phone 
connection. The speaker says the black hat but the receiver hears the black cat. 
Because the received message makes sense and could have been the transmitted 
message, there is no way to detect that an error has occurred. An alternative is to 
encode the message to be transmitted by repeating it. Thus the code word for the 
black hat would be the black hat the black hat, and the code word for the black cat 
would be the black cat the black cat. A received message of the black hat the black 
cat would alert the receiver that an error has occurred in transmission. However, 
the received message is equally close to either of the two code words, so there is no 
way to guess the correct code word. Note that two errors could still go undetected. 

Now let’s make the code word three copies of the message: the black hat the 
black hat the black hat and the black cat the black cat the black cat. A received 
message of the black hat the black cat the black hat would signal that either one or 
two errors has occurred. If we assume it to be more likely that only one error has 
occurred, we decode the message to the closest code word, the black hat the black 
hat the black hat. This process is called maximum likelihood decoding and gives 
us the correct code word for a received message where no more than one error has 
occurred. Because we can detect up to two errors and correct the effects of one 
error, we have designed a double-error detecting, single-error correcting code. 

Here we’ve seen three ideas central to coding theory: redundancy in coding, 
maximum-likelihood decoding, and distance between code words. Redundancy 
uses additional bandwidth or storage space, and it also increases exposure to er- 
rors. Yet, as we’ve seen from our little example, it also increases the capability for 
detecting and perhaps correcting such errors. You are probably familiar with the 
idea of a parity bit, where at the end of a binary string one extra bit is added so 
that the total number of 15 in the string, including the parity bit, is even (an even 
parity bit scheme). A single bit error is detectable because it would result in an odd 
number of 15. But there's no way to tell which bit is in error. An even number of 
errors is undetectable, and any odd number of errors is indistinguishable from a 
single error. This code is single-error detecting. 

The codes we will examine are generalizations of the parity-bit code. 
A parity-bit code adds one bit to the end of an m-tuple message to turn it into an 
n-tuple code word where n = т + 1. Our code words will add additional bits to 
the end of an m-tuple message to create an n-tuple code word where the additional 
n — m bits are all special sorts of parity bits. These codes also rely on further 
results from group theory (indeed they are called group codes), so we need some 
additional background first. 


Background: Homomorphisms and Cosets 


According to the definition, for f to be an isomorphism it must be both a bijection 
and a homomorphism. 


ИШЕТ Consider the following functions from Z to Z: 


CDU 
g(x) = х +1 


The function f is a homomorphism from the group [Z, +] to the group[Z, + ] 
because f(x + у) = 0 = 0 + 0 = f(x) + f(y). However, f is not a bijection, 


PRACTICE 35 | Prove that р is an equivalence relation оп С. 
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so it is not an isomorphism. The function g is a bijection because g(x) = g(y) 
implies x + 1 = у + 1, or x = y, so g is one-to-one; g is also onto because for 
any z E Z,z 1є Z and g(z — 1) = z. But g is not a homomorphism because 
g(x + y) = (x + y) +1 (x + 1) + (y + 1) = gx) + g(y). Hence g is not an 
isomorphism. ө 


What сап be said about functions from a group to a group that аге homomor- 
phisms but not isomorphisms? More specifically, let [G,-] be a group with identity 
ic, |H, + ] be a group with identity i,,, and f be a homomorphism, f: С — Н. The 
range of f, (С), is a subset of H (remember that f might not be an onto function, so 
it isn't necessarily true that С) = H). In fact, f(G) is actually a subgroup of Н. 
It is easy enough to prove the three properties needed for /(G) to be a subgroup, 
and the proof will show that f(ig) = i, and —f(x) = f(x !) (Exercise 1). We can 
define a binary relation p on G by 


xpyef(x) = ҒО) 


апа р is an equivalence relation. 


REMINDER 


Given a homomorphism f 
from [G, -] to [H, +], the 
range f(G) is a subgroup 
of [H, +] and the kernel K 
is a subgroup of [G, >]. 


EXAMPLE 18 


PRACTICE 36 


If fis a one-to-one function, x p y would mean that f(x) = f(y) and therefore 
x = y. The equivalence classes formed by p would then be trivial, each containing 
only a single element of G. But in general, because fis not a bijection, it might not 
be a one-to-one function. Let [ig] be the equivalence class determined by p that 
contains the identity of С. Because /(1с) = im lig] = (x € G| f(x) = ij). This 
set is called the kernel of the homomorphism f. 

The kernel K is a subset of G, and in addition it is a subgroup of G. We need 
to prove the three properties necessary for a subgroup: 


1. К is closed: Let x and y be elements in К. Then f(x) = f(y) = iy. And 
/(х у) = f(x) + Ду) = ig + ig = ір, so ху isin К. 

2. ig belongs to К: Yes, because f(ig) = їн. 

3. Let x be an element іп К: Then f(x) = ig. Also f(x !) = —f(x) = 
—iy = igsox lis in K. 


Let R* = R — {0} and let R^ be the set of all positive real numbers. Then 
the function f defined by f(x) = |x| is a homomorphism from the group [R*, -] 
to the group| R*,-]. The kernel К of f is {x € R* | f(x) = |x| = 1}. Therefore 
Ve I ө 


The function f defined by f(x) = x 4 1 is а homomorphism from the group [Z, +] to the 
group [Z;, +]. Find the kernel К. 


For our discussion of cosets, we begin with a definition. 
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DEFINITION COSET 
Let [S, +] be a subgroup of a group [G, + ]. Then for x € С, sets of the form 
x + 8 = {x + s|s є S} are called left cosets of S in G. Sets of the form 
S + x = {s + x|s є S} are called right cosets of S in G. 


Of course, if G is commutative, then left cosets and right cosets are identical. 


Let S = (0,2, 4, 6}. Then [S,+,] is a subgroup of [2,,+,]. The left coset 
5 +,5 = {5 +,0,5 +,2,5 +, 4,5 +, 6} = {5, 7, 1, 3}. ө 


PRACTICE 37 


a. Verify that 5 +, 5 = 1 +; S = 3 +; 5 = 7 +; S. 
b. Compute the left coset 2 +, S. What аге its other names? и 


For [5, + ] any subgroup of a group [G, + ], we can define a binary relation р 
on G as follows: 


x py < y belongs to x + S, x’s left coset of Sin G 


Then x py, which is equivalent to y belonging to x + S, means у can be written as 
x + s for some element s є 5. It turns out that р is an equivalence relation on С: 


Reflexive property: x px because x = x + ig. (Because S 15 a subgroup of С, the 
identity iç of G is an element of S.) 

Symmetric property: if x py then у = x + s for some s є S. Then, because S is 
a subgroup, —s is in S and x = y + (—s), which puts x in the same coset as 
y, that is y px. 

Transitive property: if xpy and ypz then y = x + s, for some s, Є S and 
2= у +s, for some s, Є 5. Therefore, z = у + s, = (х + si) + s, = 
x + (S, + 55), which means z is in x's left coset so x pz. 


We have proved the following result. 


THEOREM ON COSET PARTITIONS 
Let [5, + | be a subgroup of a group [G, + ]. Then the set of left cosets of S in G 
forms a partition of С. One of these cosets is ig + S = S. 


This coset partition will play a key role in decoding our group codes. But first, 
we need to see how we actually create group codes. 


Generating Group Codes 


Suppose we know how to turn any m-bit binary message into an n-bit code word 
where m < n, and we also know how to reverse this process. There is then a set of 
n-bit code words, one for each m-bit binary string. An n-bit code word is transmit- 
ted and the received message is also a binary n-tuple. Two cases can occur: 
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1. The received n-tuple is identical to one of the code words. In this case, 
maximum-likelihood decoding will assume that no errors have occurred. 
This is actually based on another assumption, namely that the probability 
of errors occurring is very low, so that the probability of no errors is very 
high. Therefore the probability that a received code word is the same code 
word that was originally transmitted is more likely than that the transmit- 
ted code word was scrambled into a different code word. 

2. The received n-tuple does not match any code word. Maximum-likelihood 
decoding suggests that the received n-tuple should be decoded as the 
closest code word, as this would be the result of the fewest bit errors and 
therefore has the highest probability of giving the correct result. 


Keep in mind throughout this section that maximum-likelihood decoding 
does not guarantee 10096 accurate results; it guarantees only results with the high- 
est probability of being accurate. 

To determine the “closest” code word, we need to define what we mean by the 
distance between binary n-tuples. Hamming distance, defined here, is named for 
Richard W. Hamming, an American mathematician who pioneered the study of 
error-detecting and error-correcting codes in 1950. 


* DEFINITION HAMMING DISTANCE 
Let X and Y be binary n-tuples. The Hamming distance between X and Y, 
H(X, Y), is the number of components in which X and Y differ. The minimum 
distance of a code is the minimum Hamming distance between all possible pairs 
of distinct code words. 


PRACTICE 38 | For X = 01011 and Y = 11001, what is H(X, Y)? 


Each error that occurs in the transmission of a code word adds one unit to the 
Hamming distance between that code word and the received word. Suppose we 
picture the code words as specific binary n-tuples distinguished from the set S of 
all binary n-tuples, as in Figure 9.1. Suppose also that the minimum distance of 
the code is at least d + 1. Then any time a code word X is corrupted by d or fewer 
errors, it will be changed to an n-tuple X' that is not another code word, and the 
occurrence of errors can be detected. Conversely, if any combination of d or fewer 
errors can be detected, code words must be at least d + 1 apart. 


S 


Figure 9.1 
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Now suppose that the minimum distance of the code is at least 2d + 1. Then 
any time a code word X is corrupted by d or fewer errors, the received word X" 
will be such that H(X, X") = d but for any other code word Y, H(X', Y) > а + 1. 
Therefore X’ will be correctly decoded as X, the closest code word. Conversely, 
to correct any received word with d or fewer errors, the minimum distance of the 
code must be at least 2d + 1 so that neighborhoods of radius d around code words 
do not intersect (Figure 9.2). 


Figure 9.2 


As a result, we see that a code is d-error detecting if and only if its minimum 
distance is at least d + 1, and it is d-error correcting if and only its minimum 
distance is at least 2d + 1. As one would expect, it requires a stronger condition 
to correct errors than to merely detect them. 


Suppose a code has a minimum distance of 6. It can then detect any combination 
of = 5 errors. It can correct any combination of = 2 errors. If code words X and 
Y are such that H(X, Y) = 6, then there will be a word X' produced by 4 errors 
on X that will be incorrectly decoded as Y; there will be a word X" produced by 3 
errors on X that can be arbitrarily decoded correctly as X or incorrectly as Y. This 
code is double-error correcting, 5-error detecting. e 


The set of code words will be a subset of the set of all binary n-tuples, Z5. 
We want these code words to be sufficiently widely scattered in Z5 so that the 
minimum distance is large enough to allow some error correction. The minimum 
distance of the code is easy to compute if the code words form a subgroup of the 
group [Z5, +» |. In this case we have a group code; the identity is the n-tuple of all 
Os (we'll call this 0,), and each code word is its own inverse. Denote by W(X) the 
weight of a code word X, meaning the number of 15 it contains. 


* THEOREM ОМ MINIMUM DISTANCE OF GROUP CODE 
The minimum distance of a group code equals the minimum weight of all the 
nonzero code words. 


To prove this theorem, let d be the minimum distance of a group code; then 
there are two distinct code words X and Y with Н(Х, Y) = d. Because it's a group 
code, closure holds and X +, Y = Z is a code word. Z # 0 because X and Y аге 
distinct. Z will have 1s in exactly those components where Х and Y differ, so 
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W(Z) = Н(Х, Y) = d. Thus the minimum weight of the code is = d. If the mini- 
mum weight is < d, let M be a nonzero code word with WM) the minimum 
weight. Then (remember that 0, 15 a code word) H(M, 0,) = WM) < d, which 
contradicts the fact that d is the minimum distance of the code. Therefore the 
minimum distance equals the minimum weight. 


The set {00000, 01111, 10101, 11010} is a group code (you can check closure, 
identity, and inverses) in [Z3, +,]. The minimum distance of the code is 3, so it is 
а single-error correcting code (3 = 2 * 1 + 1). e 


АП well and good, but how can we produce subgroups of Z5 to use as code 
words, and how can we control the minimum distance of the code? Algebraic 
ideas again come to our rescue. Let Н be any n X r binary matrix with r < n. If 
X € Z5, we can perform the matrix multiplication X: H, where all additions are 
done modulo 2. The result of this multiplication 1s a binary r-tuple. 


|. EXAMPLE22 | Let r = 3, п = 5, and 


il i € 

~ i il 

biS © i 

| Q 0 

1 1 O0 

Then 
i i $ 
0 1 i 
(11101) 1 0 1|=(110) 

1 0 0 
il i € 


We can think of multiplication by H as a mapping from the group |Z5, + | to 
the group |Z5, +2]. Moreover, this mapping is a homomorphism (see Exercise 13 at 
the end of this section). We know that the kernel К of this homomorphism is a sub- 
group of |25, +2] and it consists of all those X in Z5 such that X: H = 0,, the zero 
of the group |Z5, +›|. We'll take К as the set of code words. Then we can easily 
determine the minimum weight (minimum distance) of the code simply by look- 
ing at H. If H has d distinct rows that add to 0, in |75, +2], say ij, ... , ij, we can 
choose an Xin Z5 having 15 exactly in the i, ... , ij, components. Then X: H = 0,, 
so that X 15 a code word, and W(X) = d. On the other hand, if X is a code word 
with W(X) = d and X has 1’s exactly in components 11, ... , iz,, then the equation 
X-H = 0, forces rows ij, ... , ij, Of H to sum to 0,. Therefore, the minimum 
weight of the code equals the minimum number of distinct rows of H that add to 0,. 
In particular, to produce a single-error correcting code, we must have minimum 
distance at least 3, so we would have to choose an H with no row consisting of all 
Os and no two rows that are alike (these would add to 0,). 


| Section 9.2 Coding Theory 721 


| EXAMPLE 23 | The code words of Example 21 were generated using the matrix Н where 


i @ I 
| Sere TU 
Н=|1 0 0 
0 1 0 
© @ 1 


H has no row of all 0s and no two rows that are alike, but rows 1, 2, and 4 add to 
(0,0,0). Again, we see that the minimum distance of this code is 3. e 


PRACTICE 39 | For each code word X of Example 21, verify that Х· Н = 03, where Н is given in 


Example 23. = 
From now on, we will assume that the matrix H has the form 
H= H а) 
1, 


where I, is the r X r identity matrix and B is an arbitrary (n — r) X r binary 
matrix. In computing the product X: Н for some X € Z7, we multiply elements 
of X by corresponding elements of the columns of H, and then sum modulo 2. For 
each column of H, the pattern of 1s in the column determines which components 
of X contribute to the sum. If the sum is to be 0 (as is true when X: H = 0,), then 
those selected components of X must sum to 0 and therefore must consist of an 
even number of 15. The I, portion of H has the effect that each column of H selects 
a distinct component from among the last r components of X. Each of the last r 
components of X therefore controls the even parity check for one of the r multipli- 
cations that are done. A matrix H that matches Equation (1) is called a canonical 
parity-check matrix. 

For X to be a code word, the first n — r components of X can be arbitrary, but 
the final r components will then be determined. The maximum number of code 
words is therefore the maximum number of ways to select binary (n — r)-tuples, 
or 2" ", Let m = n — r. We can code all members of Z7 in Z5 by leaving the first 
m components alone and then choosing the last r components so that the even 
parity check works for each column of Н. Such a code is called an (n, m) code. 
The first m components of a code word are the information bits, and the last r 
components are the check bits. 


The matrix H of Example 23 is a canonical parity-check matrix where n — 5,r — 3, 
and m — n — r — 2. H can thus generate 2" — 2? — 4 code words. The four 
members of 75 аге 00, 01, 10, and 11. Each can be coded as a member of 25 by 
keeping the first two digits and adding the appropriate check digits. To code 10, 
for instance, we have 
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ї @ 1 
Li a 
(10C,C,C;) 1 0 0 | = (000) 
DNO 
0 0 1 


Multiplication by the first column of Н gives 1:1 + 1: Су; therefore C, must 
equal | to give the sum value of 0. Similarly C; = 0 and С; = 1. We encode 10 as 
10101. d 


PRACTICE 40 | Use the encoding procedure of Example 24 to code 00, 01, and 11 in Z5. Compare the 
results with Example 21. 


For a given n X r canonical parity-check matrix Н, we know how to encode 
all of Z7 = Z5 " as a subgroup of Z5, and we also know how to determine from Н 
the minimum distance of the resulting code. Now let's turn the problem around. 
Suppose we want to encode Z;’ for some m as, say, a single-error correcting code. 
How big will the code words have to be (what is n) or, equivalently, how many 
check bits must be added (what is 7)? Once we know the dimensions of a parity- 
check matrix H, how can we find a canonical H that generates the code? We know 
that to produce a single-error correcting code, H must have no row consisting 
of all 0s and no two rows that are alike. That means that in the canonical form 


B 
Н = H , B cannot contain the row of all 0s nor can it contain any row with a 
single 1 because this would match one of the rows in І. The rows are r-tuples, 
so from the 2" possible r-tuples, we have to subtract those that cannot occur in B. 
В сап have at most 2”— 1 —r rows so m, the number ofrows in B, mustbe = 2”—1—. 
If m and r are such that 


т = 2'—1]—т 


the resulting code is called a perfect code. 


А (7, 4) code is a perfect code. Here n = 7, т = 4,r = n — m = 3, and 
4 = 2? — 1 — 3. The matrix Н will be a 7 X 3 matrix of the form 


[4 


where the m = 4 rows of B are all of the 3-tuples with at least two 15. e 


PRACTICE 41 


a. Write an H matrix for the (7, 4) perfect code. 
b. A (7, 4) code can encode all of 72 in 77. Using your Н from part (a), write the set of binary 
4-tuples that H encodes and write the code word for each one. п 
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Decoding Group Codes 


Now suppose that some encoding scheme has been used to encode all members of 
Z5 in 25. There will be 2” of these code words scattered among the 2" binary n- 
tuples. These code words are known to us (remember that secrecy is not the issue 
here). When an n-tuple X is received, we have a process for decoding: we decode 
X as the closest code word in terms of Hamming distance, making the assumption 
that the fewest errors have occurred. This process is not truly an algorithm, how- 
ever; there may not be a unique closest code word, and even if there is, we might 
still decode incorrectly if enough errors have occurred. 

Nonetheless, let's concentrate on how to find the closest code word(s) for a 
received word X. There is certainly a brute force approach; we can create an array 
of all 2" code words and when an X is received, we just compare X to each one 
in turn to find the closest code word. But wait—this requires storage for an expo- 
nential array size as well as an exponential sequential search process. Even for a 
relatively modest m-value of 32, 2? is a very large number. But, suppose our code 
is a group code generated by an n X r canonical parity-check matrix Н. Then we 
will ultimately be able to decode by searching an array of only 2" elements. In a 
single-error correcting code with m = 32, r can be as low as 6 (m = 2" — 1 — р), 
and 2° = 64 is an acceptable search array. 

Here's how this decoding works. Recall that the set of code words equals the 
kernel К of the homomorphism Н induces from [Z?, +,] to the group [Z5, +,]. 
We therefore know that К is a subgroup of [Z7, +,] and we also know that the set 
of left cosets of К in [77, +,] partitions the set Z7. Cosets have the form X +, К 
where X € Zz. Any given coset 


X+,K = {Х+,С,|С,є К} 


K has 2” elements, so the size of each coset is 2”. Because the set of left cosets 
partitions Z}, a set of size 2", there must be 2"/2" = 2" ^" = 2” distinct cosets. 

When a word X is received, X belongs to the coset X +, К. As we just noted, 
each element E; of this coset has the form X +, C; where C; is a code word. Both Е, 
and C; are binary n-tuples, so they are each self-inverse (— E; = E; and — C; = СЭ. 
Therefore the equation 


E; = X t4C; (2) 
can be written as 
X+,E; = С, (3) 


From (2) we can see that l's in Е, occur in exactly those components where 
X and C; differ. Thus the weight of E; equals the distance between X and C;, 
and the closest code word to X is the one for which the corresponding Е; has 
minimum weight. To decode X, look for the element in the coset of X having 
minimum weight and, by (3), add that element to X. The result is the code word 
to which we decode X. The coset element having minimum weight is called 
the coset leader, and it may not be unique. If there are two “minimum-weight” 
elements in a given coset, one 1s chosen arbitrarily as the coset leader. This just 
means that no word in this particular coset can be accurately decoded because 
it has too many errors. 
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Summary: To decode a received n-tuple X, find the coset to which X belongs and 
add that coset's leader to X. 


Consider the code of Example 21. Here n = 5 and К = (00000, 01111, 10101, 
11010). Suppose the 5-tuple X — 11011 is received. Because the set of code 
words is so small, we can easily pick out the closest code word, which is 11010. 
We would decode X as 11010. (From Example 23 we know that the canonical 
matrix H for this code is 5 X 3, so r = 3. Knocking off the three check bits, the 
original information bits were 11.) 

Now let's try our decoding procedure. The elements of X's coset are 


11011 +, 00000 = 11011 
11011 +, 01111 = 10100 
11011 +, 10101 = 01110 
11011 +, 11010 = 00001 


The coset leader (the element of minimum weight in this coset) is clearly 00001. 
Adding this to X we get 


11011 +, 00001 = 11010 


and we decode_X to 11010, as before. e 


But we haven't really solved the efficiency problem. To find all the elements 
of X's coset requires adding all the code words to X, which still requires knowing 
all 2" code words. For a better approach, we need one new idea. 


© DEFINITION SYNDROME 
In the group code generated by an n X r parity-check matrix Н, for any X € Z5, 
the r-tuple X; Н is the syndrome of X. 


The syndrome is useful because of the following theorem. 


* THEOREM ОМ SYNDROMES AND COSETS 
Let H bean n X r parity-check matrix generating a group code K. Then for 
X, Y € Z}, X and Y are in the same left coset of K in |Z}, +,] if and only if X and 
Y have the same syndrome. 


Proof: Suppose X and Y are in the same left coset of К in [Z7, +,]. Then 
Y = X +, С; for some C, € K, and Y-H = (X+,C) -H = X-H t5C;: H (be- 
cause multiplication by Н is a homomorphism) = X: H +,0, (because C; is in 
the kernel of this homomorphism) = X: Н. Therefore Y: H = X-H and X and Y 
have the same syndrome. 

Now suppose that Y-H=X-H. Then Y-H+,X-H=0,, or 
(Y +,X) +H = 0,, which makes Y +,X a code word C; in K. If Y +,X = C, 
then Y = X +, C; and Y and X are in the same coset. End of proof. 
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PRACTICE 42 | Example 26 shows four members of one coset. The parity-check matrix that generated the 
code for this example is 


POs il 
JEN] | 
H-2!1 0 0 
0) 0 
QNM j| 
Compute the syndrome for each member of the coset. п 


Now suppose that we somehow have available a list of the 2” coset leaders. It 
is then easy to find which coset leader corresponds to X by making use of the fact 
from the previous theorem that X and the coset leader of X’s coset will have the 
same syndrome. 


In the (5, 2) code of Example 26, n = 5, m = 2, andr = 3. The size of each coset 
18 2" = 4, and there are 2” = 8 distinct cosets. Here 1s a list of the coset leaders 
and their corresponding syndromes. The syndromes were computed using the ma- 
trix H of Practice 42, and the coset leaders were found by brute force. 


Coset leaders Syndromes 
00000 000 
00001 001 
00010 010 
00011 011 
00100 100 
10000 101 
00110 110 
01000 111 


A received word of 10101 is decoded by computing its syndrome: 
(10101)H — 000. The coset leader sharing this same syndrome is 00000. The 
received word is decoded as 


10101 45 00000 — 10101 


The received word is a code word, so it is assumed that no errors have occurred. 

A received word of 11000 is decoded by computing its syndrome: 
(11000)H = 010.The coset leader sharing this same syndrome is 00010, so the 
received word is decoded as 


11000 +, 00010 = 11010 


assuming that a single error has occurred. 
A received word of 10011 has a syndrome of 110, so it can be decoded as 


10011 +,00110 = 10101 


But because its coset leader has weight 2, we might instead generate a flag noting that 
at least two errors have occurred and that decoding cannot be done with certainty. Ө 


| 726 Modeling Arithmetic, Computation, and Languages 


At this point we have traded one difficulty for another, namely, How do we 
find the list of 2” coset leaders? This isn’t always easy to do. In Example 27, the 
“brute force” involved writing all 2” = 32 binary 5-tuples, computing the 32 syn- 
dromes to group the 5-tuples into 8 cosets, and then reviewing the 4 members of 
each coset to find the coset leader. In two cosets, there was a tie for coset leader, 
and one value was picked arbitrarily in each case. 

But—if the code is a perfect single-error correcting code, then the coset lead- 
ers are easy to find. Recall that in a perfect code, m = 2” — 1 — r, and because 
r= п — m,itisalsotrue that n = 2” — 1. Then rows ofthe matrix Н are r-tuples 
that are the binary representations of the numbers 1, 2, ... , 2” — 1 (there's no 0, 
row in Н). The code word 0, is the coset leader corresponding to the syndrome 0,. 
Any other syndrome is a binary r-tuple representing a digit d, 1 S d = X — 1. 
The value d is also represented by a row of H, say, row q. The coset leader for this 
syndrome is the binary n-tuple with a 1 in component д and 0s elsewhere. In this 
case there is no ambiguity about the coset leader, and every received word is at 
most distance 1 from a code word. 


| EXAMPLE2O | 23 A (7, 4) code is a perfect code for which r = 3. One matrix generating such а 


code 15 


Il 
O Фо н но нн 
oF OF но н 
= © о н н но 


Given the syndrome 101, the row of Н that represents this binary number is 
row 2. The coset leader for this syndrome is therefore 0100000 (a 1 in component 2, 
Os elsewhere). You can check that (0100000)H = 101. The table of coset leaders 
and the corresponding syndromes is partially filled in here. 


Coset leaders Syndromes 
0000000 000 
001 
010 
011 
100 
0100000 101 
110 
111 


A received word of 1001100 has a syndrome of (1001100)H = 101, so it is de- 
coded as 


1001100 +,0100000 = 1101100 ө 
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SECTION 9.2 REVIEW 


TECHNIQUES 


Find the kernel of a given homomorphism from a 
group [G,-] to a group [H, +]. 


(0) Given a canonical n X r parity-check matrix Н, 


2 


write the set of binary m-tuples that Н encodes and 
write the code word for each one. 

Given m such that Z5' is to be encoded as a single- 
error correcting code, find a canonical parity-check 
matrix to generate the code. 

Given a canonical n X r parity-check matrix H for a 
perfect code, be able to decode a received n-tuple by 
computing its syndrome and finding its coset leader. 


MAIN IDEAS 


If f is a homomorphism from a group [G,:] to a 
group [H, +], then f(G) is a subgroup of [H, + | 
and the kernel K (the set of all elements in G map- 
ping to ін) is a subgroup of [G, ·]. 


EXERCISES 9.2 


If [S, +] is a subgroup of the group [G, +], then 
the set of left cosets of S in С forms a partition of G. 
The error-detecting and error-correcting capabili- 
ties of a binary code are functions of the minimum 
distance of the code. 

In a group code, the minimum distance is the mini- 
mum weight of the nonzero code words. 

A parity-check matrix H can be used to generate a 
group code, in which case the minimum distance of 
the code can be determined from H. 

A canonical n X r parity-check matrix provides an 
easy procedure to encode Zy in Z; where m = n — г. 
For a group code generated by an n X r parity- 
check matrix H, each word X in Z7 is decoded 
by using its syndrome to locate the coset in Z5 to 
which X belongs and adding the coset leader to X. 
If the code is a perfect code, the coset leader can be 
determined from H. 


1. Let f be a homomorphism from a group [G;] to a group [H, +]. Prove that [ /(С), + | is a subgroup of 


[H, +]. 


2. Let f be a homorphism from the group [Z, + | to the group [Z, +] given by f(x) = 2x. 


a. Verify that fis a homorphism. 
b. Is fan isomorphism? Prove or disprove. 
c. What is the subgroup [ f(Z), +] of [Z, +]? 


3. The function f defined by f(x) = x 42 is a homomorphism from [Z, +] to [Z,, +]. Find the kernel К. 
4. The function f defined by f(x) = x 44 is a homomorphism from [Z,5, + ,|to [Zs, +]. Find the kernel К. 
5. A function f: Z X Z  Z is defined by f(x, y) = x + y. 

a. Prove that fis a homomorphism from the group [Z X 7, +] (where + means componentwise addi- 


tion) to the group [Z, +]. 
b. Find the kernel K. 


6. Let F be the set of all functions Ё К > К. For fg € Е, let + be defined as follows: For 


xeR, (f+ 9) =f@ + gx). 
a. Prove that [F, + ] is a group. 


b. Let a € R. Define a function a: F > F by o( f) = f(a). Prove that o is a homorphism from [F, + ] to 


[R, +]. 
c. Find the kernel K. 


7. Let S = (0, 4, 8}. Then [S, +42] is a subgroup of [Z,,, +12]. Find the members of the left coset 7 +1, S. 


8. Let S = {i,(2, 3)). Then [S, °] is a subgroup of the symmetric group [53, °]. Find the members of the left 
coset (1, 2, 3) ° S and the right coset 5 ° (1, 2, 3). Explain this result. 
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9. Consider the canonical parity-check matrix 10. Consider the canonical parity-check matrix 


[« онно н 
[« нн о о н н 
=- о о н = LA 


a. Prove that the code generated by Н is single- 
error correcting. 


II 
oo OF rr Oo Oo н 
[« к^ © O O e oro 
_ ooo o FR m.e me m 


O o. oO. O ннн 


b. Write the set of binary m-tuples H encodes 
and write the code word for each one. a. Prove that the code generated by H is single- 
error correcting. 


b. Write the set of binary m-tuples H encodes 
and write the code word for each one. 


11. Give an example of a canonical parity-check matrix that will generate a single-error correcting code for 
the set of words in Z$. 
12. Give a canonical parity-check matrix for a single-error correcting (15, 11) code. 


13. Let H be an n X r binary matrix mapping |Z5, +2] to |Z5, +2] by the operation X: H for X in Z5. Prove 
that this function is a homomorphism by showing that for X, Y in Z3, (X +,Y)-H = X- H +, Y-H. 


14. Which of the following are perfect codes? Which are single-error correcting? 
а. (5, 3) b. (12, 7) c. (15, 11) 
15. Complete the coset leader/syndrome table of Example 28. 


16. Use your table from Exercise 15 and the matrix H from Example 28 to decode the following received 
words. 


a. 1010011 b. 0001110 c. 0101101 


For Exercises 17 and 18, use a canonical parity-check matrix for the (15, 11) perfect code (see Exercise 12) to 
decode the given received words. (Hint: Because this is a perfect code, you do not need to generate the entire 
coset leader/syndrome table to solve these problems.) 


17. 011000010111001 18. 110111001010011 


SECTION 9.3 | FINITE-STATE MACHINES 


The algebraic structures of Section 9.1 served as models for various simple arithmetic 
systems. However, we would surely agree that computation should go beyond mere 
arithmetic. We would like a model that captures the general nature of computation. 
Perhaps looking at a simplified version of a modern digital computer would be a start. 

A computer stores information internally in binary form. At any instant, the 
computer contains certain information, so its internal storage is set in certain patterns 
of binary digits, which we'll call the state of the computer at that instant. Because a 
computer contains a finite amount of storage, there is a finite (although large) num- 
ber of different states that the computer can assume. An internal clock synchronizes 
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the actions of the computer. On a clock pulse, input can be read, which can change 
some of the storage locations and thus change the state of the machine to a new state. 
What the new state is will depend on what the input was, as well as what the previous 
state was. If these two factors are known, the change is predictable and nonrandom. 
Because the contents of certain storage cells are available as output, the state of the 
machine determines its output. In this way, over a succession of clock pulses, the 
machine produces a sequence of outputs in response to a sequence of inputs. 


Definition 


The finite-state machine is a model that captures the characteristics of a computer. 
As you read the definition, look for the following properties in the behavior of our 
abstract machine: 


— 


Operations of the machine are synchronized by discrete clock pulses. 

2. The machine proceeds in a deterministic fashion; that is, its actions in 

response to a given sequence of inputs are completely predictable. 

The machine responds to inputs. 

4. There is a finite number of states that the machine can attain. At any given 
moment, the machine is in exactly one of these states. Which state it will 
be in next is a function of both the present state and the present input. The 
present state, however, depends on the previous state and input, while the 
previous state depends on its previous state and input, and so on, going all 
the way back to the initial configuration. Thus, the state of the machine at 
any moment serves as a form of memory of past inputs. 

5. The machine is capable of output. The nature of the output is a function of the 

present state of the machine, meaning that it also depends on past inputs. 


шә 


FINITE-STATE MACHINE 

М = [S, I, О, fs, fo] is a finite-state machine if S is a finite set of states, J is a 
finite set of input symbols (the input alphabet), O is a finite set of output symbols 
(the output alphabet), and /; and fo are functions where fy: S X I — S and 
fo: S — О). The machine is always initialized to begin in a fixed starting state sọ. 


The function / is the next-state function. It maps a (state, input) pair to a 
state. Thus, the state at clock pulse t;+1, state(¢;,,), is obtained by applying the 
next-state function to the state at time ¢; and the input at time ¢;: 


state(t;, 1) = fs(state(t;), input(t;)) 


The next-state function is indeed a function, so for any (state, input) pair, 
there is a unique next state. The function fo is the output function. When fo is 
applied to a state at time /;, we get the output at time f;: 


output(t;) = fo(state(f;)) 
Notice that the effect of applying function fo is available instantly, but the effect 
of applying function f; is not available until the next clock pulse. 
Examples of Finite-State Machines 


To describe a particular finite-state machine, we have to define the three sets and 
two functions involved. 
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A finite-state machine M is described as follows: S = {so, S1, s2}, = (0, 1}, 
О = {0, 1}. Because the two functions f; and fo act on finite domains, they can be 
defined by a state table, as in Table 9.1. The machine M begins in state sọ, which 
has an output of 0. If the first input symbol is a 0, the next state of the machine is 
then s,, which has an output of 1. 


TABLE 9.1 
Present state Next state Output 
Present input 
0 1 
So Sy So 0 
51 So Sy 1 
85 $5 So 1 


If the next input symbol is a 1, the machine stays in state s; with an output of 1. By 
continuing this analysis, we see that an input sequence consisting of the characters 
01101 (read left to right) would produce the following effect: 


Time b p a & th i 
Input 0 YF I @ 1 = 
State So S б S Sy So 
ОШ ОШ |(0 d d Y 4" @ 


The initial 0 of the output string is spurious—it merely reflects the starting state, 
not the result of any input. In a similar way, the input sequence 1010 produces an 
output of 00111. 

Another way to define the functions fs and fo (in fact all of M) is by a directed 
graph called a state graph. Each state of M with its corresponding output is the 
label of a node of the graph. The next-state function is given by directed arcs of 
the graph, each arc showing the input symbol(s) that produces that particular state 
change. The state graph for M appears in Figure 9.3. 


REMINDER 


Each input symbol must 
appear on one and only 
one transition arc from 
each state. 


Figure 9.3 ө 


PRACTICE 43 
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For the machine M of Example 29, what output sequence is produced by the input sequence 
11001? 


m 
PRACTICE 44 | A machine M is given by the state graph of Figure 9.4. Give the state table for M. 
Figure 9.4 L| 
PRACTICE 45 | A machine M is described by the state table shown in Table 9.2. 
a. Draw the state graph for M. 
b. What output corresponds to an input sequence of 2110? 
TABLE 9.2 
Present state Next state Output 
Present input 
0 1 2 
So ‘Sy xp on 0 
Sy Si Sq So 1 
a 


The machine of Example 29 is not particularly interesting. If finite-state ma- 
chines model real-world computers, they should be able to do something. Let’s try 
to build a finite-state machine that will add two binary numbers. The input will 
consist of a sequence of pairs of binary digits, each of the form 00, 01, 10, or 11. 
Each pair represents one column of digits of the two numbers to be added, least 
significant digits first (so in this particular case, read the inputs column by column 
right to left). Thus to add the two numbers 


011 
101 


the number pairs are 11, 10, and 01. The output gives the least significant digits of 
the answer first. Recall the basic facts of binary addition: 


0 0 1 1 
0 1 0 | 
0 1 1 10 


(Note that in the fourth addition a carry to the next column takes place.) 
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A moment’s thought shows us that we can encounter four cases in adding the 
digits in any given column, and we will use states of the machine to represent 
these cases. 


e The output should be 0 with no carry—state s. 

e The output should be 0 but with a carry to the next column—state 5. 
e The output should be 1 with по carry—state s;. 

e The output should be 1 with a carry to the next column—state s3. 


State sọ, as always, is the starting state. We have already indicated the output for 
each state, but we need to determine the next state based on the present state and 
the input. For example, suppose we are in state s, and the input is 11. The output 
for the present state is 0, but there is a carry, so in the next column we are adding 
1 + 1 + 1, which results in an output of 1 and a carry. The next state is s3. 


PRACTICE 46 | In the binary adder under construction: 


a. What is the next state if the present state is s; and the input is 11? 
b. What is the next state if the present state is s, and the input is 10? L| 


After considering all possible cases, we have the complete state graph of 
Figure 9.5. 


Output — 0, no carry Output — 1, no carry 


Output = 1, carry 


Figure 9.5 


The operation of this machine in adding the two numbers 011 and 101 (read right 
to left, that is, low-order digits first) can be traced as follows: 


Time b t t t t 
Input 11 10 01 00 - 
State So S Sy Sy $5 
Output | O 0 0 O 1 
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The output (written right to left) is 1000 when we ignore the initial 0, which does 
not reflect the action of any input. Converting this arithmetic to decimal form, we 
have computed 3 + 5 = 8. Note the symmetry of this machine with respect to the 
inputs of 10 and 01, reflecting that binary addition is commutative. 


PRACTICE 47 | Compute the sum of 01110110 and 01010101 by using the binary adder machine of 
Figure 9.5. 


Recognition 


We have already noted that a given input signal may affect the behavior of a 
finite-state machine for longer than just one clock pulse. Because of the (limited) 
memory of past inputs represented by the states of a machine, we can use these 
machines as recognizers. A machine can be built to recognize, say by producing 
an output of 1, when the input it has received matches a certain description. We 
will soon discuss more fully the capabilities of finite-state machines as recognizers. 
Here we will simply construct some examples. 


When binary data are transmitted (or stored) as strings of bits, an extra bit is often 
tacked onto the end of each string as a parity bit. Under an even parity scheme, the 
parity bit (a 0 or a 1) is chosen so that the total number of 1s in the string, including 
the parity bit, is an even number. Under an odd parity scheme, the parity bit is cho- 
sen so that the total number of 15 in the string is odd. When the string is received, 
its parity is checked. Assuming an even parity scheme, if the parity of the received 
string is not even, then an error has occurred in transmitting the string, and a re- 
quest for retransmission can be made. A parity bit therefore serves as a simple 
single-error-detection mechanism. (Note that a parity bit does not detect errors in 2 
bits, although it does detect errors in 3 bits.) The machine described in Figure 9.6 is 
a parity check machine. When the input received through time /; contains an even 
number of 15, then the output at time f;, , is 1; otherwise, the output is 0. 


Figure 9.6 ә 


Suppose we want to design a machine having an output of 1 exactly when the input 
string received to that point ends in 101. As a special case, an input sequence con- 
sisting of just 101 could be handled by progressing directly from state s, to states 
51, 55, and s4 with outputs of 0 except for s3, which has an output of 1. This much 
of the design results in Figure 9.7a. This figure shows that we want to be in state 
55 whenever the input has been such that one more | takes us to s, (with an output 
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of 1); thus we should be in s; whenever the two most recent input symbols were 
10, regardless of what came before. In particular, a string of 1010 should put us in 
55; hence, the next-state function for s, with an input of 0 is s,. Similarly, we can 
use 5 to “remember” that the most recent input symbol received was 1, and that a 
01 will take us to s3. In particular, 1011 should put us in s,; hence, the next-state 
function for s, with an input of 1 is sı. The rest of the next state function can be 
determined the same way; Figure 9.7b shows the complete state graph. 


OmOmOmO 
(a) 


(b) 


Figure 9.7 


Notice that the machine is in state s, at the end of an input of 0110 and at the end of 
an input of 011010—in fact, at the end of any input ending in 10; yet s cannot dis- 
tinguish between these inputs. Each state of M represents a class of indistinguishable 
input histories, s, being the state representing all inputs ending in 101. e 


PRACTICE 48 | Draw the state graph for a machine producing an output of 1 exactly when the input string 


received to that point ends in 00. 


* DEFINITION 


Now we want to see exactly what sets finite-state machines can recognize. 
Remember that recognition is possible because machine states have a limited 
memory of past inputs. Even though the machine is finite, a particular input signal 
can affect the behavior of a machine "forever." However, not every input signal 
can do so, and some classes of inputs require remembering so much information 
that no machine can detect them. 

To avoid writing down outputs, we will designate those states of a finite- 
state machine with an output of 1 as final states and denote them in the state graph 
with a double circle. Then we can give the following formal definition of recogni- 
tion, where /* denotes the set of finite-length strings over the input alphabet. 


FINITE-STATE MACHINE RECOGNITION 

A finite-state machine M with input alphabet / recognizes a subset S of /* if M, 
beginning in state sọ and processing an input string a, ends in a final state if and 
only ifa є S. 
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PRACTICE 49 | Describe the sets recognized by the machines in Figure 9.8. 


Figure 9.8 


* DEFINITION 


In Practice 49d, state s, is a "dead" state from which there is no recovery. The 
appearance of a 0 in states so, 51, or s; irrevocably ruins the pattern the machine 
is able to recognize. In contrast, the machine of Example 31 (or any machine that 
recognizes strings with a certain ending) will never have a dead state because 
there is always hope that subsequent input will match the ending pattern. 


Regular Sets and Kleene's Theorem 


We want a compact, symbolic way to describe sets such as those appearing in the 
answer to Practice 49. We will describe such sets by using regular expressions; 
each regular expression describes a particular set. First, we define what regular 
expressions are; then we will see how a regular expression describes a set. We as- 
sume here that / is some finite set of symbols; later / will be the input alphabet for 
a finite-state machine. 


REGULAR EXPRESSIONS OVER / 
Regular expressions over J are the 


1. symbol @ and the symbol A. 
2. symbol i for any i € I. 
3. expressions (AB), (A V B), and (A4)* if A and B are regular expressions. 
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(This definition ofa regular expression over / is still another example of a recursive 
definition.) 


© DEFINITION REGULAR SET 
Any set represented by a regular expression according to the following conven- 
tions is a regular set: 


1. Ø represents the empty set. 

2. А represents the set {A} containing the empty string. 

3. irepresents the set {i}. 

4. For regular expressions A and B, 
a. (AB) represents the set of all elements of the form o B where a belongs 

to the set represented by А and В belongs to the set represented by B. 

b. (AB) represents the union of A's set and B’s set. 
c. (A)* represents the set of all concatenations of members of A’s set. 


In our discussion, we will bea little sloppy and say things like “the regular set АВ” 
instead of “the set represented by the regular expression AB.” Informally, an element 
in AB is an item from A followed by an item from B. An element in А V B is a single 
item chosen from either A or B. An element in (4)* is zero or more repetitions of ele- 
ments from A. We note that A, the empty string, is a member of the set represented by 
A* for any А because it is the case of zero repetitions of elements from А. In writing 
regular expressions, we can eliminate parentheses when no ambiguity results. The 
regular expression 0* V 10 therefore consists of A, 0, 00, 000, 0000, ... , 10. 


| EXAMPLE 32 | Here are some regular expressions and a description of the set each one represents. 


a, OOD) Any number (including none) of 1s, followed by a 
single 0, followed by any number (including none) 
of 01 pairs. 

Б. Ww Ie A single 0 or any number (including none) of 1s. 

@ (OW Dre Any string of Os or 15, including A. 


d. 11((10)*11)*(00*) A nonempty string of pairs of 1s interspersed with 
any number (including none) of 10 pairs, followed by 
at least one 0. One can see how awkward this verbal 
description is and appreciate the compactness of a 
regular expression description. e 


PRACTICE 50 | Which strings belong to the set described by the regular expression? 
a. 10100010; (0*10)* 
b. 011100; (0 V (11)*)* 
с. 000111100; ((011 V 11)*(00)*)* п 


PRACTICE 51 | Write regular expressions for the sets recognized by the machines of Practice 49. 


© THEOREM 
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A regular set may be described by more than one regular expression. For 
example, the set of all strings of 0s and 1s, which we already know from Example 
32(c) to be described by (0 V 1)*, is also described by the regular expression 
[(0 V. 1*)* V (01)*]*. We might, therefore, write the equation 


(0 V 1)* = [OV 1*)* v (0D)*]* 


Although we may be quite willing to accept this particular equation, it can be 
difficult to decide in general whether two regular expressions are equal, that is, 
whether they represent the same set. An efficient algorithm that will make this 
decision for any two regular expressions has not been found. 

We have introduced regular sets because, as it turns out, these are exactly the 
sets finite-state machines are capable of recognizing. This result was first proved 
by the American mathematician Stephen Kleene in 1956. We state his theorem 
here without proof. 


KLEENE'S THEOREM 
Any set recognized by a finite-state machine is regular, and any regular set can be 
recognized by some finite-state machine. 


Kleene's theorem outlines the limitations as well as the capabilities of finite state 
machines—there are certainly many sets that are not regular. For example, con- 
sider $ = (0"1"| n = 0} where a" stands for a string of n copies of a. Strings in 
S have some number of 0s followed by the same number of 15. S is not regular. 
(Notice that 0*1* does not do the job.) By Kleene's theorem, there is no finite- 
state machine capable of recognizing S. Yet S seems like such a reasonable set, 
and surely we humans could count a string of 0s followed by 1s and see whether 
we had the same number of 15 as 05. This lapse suggests some deficiency in our 
use of a finite-state machine as a model of computation. We will investigate this 
further in Section 9.4. 


Machine Minimization 


Although we have treated finite-state machines as abstractions, circuits that act like 
finite-state machines can be built from electronic devices like the logic elements of 
Section 8.2 and others. If we wish to construct a physical machine, the number of 
internal states is a factor in the cost of construction. Minimization is the process 
of finding, for a given finite-state machine M, a machine M' with two properties: 


1. If M and M' are both begun in their respective start states and are given 
the same sequence of input symbols, they will produce identical output 
sequences. 

2. M' has, if possible, fewer states than M (if this is not possible, then M is 
already a minimal machine and cannot be further reduced). 


Unreachable States 

First, let's observe that we can remove any unreachable states of M, those states 
that cannot be attained from the starting state no matter what input sequence 
occurs. 
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Let M be given by the state table of Table 9.3. Although the state table contains the 
same information as the state graph (Figure 9.9), the graph shows us at a glance 
that state s, can never be reached from the starting state sọ. If we simply remove 
state s, and its associated arcs, we have the state graph of Figure 9.10 for a ma- 
chine M' with one less state than M that behaves exactly like M; that is, it gives the 
same output as M for any input string. 


TABLE 9.3 
Present state Next state | Output 
Present input 
0 1 
50 Sy 53 0 
Sy 53 50 0 
$5 Sy 53 1 
53 So Sy 1 


Figure 9.9 Figure 9.10 e 


PRACTICE 52 | What state(s) is/are unreachable from s, in the machine of Table 9.4? Try to get your an- 


swer directly from the state table. 


TABLE 9.4 
Present state Next state Output 
Present input 
0 1 

So Sy S4 0 

51 54 Sy А 

$5 СЯ Я 1 

53 53 Sy 0 

54 So So 1 " 


Because the state graph of a finite-state machine is a directed graph, it has 
an associated adjacency matrix. Warshall’s algorithm (Section 7.1) can be used to 
detect unreachable states. 


* DEFINITION 
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Minimization Procedure 

Assuming now that all unreachable states have been removed from M, we will 
continue to look for a reduced machine M'. The key to finding a reduced M', if 
one exists, lies in the notion of equivalent states. 


EQUIVALENT STATES 
Two states s; and s; of M are equivalent if for any a € /*, /0(5, o!) = fos; о), 
where /* again denotes the set of finite-length strings over the input alphabet. 


In this definition of equivalent states, the definition of the output function has 
been extended to denote the sequence of output symbols obtained by repeatedly 
applying fo to a sequence a of input symbols. Thus equivalent states of a machine 
produce identical output strings for any input string. 


PRACTICE 53 | Prove that state equivalence is an equivalence relation on the states of a machine. 


For the time being, we will postpone the problem of how to identify equivalent 
states in a given machine M. Let's simply assume that we have somehow found 
which states are equivalent and have partitioned the states of M into the correspond- 
ing equivalence classes. These classes have two properties: (1) АП states in the same 
class have the same output, and (2) for each input symbol, all states in the same class 
proceed under the next-state function to states that are all in the same class. 


PRACTICE 54 | Show that properties 1 and 2 are satisfied when M is partitioned into classes of equivalent 


states. 


© DEFINITION 


We define a machine M' whose states are the equivalence classes of M. M' has 
the same input and output alphabet as M, and its start state is the class to which 
So, the start state of M, belongs. The output of a class is the output symbol com- 
mon to all states of M in that class (property 1). The next state of class X under 
an input symbol is that class to which all states of M in X proceed under that 
input symbol (property 2). M' is a well-defined machine. M' produces the same 
output strings when processing a given input string as does M. Also, the number 
of states of M' (equivalence classes of M) will be no greater than the number of 
states of M. 

The minimization problem for M thus boils down to finding the equivalent 
states of M. Perhaps we should note first that the obvious approach of directly try- 
ing to satisfy the definition of equivalent states will not work. Given two states s; 
and s; of M, we cannot actually compare the outputs corresponding to each pos- 
sible input string. Fortunately, the problem is not as infinite as it sounds; we only 
need to identify k-equivalent states. 


K-EQUIVALENT STATES 
Two states s; and s; of M are k-equivalent if for any a € /* where о has no more 
than k symbols, fols; œ) = fols; a). 
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It is not hard to see that k-equivalence is an equivalence relation on the states 
of M (check the reflexive, symmetric and transitive properties). It is possible to test 
two states of M for k-equivalence directly, since we can actually produce the finite 
number of input strings having no more than k symbols. However, it turns out 
that we don’t have to do this. We can begin by finding 0-equivalent states. These 
are states producing the same output for 0-length input strings, that is, states hav- 
ing the same associated output symbol. Thus, we can identify the 0-equivalence 
classes directly from the description of M. 


Let M be defined by the state table of Table 9.5. (Here we’ve started writing 
0, 1, 2... for states instead of sj, 51, 55, ... . ) The O-equivalence classes of the 
states of M are 


40, 2,5) and (193,4: 6) 


TABLE 9.5 
Present state Next state Output 
Present input 
1 
0 2 3 0 
1 3 2 1 
2 0 4 0 
3 1 5 1 
4 6 5 1 
5 2 0 0 
6 4 0 il e 


Our procedure to find k-equivalent states is a recursive one; we know how to 
find 0-equivalent states, and we will show how to find k-equivalent states once we 
have identified states that are (k — 1)-equivalent. Suppose, then, that we already 
know which states аге (k — 1)-equivalent. If states s; and s; are A-equivalent, they 
must produce the same output strings for any input string of length К or less, 
which includes any string of length Ё — 1 or less. Thus, s; and s; must at least be 
(k — 1)-equivalent. But they also must produce the same output strings for any 
k-length input string. 

An arbitrary k-length input string consists of a single arbitrary input symbol 
followed by an arbitrary (A — 1)-length input string. If we apply such а k-length 
string to states s; and s; (which themselves have the same output symbol), the 
single input symbol moves s; and s; to next states s; and s;; then s; and s; must 
produce identical output strings for the remaining, arbitrary (k — 1)-length string, 
which will surely happen if s; and s; are (k — 1)-equivalent. Therefore, to find k- 
equivalent states, look for (k — 1)-equivalent states whose next states under any 
input symbol are (k — 1)-equivalent. 
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Consider again the machine M of Example 34. We know the 0-equivalent states. 
To find 1-equivalent states, we look for 0-equivalent states with 0-equivalent next 
states. For example, the states 3 and 4 are 0-equivalent; under the input symbol 
0, they proceed to states 1 and 6, respectively, which are 0-equivalent states, and 
under the input symbol 1 they both proceed to 5, which of course is 0-equivalent 
to itself. Therefore, states 3 and 4 are 1-equivalent. Similarly, states 1 and 3 are 
1-equivalent and states 4 and 6 are 1-equivalent. Therefore the class (1, 3, 4, 6} of 
1-equivalent states is unchanged from the class of 0-equivalent states. But states 
0 and 5, themselves 0-equivalent, proceed under the input symbol 1 to states 3 
and 0, respectively, which are not 0-equivalent states. So states 0 and 5 are not 
1-equivalent; the input string 1 will produce an output string of 01 from state 0 
and of 00 from state 5. States 0 and 2 are 1-equivalent. Therefore the l-equivalence 
classes for M are 


10:2) 5) СЕЗ 0) 


To find 2-equivalent states, we look for 1-equivalent states with 1-equivalent 
next states. States 1 and 3, although 1-equivalent, proceed under input 1 to states 
2 and 5, respectively, which are not l-equivalent states. Therefore, states 1 and 
3 are not 2-equivalent. The states 0 and 2, 1 and 6, and 3 and 4, respectively, are 
2-equivalent. The 2-equivalence classes for M are 


ЮЛ БШО вя 


The 3-equivalence classes for M are the same as the 2-equivalence classes. Ф 


* DEFINITION PARTITION REFINEMENT 
Given two partitions 77, and т, of a set S, т; is a refinement of 77, if each block 
of 7, is a subset of a block of т». 


In Example 35 each successive partition of the states of M into equiva- 
lence classes is a refinement of the previous partition. This refinement will al- 
ways happen; k-equivalent states must also be (k — 1)-equivalent, so the blocks 
of the (k — 1)-partition can only be further subdivided. However, the subdivi- 
sion process cannot continue indefinitely (at worst it can go on only until each 
partition block contains only one state); there will eventually be a point where 
(k — l)-equivalent states and k-equivalent states coincide. (In Example 35, 
2-equivalent and 3-equivalent states coincide.) Once this happens, all next states for 
members of a partition block under any input symbol fall within a partition block. 
Thus, k-equivalent states are also (k + l)-equivalent and (k + 2)-equivalent, 
and so on. Indeed, these states are equivalent. 

The total procedure for finding equivalent states is to start with 0-equivalent 
states, then l-equivalent states, and so on, until the partition no longer subdivides. 
A pseudocode description of this algorithm is given. It isn't nearly as complex as 
it looks, but it does involve checking many pairs of states, just as in Example 35. 
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AGORITHM MINIMIZE 


Minimize (finite-state machine table M) 

//produces a minimized version of М 

Local variable: 

boolean flag //flag for loop exit when nonequivalent states found 


find 0-equivalent states of M 
repeat 
while untested equivalence classes remain do 
select untested equivalence class 
while untested state pairs in current class remain do 
select untested state pair in current class 
flag — false 
while untried input symbols remain and not //ag do 
select untried input symbol 
for both states in current pair, find next state 
under current input symbol 
if next states not equivalent then 
flag — true 
end if 
end while 
if flag then 
mark current states for different classes; 
end if 
end while 
form new equivalence classes 
end while 
until set of new equivalence classes — set of old equivalence classes 
end Minimize 


| EXAMPLE 36 | For the machine М of Examples 34 and 35, the reduced machine М” will have 


states 
АЕ 
в = 15} 
C= 16} 
D= ETE 


The state table for М” (Table 9.6) is obtained from that for M. Machine M' (starting 
state A) will reproduce M’s output for any input string, but it has four states instead 
of seven. 
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TABLE 9.6 
Present state Next state Output 
Present input 
0 1 
А А D 0 
B A A 0 
С D A 1 
D Є B 1 e 
| EXAMPLE 37 | We will minimize M where M is given by the state table of Table 9.7. 
TABLE 9.7 
Present state Next state Output 
Present input 
0 1 
0 3 1 1 
1 4 1 0 
2 3 0 1] 
3 2 3 0 
4 1 0 3 


The 0-equivalence classes of M are 


ООО р 


States 0 and 2 under input | go to states 1 and 0, which are not 0-equivalent. The 
1-equivalence classes of M are 


ТОУ 


No further refinement is possible. Let 


A = {0} 
B = {2,4} 
С = {1,3} 


The reduced machine is shown in Table 9.8. 


TABLE 9.8 
Present state Next state Output 
Present input 
0 1 
А (© C 1 
B (© А 1 
C B © 0 
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PRACTICE 55 | Minimize the machines whose state tables are shown in Tables 9.9 and 9.10. 


TABLE 9.9 
Present state Next state Output 
Present input 
0 1 
0 2 1 1 
1 2 0 1i 
2 4 3 0 
3 2 3 1 
4 0 1 0 
TABLE 9.10 
Present state Next state Output 
Present input 
0 1 
0 1 3 1 
1 2 0 0 
2 1 3 0 
3 2. 1 0 = 


Sequential Networks and Finite-State Machines 


The output of a finite-state machine is a function of its present state, and the pres- 
ent state of the machine is a function of past inputs. Thus, the states of a machine 
have certain memory capabilities. In the combinational networks of Chapter 8, 
which use AND gates, OR gates, and inverters, the output is virtually instanta- 
neous and a function only of the present input. To build a finite-state machine we 
need one additional element that provides the memory missing from our previous 
logic networks. 

A delay element is the simplest of a class of elements known as flip-flops. It 
is regulated by a clock, has a single binary input, and its output at time ¢ + 1 is 
the input signal it received at time ¢. The delay element is therefore a “memory 
device” that captures input for the duration of one clock pulse. Figure 9.11 repre- 
sents the delay element at time t + 1, with the signal propagating from right to 
left. When the delay element is receiving an input of d(t + 1), the output is the 
previous input d(t). 


d(t) d(t+1) 


— 


Figure 9.11 


EXAMPLE 38 
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When one or more delay elements are introduced into a combinational net- 
work, the network is known as a sequential network. Unlike a combinational 
network, loops (where output from a circuit becomes part of its input) are allowed, 
provided that at least one delay element is incorporated into the loop. The delay 
element prevents the confusion that results when a circuit tries to act on its current 
output. Input sequences can be run through sequential networks (hence the name) 
provided that the clock pulse synchronizes input signals as well as delay elements. 


The delay element in Figure 9.12 feeds the output from the terminal AND gate of 
the network back into the initial OR gate at the next clock pulse. The initial output 
of the delay element is assumed to be 0. The input sequences ху = 10010 and 
x; = 11000 (read left to right) produce the effect shown in the table. 


| 


а 


Figure 9.12 
Тіте to ti t; їз t4 
X1 1 0 0 1 0 
Xo 1 1 0 0 0 


Delay output 0 0 1 al 0 
с=с = у=”, 


Circuit output 


Any finite-state machine can be built using a sequential network. (We'll 
assume that all input and output values are binary; if not, they can be encoded 
in binary form.) The general structure of such a network is shown in Figure 9.13. 
It consists of two parts: (1) a combinational network (no delay elements) and (2) 
some loops containing all the delay elements. 

To build the network for a given finite-state machine, we represent each 
machine state as a binary number, beginning with zero (0000, 0001, 0010, 
0011, 0100, and so on); the assignment of states to numbers is arbitrary. At 
any moment, each delay element in the network has a 0 or 1 signal on its 
output line, so the collection of these output signals is a binary number and 
therefore represents one of the states of the machine. As these signals feed into 
the combinational network, they represent the current state; circuits within the 
combinational network compute the next state, which is the pattern of 0s and 1s 
on the input lines to the delay elements. 
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Input ———X — — Output 
———— L——— 
———— | pj 
— — Combinational — 

network 
> 
-—>| 
г> 
Hel 
Hell 
. 
e 
. 
а 
Figure 9.13 


If the finite-state machine has q states, the binary numbering of the states 
requires a certain number of bits. For example, to represent six different 
states requires 3-bit numbers ranging from 000 to 101. To represent g different 
states requires [109,4] bits. Each bit in the binary number is the signal from a 
delay element, so the network will require [log, q] delay elements. The essence 
of the construction is to translate the state table of the finite-state machine into 
truth functions for the outputs and delay elements, and then to construct the logic 
network for each truth function, as we did in Chapter 8. 


| EXAMPLE 39 | Consider the finite-state machine of Example 29, with state table as shown in 


Table 9.11. 
TABLE 9.11 
Present state Next state Output 
Present input 
0 1 
So S4 So 0 
S] So S 
S2 Sp So 1 


Because there are only two input symbols and two output symbols, only one input 
line x (taking on values of 0 or 1) and one output line y are needed for the network. 
There are three states, so we need [log,3] = 2 delay elements. We arbitrarily as- 
sociate states with binary numbers (see Table 9.12) represented as signals on the 
inputs or outputs of the delay elements. 
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TABLE 9.12 
а, d» 
So 0 0 
S4 0 1 
52 1 0 


Now we use the information contained in the state table to write three truth 
functions. One truth function describes the behavior of the output y(t); it is a func- 
tion of the two variables d,(¢) and d;(f) representing the present state. The other 
two truth functions describe the behavior of d,(t + 1) and d,(t + 1), representing 
the next state; these are functions of x(t), 4,00), and d;(f), the present input and the 
present state. Table 9.13 shows these truth functions. 


TABLE 9.13 
x(t) ай — dt) yt) d,(t+1) d,{t 1) 
0 0 0 0 0 1 
1 0 0 0 0 0 
0 0 1 1 1 0 
1 0 1 1 0 1 
0 1 0 1 1 0 
1 1 0 1 0 0 


In constructing the third line of Table 9.13, for example, x(t) = 0,d,(ft) = 0, 
and d,(t) = 1, meaning that the present input is 0 and the present state is s,. The 
output associated with state s; is 1, so y(t) = 1. The next state associated with 
input 0 and present state s, is s2, so di(t + 1) = 1 andd,(t + 1) = 0. Notice that 
there are some don’t-care conditions for these functions because the configuration 
d,(t) = 1 and d,(t) = 1 does not occur. 

The canonical sum-of-products form for each of these truth functions is 


y(t) = did, + did, (y 15 not a function of x) 
di(t + 1) = х'а d, + x'djd; 
dy(t + 1) = x'did + хаа, 


Using а Karnaugh map and appropriate choices for the don’t-care conditions, these 
expressions can be simplified to 


y(t) = d, + d, 
di(t + 1) = x'd, + x'd, = x'(d, dj) 
Ot + 1) = ха, + x'did; 
The logic networks for these expressions go into the “combinational network” 


box in Figure 9.13. Thus Figure 9.14 is a wiring diagram for the finite-state 
machine. 
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x(t) NW y(t) 
» 


d(t) 
> 
d,(t) 
ж 
О) 
ха, 
S eed d,(t+1) 
x'd'd' ) 2 
d,(t) d,(tt+1) 
d,(t) Gl sel); 
Figure 9.14 ө 


PRACTICE 56 | Construct a sequential network for the parity check machine of Example 30. 


SPECIAL INTEREST PAGE 


FSMs Behind the Game 


Everyone is familiar with the use of storyboards in de- 
signing a movie. Storyboards are sequences of illus- 
trated panels that set the scene, sketch the action, and 
outline the dialogue. Thumbing rapidly through pages of 
such panels creates a “flipbook,” a primitive animation 
of the movie. So these panels, laid end-to-end, are a one- 
dimensional layout of the movie. 

What differentiates a video game from a movie? A 
movie (a film) is static in the sense that it always comes 
out the same way; there are no variations in the scenes or 
changes in the ending. A video game incorporates user 
interaction, and based on user input, different outcomes 
can occur at various points throughout the game. Design 
of a video game requires a two-dimensional approach. 

If you freeze the video game at some point where 
it is waiting for user input, there is a certain scene on 
the screen with particular characters in place, and these 
characters may have certain attributes at this point in 
time. We can consider this to be the “state” of the game 
at the moment. User input is provided, and based on that 
input and the current state, something happens to move 
the game to a predefined new state. Sound familiar? Yes, 
this is essentially a finite-state machine. “Finite state ma- 
chines are the nuts and bolts of game AI"! 

Here’s an outline, written as an English paragraph, 
of a simple video game: The user starts the game by 


Choose 
level 2 


Choose 
level 1 


Lantern 
Find 
codeword 


Plant 
tree 


Open treasure 
chest 


Slay 
dragon 


e Chapter 


choosing to play at level 1 or level 2. In this simple 
game, the only entity is the user’s avatar, call it object 
А. A user who chooses level 1 sends А into a room 
where A must pick up one of three objects: a book, a 
sword, or a lantern. If the book is chosen, A enters a 
room where the book can be searched for a secret code 
word. When the code word is found, A moves on to the 
opening screen for level 2. If the sword is chosen, A en- 
ters a room with a fearsome dragon. When A slays the 
dragon, A receives the book and enters the book room 
to search for the code word. If the lantern is chosen, 
A enters a room to search for a treasure chest. When 
the treasure chest is opened, it contains a sword and 
A moves on to the room with the fearsome dragon. At 
level 2, A chooses between door | and door 2. Choos- 
ing door 1 ends the game. Choosing door 2 moves A 
into a garden, where A must find and plant a lilac tree. 
Upon planting the tree, the game ends. 

A state graph for a finite-state machine to model the 
game behavior described above looks something like the 
diagram shown. 

Ultimately this design has to be translated into 
computer code. Details will vary with the program- 
ming language used, of course, but in any event there 
must be a way to make multiple choices from a given 
state depending on the user action: 


if (A.state = levell and A.pick = lantern) 
A.state LanternRoom 

else if (A.state = levell and A.pick = sword) 
A.state — SwordRoom 

else if (A.state = levell and A.pick = book) 
A.state — BookRoom 

else if (A.state = level2 and A.pick = Doorl) 
A.state — GameEnds 


Door 1 


and so forth. 

In a more complex video game there 
will be many entities, and the computer code 
must go through a similar decision process 
for each one to program its correct action 
based on its current state and user input. The 
environment itself may be an entity, requir- 
ing changes of background (from the lantern 
room to the sword room, for example). 


! AI for Game Development, David M Bourg and Glenn Seemann, O'Reilly Media, Inc., 2004, ISBN-13: 978-0-596-00555-9 
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SECTION 9.3 REVIEW 


TECHNIQUES 


e Compute the output string for a given finite-state 
machine and a given input string. 

* Draw а state graph from a state table and vice versa. 

W) Construct a finite-state machine to act as a recog- 
nizer for a certain type of input. 

* Find a regular expression given the description of 
a regular set. 

* Decide whether a given string belongs to a given 
regular set. 

* Minimize finite-state machines. 

* Construct sequential networks for finite-state 
machines. 


EXERCISES 9.3 


MAIN IDEAS 


* Finite-state machines have a synchronous, deter- 
ministic mode of operation and limited memory 
capabilities. 

* The class of sets that finite-state machines can rec- 
ognize is the class of all regular sets; hence, their 
recognition capabilities are limited. 

* Unreachable states can be removed from a machine. 

* After unreachable states have been removed from a 
machine, a minimized version of that machine can 
be found that produces the same output strings for 
all input strings. 

* Any finite-state machine can be built using a net- 
work of AND gates, OR gates, inverters, and delay 
elements. 


1. For each input sequence and machine given, compute the corresponding output sequence (starting state 1s 


always sy). 


a. 011011010 


b. abccaab 


Present state 


a 
So 52 
$4 So 
52 52 


S5 S4 


Next state Output 
Present input 
b с 
So S3 a 
S2 S3 b 
So 54 а 
So So Ç 
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c. 0100110 


2. a. For the machine described in Exercise la, find all input sequences yielding an output sequence of 
0011110. 


b. For the machine described in Exercise 1b, find all input sequences yielding an output sequence of 
abaaca. 

c. For the machine described in Exercise 1c, what will be the output for an input sequence a,a5a3a445 
where а; є {0, 1},1 s i 5? 


In Exercises 3—6, write the state table for the machine, and compute the output sequence for the given input 
sequence. 


3. 00110 5. 01011 
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In Exercises 7—10, draw the state graph for the machine, and compute the output sequence for the given input 


sequence. 
7. 10001 9. acbbca 
Present state Next state Output Present state Next state Output 
Present input Present input 
0 1 a b c 
So So Sp 1 So S4 S4 S4 
51 51 So 0 $ 52 52 S4 
S2 So S4 (0) So So Sp S4 1 
8. 0011 10. 21021 
Present state Next state Output Present state Next state Output 
Present input Present input 
0 1 0 1 2 
So So 53 0 So S3 S4 So 1 
S4 So S4 1 S4 S3 So S4 2 
S2 $1 53 0 So S, S4 S4 0 
53 $1 S2 1 S3 S4 S4 So 0 
54 S4 S4 Sp 2 
11. a. Construct a finite-state machine that complements each bit of the binary input string (read left to right). 
b. Write the output for the input sequence 01011. 

12. a. Construct a finite-state machine that will compute x + 1 where x is the input given in binary form, least 
significant digit first (in this case, read the input right to left). You could use the binary adder of Figure 
9.5 by writing 1 as 00... 01 with the correct number of leading 05, but that's much too complicated. 

b. Write the output for x = 1101. 

13. a. Construct a finite-state machine that will compute the bitwise AND of two binary input strings. 

b. Write the output for the input sequence consisting of the strings 11011 and 10010 (read left to right). 

14. a. Construct a finite-state machine that will compute the bitwise OR of two binary input strings. 

b. Write the output for the input sequence consisting of the strings 11011 and 10010 (read left to right). 

15. a. Construct a delay machine having input and output alphabet (0, 1} that, for any input sequence аа›а;... 
produces an output sequence of 00a,a,a3... . 

b. Explain (intuitively) why a finite-state machine cannot be built that, for any input sequence aj25a5... , 
produces the output sequence 0a,0a50a,... . 

16. a. Construct a finite-state machine that will compute ће 2's complement of p where р is a binary number 
input with the least significant digit first. (See Exercise 27, Section 8.2.) (In this case, read the input 
right to left.) 

b. Use the machine of part (a) to find the 2's complement of 1100 and of 1011. 


17. You are designing a Windows-based, event-driven program to handle customers for a small business. 
You design the user interface with three screens. The opening screen contains an exit button to quit the 
program and displays a list box of customer names. Double-clicking on one of the entries in the list box 
brings up a second screen showing complete data for that customer. This screen contains a button to get 
back to the opening screen. The opening screen also contains a button that brings up a form to enter the 
data for a new customer. Draw the state graph for a finite-state machine that describes the high-level user 
interaction with the program. 


18. 


20. 
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Whenever a video disk is inserted into a DVR, the machine automatically turns on and plays the disk. At 
the end of the recorded part of the disk, the machine turns off. To program the DVR, you must manually 
turn it on and then select the menu function; when you are finished, you turn the machine off, but its timer 
is set. At the appropriate time, the machine records, then at the appropriate time it turns itself completely 
off. Draw the state graph for a finite-state machine that describes the behavior of the DVR. 


. You have an account at First National Usury Trust (FNUT) and a card to operate their ATM (automated 


teller machine). Once you have inserted your card, the ATM will allow you to process a transaction only 
if you enter your correct code number, which is 417. Draw the state graph for a finite-state machine de- 
signed to recognize this code number. The output alphabet should have three symbols: “bingo” (correct 
code), “wait” (correct code so far), and “dead” (incorrect code). The input alphabet is (0, 1, 2, ... , 9}. 
To simplify notation, you may designate an arc by /— {3}, for example, meaning that the machine will 
take this path for an input symbol that is any digit except 3. (At FNUT, you get only one chance to enter 
the code correctly.) 


An elevator in a three-story building services floors 1, 2, and 3. Input consists of a signal to the elevator 
of an up-or-down request (U or D) together with the floor from which the signal originates. The elevator 
responds to an input signal by moving to the correct floor. For example, when the elevator is on floor 1 
and receives a D-3 signal, it moves to floor 3. Draw a state graph for a finite-state machine that describes 
the elevator behavior. 


For Exercises 21—24, determine whether the given machine recognizes the given input string. 


21. 


22. 


11010 23. 0101 


01110111 


For Exercises 25—28, construct finite-state machines that act as recognizers for the input described by produc- 
ing an output of 1 exactly when the input received to that point matches the description. The input and output 
alphabet in each case is (0, 1}. 


26. 


27. 


28. 


29. 


30. 


3 


n 


v © с meo cmo c sg 
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. set of all strings containing an even number of 0s 

. set of all strings consisting of two or more 15 followed by a 0 

. set of all strings containing two consecutive 05 and the rest 15 

. set of all strings ending with one or more 0s 

. set of all strings where the number of 0s is a multiple of 3 

. set of all strings containing at least four 15 

. set of all strings containing exactly one 1 

. set of all strings beginning with 000 

. set of all strings where the second input is 0 and the fourth input is 1 


. set of all strings consisting entirely of any number (including none) of 01 pairs or consisting entirely of 
two 15 followed by any number (including none) of 0s 

b. set of all strings ending in 110 

c. set of all strings containing 00 

A paragraph of English text is to be scanned and the number of words beginning with “соп” counted. 
Design a finite-state machine that will output a 1 each time such a word is encountered. The output alpha- 
bet is (0, 1). The input alphabet is the 26 letters of the English alphabet, a finite number of punctuation 
symbols (period, comma, and so on), and a special character В for blank. To simplify your description, 
you may use / — (mj, for example, to denote any input symbol not equal to m. 


a. In many computer languages, any decimal number N must be presented in one of the following forms: 
sd* sd*.d* q* d*.q* (1) 


where s denotes the sign (s є {+, — }), d is a digit (d є (0, 1,2, ... , 9}), and d* denotes a string of 
digits where the string may be of any length, including length zero (the empty string). Thus, the follow- 
ing would be examples of valid decimal numbers: 


+2.74 —.58 129 + 


Design a finite-state machine that recognizes valid decimal numbers by producing an output of 1. The 
input symbols are +, —,., and the 10 digits. To simplify notation, you may use d to denote any digit 
input symbol. 

b. Modify the machine of part (a) to recognize any sequence of decimal numbers as defined in part (a) 
separated by commas. For example, such a machine would recognize 


+2.74,—.58,129,+ 


The input alphabet should be the same as for the machine of part (a) with the addition of the symbol c 
for comma. 


c. Suppose a decimal number must be presented in a form similar to that for part (a) except that any deci- 
mal point that appears must have at least one digit before it and after it. Write an expression similar to 
expression (1) in part (a) to describe the valid form for a decimal number. How would you modify the 
machine of part (a) to recognize such a number? 


. Let M be a finite-state machine with л states. The input alphabet is (0). Show that for any input sequence 


that 1s long enough, the output of M must eventually be periodic. What is the maximum number of inputs 
before periodic output begins? What is the maximum length of a period? 
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32. At the beginning of the chapter, we learn: 


Your team at Babel, Inc., is writing a compiler for a new programming language, currently code-named 
ScrubOak after a tree outside your office window. During the first phase of compilation (called the lexical 
analysis phase) the compiler must break down statements into individual units called tokens. In particular, 
the compiler must be able to recognize identifiers in the language, which are strings of letters, and also 
recognize the two keywords in the language, which are if and in. 


How can the compiler recognize the individual tokens in a statement? 
Construct a finite-state machine that operates on a stream of characters and moves into one of two final 


states representing that a complete keyword has just been processed or that a complete nonkeyword identi- 
fier has just been processed. Use f? to denote a separating blank between tokens. 


For Exercises 33—38, give a regular expression for the set recognized by the finite-state machine. 


33. 


35. 


36. 
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For Exercises 39—42, give a regular expression for the set recognized by the finite-state machine. 


41. 


39. Present state Next state Output Present state Next state Output 
Present input Present input 
0 1 0 1 
Sp S5 S4 0 So So S 1 
S S4 So 0 S4 S3 S 1 
S, S3 S3 1 So S3 S, 0 
S5 S5 S5 0 S5 S5 S5 0 
54 55 53 0 
55 S2 S3 1 
40. Present state Next state Output 42. Present state Next state Output 
Present input Present input 
0 1 0 1 
So S5 S4 1 So S4 S4 1 
Si S So 1 51 54 So 0 
S2 So Sp 0 S2 S4 S3 0 
53 So S2 (0) S3 S3 S4 1 
S4 S4 S4 0 


43. Give a regular expression for each of the following sets. 
a. set of all strings of 0s and 15 beginning with 0 and ending with 1 
b. set of all strings of 0s and 1s having an odd number of 0s 
c. (101, 1001, 10001, 100001, ...} 

44. Give a regular expression for each of the following sets. 
a. set of all strings of 0s and 15s containing at least one 0 
b. set of all strings of a’s and b’s where each a is followed by two b’s 
c. set of all strings of 0s and 15 containing exactly two 05 
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45. Does the given string belong to the given regular set? 
a. 01110111; (1*01)*(11 V 0*) 
b. 11100111; [(1*0)* V 0*11]* 
c. 011100101; 01*10*(11*0)* 

46. Does the given string belong to the given regular set? 
a. 1000011; (10* V 11)*(0*1)* 
b. 011110; 0*11(1* V 10) 
с. 101110; [(101)*10*]* 


47. Write a regular expression for the set of all arithmetic expressions indicating the addition or subtraction of 
two positive integers. 


48. Write a regular expression for the set of all alphanumeric strings beginning with a letter, which is the set 
of legal identifiers in some programming languages. 


49. Write regular expressions for each of the strings described in Exercise 25. 


50. Write regular expressions for each of the strings described in Exercise 26. 


5]. Write regular expressions for each of the strings described in Exercise 27. 


52. Write regular expressions for each of the strings described in Exercise 28. 


53. a. Prove that if A is a regular set, then the set A^ consisting of the reverse of all strings in А is also regular. 
5 5 5 5 


b. For any string а, let а“ be the reverse string. Do you think ће set {оао є I*} is regular? 


54. Prove that if A is a regular set whose symbols соте from the alphabet /, then /* — A is a regular set. 


A number of programming languages define "regular expressions" somewhat differently than we have done 
in this chapter. In these instances, the regular expression is meant to describe a pattern for a set of strings so 
that an arbitrary string can be matched against the pattern to see whether the string belongs in the set. Examples 
are (a) searching a string to see whether it matches the format of a valid e-mail address and (b) extracting all 


instances of the form href = 


А3 


.." from an HTML document. Perl (Practical Extraction and Report Language) 


is a language strong in text processing; following are some of the syntax rules for regular expressions in Perl. 


? 
+ 


No 
\s 
SS 


repeat the preceding character or group 0 or more times [this is familiar] 

repeat the preceding character or group 0 or 1 times 

repeat the preceding character or group 1 or more times 

(period)—matches any single character 

(period asterisk)—matches arbitrary string of any length 

(backslash lowercase s)—matches any whitespace character (space, tab, newline) 
(backslash uppercase s)—matches any nonwhitespace character 


For Exercises 55—62, decide which of the given strings match the given regular expression. 


55. Regular expression: bet?er Strings: beer, beter, better, bettter 
56. Regular expression: bet*er Strings: beer, beter, better, bettter 
57. Regular expression: bet+ er Strings: beer, beter, better, bettter 
58. Regular expression: b.?t Strings: bit, but, beet, bt 
59. Regular expression: b.+t Strings: bit, but, beet, bt 
60. Regular expression: b\St Strings: bit, but, beet, bt 
61. Regular expression: b\st Strings: bit, but, beet, b t 


62. Regular expression: b\s*t Strings: bit, bt, bt, b t 
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63. Identify any unreachable states of M. 64. Identify any unreachable states of M. 
Present state Next state Output Present state Next state Output 
Present input Present input 
0 1 а b с 
So So So 0 So S4 So S3 0 
S4 So S4 1 S4 S4 S3 So 1 
S2 So So 1 Sp S3 S2 S4 0 
53 S4 S4 So 0 
For Exercises 65—74, minimize the given machine. 
65. Present state Next state Output 66. Present state Next state Output 
Present input Present input 
0 1 0 1 
0 3 6 1 0 5 3 1 
1 4 2 0 1 5 2 0 
2 4 1 0 2 1 3 0 
3 2 0 1 3 2 4 1 
4 5 0 1 4 2 0 1 
5 3 5 0 5 1 4 0 
6 4 2 1 
67. Present state Next state Output 68. Present state Next state Output 
Present input Present input 
0 1 0 1 
0 1 2 0 0 7 1 1 
1 2 3 1 1 0 3 1 
2 3 4 0 2 5 1 0 
3 2 1 1 3 7 6 1 
4 5 4 1 4 5 6 0 
5 6 7 0 5 2 3 0 
6 5 6 1 6 3 0 1 
7 8 1 0 7 4 0 0 
8 7 3 0 
69. Present state Next state Output 70. Present state Next state Output 
Present input Present input 
0 1 0 1 
0 1 3 0 0 1 3 0 
1 2 4 1 1 2 0 0 
2 5 4 0 2 0 3 0 
3 1 2 2 3 2 1 0 
4 2 1 1 
5 4 0 2 
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71. Present state Next state Output 72. Present state Next state Output 
Present input Present input 
a b с 0 1 
0 1 4 0 1 0 1 3 1 
1 4 2 3 0 1 2 0 0 
2 3 4 2 1 2 4 3 1 
3 4 0 1 0 3 0 1 1 
4 1 0 2 0 4 2 4 0 
73. Present state Next state Output 74. Present state Next state Output 
Present input Present input 
0 1 0 1 
0 3 0 0 0 3 5 1 
1 4 3 1 1 1 6 1 
2 1 4 0 2 0 4 0 
3 0 4 1 3 1 6 1 
4 5 2 0 4 5 3 0 
5 2 3 1 5 4 1 0 
6 2 5 1 


75. Construct a sequential network for the finite-state machine of Exercise 8. 


76. Construct a sequential network for the finite-state machine of Exercise 1а. Make use of don’t-care condi- 
tions to simplify the network. 


SECTION 9.4 


TURING MACHINES 


In Section 9.3, we noted that because S = {0"1"|n = 0} is not a regular set, 
Kleene's theorem tells us that it is not recognizable by any finite-state machine. 
We didn't actually prove that S is not a regular set, however; we only noted that 
we were not able to come up with a regular expression for it. Let's take a slightly 
different approach. 

Suppose S is recognized by a finite-state machine M with m states. Then all 
strings from 5 and only strings from 5 lead M from its start state to a final state. 
Now letus run Ma number of times on successive input strings of A, 0, 0°, 0? ... , 0”. 
At the end of processing each of these m + 1 strings, M will be in some state. 
Because M has only m distinct states, there must be two strings from this list, 
say 0" and 0", у # w, each of which lead M from the start state to the same state. 
(This is actually a result of the pigeonhole principle of Chapter 4, where the items 
are the input strings and the bins into which we put the items are the states M is 
in after processing the strings.) Because M recognizes S, the input string 0”1” will 
cause M to end in a final state. But because M is in the same state after processing 
0” as after processing 0”, the string 0"1", which does not belong to S, will take M 
to the same final state. This contradiction proves that no finite-state machine can 
recognize 5. 
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We probably consider ourselves to be finite-state machines and imagine that 
our brains, being composed of a large number of cells, can take on only a finite, 
although immensely large, number of configurations, or states. We feel sure, how- 
ever, that if someone presented us with an arbitrarily long string of 0s followed by 
an arbitrarily long string of 15, we could detect whether the number of 05 and 15 
was the same. Let’s think of some techniques we might use. 

For small strings of 05 and 15, we could just look at the strings and decide. 
Thus, we can tell without great effort that 000111 є S and that 00011 є S. How- 
ever, for the string 


0000000000000001111111111111111 


we must devise another procedure, probably resorting to counting. We would 
count the number of 05 received, and when we got to the first 1, we would write 
the number of 0s down (or remember it) for future reference; then we would begin 
counting 15. (This process is what we did mentally for smaller strings.) 

However, we have now made use of some extra memory, because when we 
finished counting 15, we would have to retrieve the number representing the total 
number of 0s to make a comparison. But such information retrieval is what the 
finite-state machine cannot do; its only capacity for remembering input is to have 
a given input symbol send it to a particular state. We have already seen that no 
finite-state machine can “remember” 0" for arbitrarily large n because it runs out 
of distinct states. In fact, if we try to solve this problem on a real computer, we 
encounter the same difficulty. If we set a counter as we read in 0s, we might get an 
overflow because our counter can go only so high. To process 0"1" for arbitrarily 
large n requires an unlimited auxiliary memory for storing the value of our coun- 
ter, which in practice cannot exist. 

Another way we humans might attack the problem of recognizing S is to wait 
until the entire string has been presented. Then we would go to one end of the 
string and cross out a 0, go to the other end and cross out a 1, and then continue 
this back-and-forth operation until we ran out of 0s or 15s. The string belongs to S 
if and only if we run out of both at the same time. Although this approach sounds 
rather different from the first one, it still requires remembering each of the inputs, 
since we must go back and read them once the string is complete. The finite-state 
machine, of course, cannot reread input. 

We have come up with two computational procedures—algorithms—to de- 
cide, given a string of 0s and 15, whether that string belongs to S$ = (0"1"|n = 0}. 
Both require some form of additional memory unavailable in a finite-state 
machine. Evidently, the finite-state machine is not a model of the most general 
form of computational procedure. 


Definition 


To simulate more general computational procedures than the finite-state 
machine can handle, we use a Turing machine, proposed by the British 
mathematician Alan M. Turing in 1936. A Turing machine is essentially 
a finite-state machine with the added ability to reread its input and also to 
erase and write over its input. It also has unlimited auxiliary memory. Thus, 
the Turing machine overcomes the deficiencies we noted in finite-state ma- 
chines. Unlimited auxiliary memory makes the Turing machine a hypothetical 
"machine"—a model—not a real device. 
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A Turing machine consists of a finite-state machine and an unlimited tape 
divided into cells, each cell containing at most one symbol from an allowable 
finite alphabet. At any one instant, only a finite number of cells on the tape are 
nonblank. We use the special symbol b to denote a blank cell. The finite-state 
unit, through its read—write head, reads one cell of the tape at any given moment 
(Figure 9.15). 


Finite-state unit 


Figure 9.15 


By the next clock pulse, depending on the present state of the unit and the 
symbol read, the unit either does nothing (halts) or completes three actions: 


1. Print a symbol from the alphabet on the cell read (it might be the same 
symbol that’s already there). 

2. Go to the next state (it might be the same state as before). 

3. Move the read—write head one cell left or right. 


We can describe the actions of any particular Turing machine by a set of 
quintuples of the form (s, i, i’, s', d), where s and i indicate the present state 
and the tape symbol being read, i’ denotes the symbol printed, s' denotes the 
new state, and d denotes the direction in which the read—write head moves 
(R for right, L for left). 


(s, i, P 5', d) 


Present state Present symbol Symbol printed Next state Direction of move 


Thus, a machine in the configuration illustrated by Figure 9.16a, if acting ac- 
cording to the instructions contained in the quintuple (2, 1, 0, 1, R), would move to 


Figure 9.16 
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the configuration illustrated in Figure 9.16b. The symbol 1 being read on the tape 
has been changed to a 0, the state of the unit has been changed from 2 to 1, and the 
head has moved one cell to the right. 

The term “Turing machine" is used both in the generic sense and also as 
the collection of quintuples that describe the actions of any particular ma- 
chine. This is the same double usage for both the name of the abstraction 
and any instance of that abstraction that we mentioned for Boolean algebra in 
Chapter 8. 


* DEFINITION TURING MACHINE 
Let S be a finite set of states and / a finite set of tape symbols (the tape alphabet) 
including a special symbol 5. A Turing machine is a set of quintuples of the 
form (s, i, i’, s', d) where s, s' € S; i, i' € I; and d € (R, L} and no two 
quintuples begin with the same s and i symbols. 


The restriction that no two quintuples begin with the same s and i symbols 
ensures that the action of the Turing machine is deterministic and completely 
specified by its present state and symbol read. If a Turing machine gets into a con- 
figuration for which its present state and symbol read are not the first two symbols 
of any quintuple, the machine halts. 

Just as in the case of ordinary finite-state machines, we specify a fixed start- 
ing state, denoted by 0, in which the machine begins any computation. We also 
assume an initial configuration for the read—write head, namely, a position over 
the farthest left nonblank symbol on the tape. (If the tape is initially all blank, the 
read—write head can be positioned anywhere to start.) 


| EXAMPLE% | С) A Turing machine is defined by the set of quintuples: 


(00), 00) 1,0), A9) 
СОТА) 
(05 7. 1L. 1. 72) 
(БОЛО I, 22) 
(iL, 1L, 00), 1L. 29) 


The action of this Turing machine when processing a particular initial tape is 
shown by the sequence of configurations in Figure 9.17, which also shows the 
quintuple that applies at each step. Again, which quintuple applies is deter- 
mined by the present state and present symbol; as a result, the order in which 
quintuples are applied has nothing to do with the order in which they are pre- 
sented in the machine's definition, quintuples can be used more than once, or 
may not be used at all. 
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Figure 9.17 


(0, 0, 1, 0, R) 


(0, 1, 0, 0, R) 


(0, 1, 0, 0, R) 


(0, 0, 1, 0, R) 


(0, b, 1, 1, L) 


a, 1, 0, 1, R) 


a, 1, 0, 1, R) 
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Since there are no quintuples defining the action to be taken when in state 1 reading 
b, the machine halts with final tape: 


JERE ШЕР 
ө 


The tape serves as a memory medium for a Turing machine, and in general, 
the machine can reread cells of the tape. Since it can also write on the tape, the 
nonblank portion of the tape can be as long as desired, although there are still only 
a finite number of nonblank cells at any time. Hence the machine has available an 
unbounded, though finite, amount of storage. Because Turing machines overcome 
the limitations of finite-state machines, Turing machines should have consider- 
ably higher capabilities. In fact, a finite-state machine is a very special case of a 
Turing machine, one that always prints the old symbol on the cell read, always 
moves to the right, and always halts on the symbol b. 


PRACTICE 57 | Consider the following Turing machine: 


(0, 0, 0, 1, К) 

(0, 1, 0, 0, R) 

(0, b, Б, 0, А) 

OTIO ER) 

Сый) 
a. What is the final tape, given the initial tape 


TPDEEE 


(Since it is tedious to draw all the little squares, you don’t need to do so; just write down the contents 
of the final tape.) 
b. Describe the behavior of the machine when started on the tape 


TETE 


c. Describe the behavior of the machine when started on the tape 
PEELE 


Parts (b) and (c) of Practice 57 illustrate two ways in which a Turing machine can 
fail to halt: by endlessly cycling or by moving forever along the tape. 


Turing Machines as Set Recognizers 


Although the Turing machine computations we have seen so far are not particularly 
meaningful, we will use the Turing machine to do two kinds of jobs. First, we’ll 


© DEFINITION 
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use it as a recognizer, much as we considered finite-state machines as recognizers 
in the previous section. We can even give a very similar definition, provided we 
first define a final state for a Turing machine. A final state in a Turing machine 
is one that is not the first symbol in any quintuple. Thus, on entering a final state, 
whatever the symbol read, the Turing machine halts. 


TURING MACHINE RECOGNITION (ACCEPTANCE) 

A Turing machine T with tape alphabet / recognizes (accepts) a subset S of /* if 
T, beginning in standard initial configuration on a tape containing a string a of 
tape symbols, halts in a final state if and only a є S. 


Note that our definition of acceptance leaves open two possible behaviors for Т 
when applied to a string o of tape symbols not in S. T may halt in a nonfinal state, 
or T may fail to halt at all. 

We can now build a Turing machine to recognize our old friend 
S = {0"1"|n = 0). The machine is based on our second approach to this rec- 
ognition problem, sweeping back and forth across the input and crossing out 
0—1 pairs. 


We want to build a Turing machine that will recognize 5 = {0"1"|n = 0). We will 
use one additional special symbol, call it X, to mark out (“erase”) the Os and 15 
already examined. Thus the tape alphabet is J = {0, 1, b, X}. State 6 is the only 
final state. The quintuples making up 7 are given below. Each quintuple or group 
of quintuples has a “comment” describing its function. Just like well-written com- 
ments in computer code, this makes the Turing instructions much easier to under- 
stand. Be sure to include comments in any Turing machine definition. 


(0, b, b, 6, R) Recognizes the empty tape, which is in S. 

(@, O26 Л) Erases the leftmost 0 and begins to move right. 

(1, 0, 0, 1, А) 

(1L. 1. 31. Ё) Moves right in state 1 until it reaches the end ofthe binary 
(„ББ 2, 1D) string; then moves left in state 2. 

(1, x, 4, 2, D) 

(@, 1,25 35/5) Erases the rightmost 1 and begins to move left. 
(ОАА) Moves left over 15. 

(3, 0, 0, 4, L) Goes to state 4 if more Os are left. 

CEN Dp IR) Goes to state 5 if no more Os in string. 

(4, 0, 0, 4, L) Moves left over Os. 

(4, X, X, 0, К) Finds left end of binary string and begins sweep again. 
(ОЗЕРО) No тоге 15 in string; machine accepts. 


Reading down the columns in Figure 9.18, we can see the key configurations (skip- 
ping some routine steps) in the machine’s behavior on the tape 


which, of course, it should accept. e 
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b000111b 


bxootitl 


bxool 


bX0011Xb 


bX0011Xb 


bXXO0O11Xb 


Т 
2 
Figure 9.18 


bXXOIXXb 


4 
3 


bXXO1X Xb 
Т 
4 


bXXOIXXb 


4 
0 


bXXX1xXXb 


1 
1 


bXXX1XXb 


f 
2 


bXXXXXXb 


4 
3 


bXXXXXXb 


4 
5 


bXXXXXXb 
Т 
6 


PRACTICE 58 


following input tapes: 


a. 

SDO00000E 
“TEELE 
C. 

ПППППППППЕ 


For the Turing machine of Example 41, describe the final configuration after processing the 


Notice how each state of the Turing machine in Example 41 is designed to 
accomplish a certain task, as indicated by the comments. The job of state 1, for 
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example, is to move right until the end of the binary string is found, marked by 
either a blank or an_X, then turn the computation over to state 2. A change of state 
should occur only when something significant happens. For example, a Turing 
тае machine cannot pass right over an indeterminate number of Is by changing state 
inomaenins Willer at each move because its behavior would then be tied to a specific input tape. On 
in general, not just for the other hand, if the machine needs to count over a certain fixed number of 15, 
special cases. Test using then changing states at each move would accomplish this. 

a variety of input tapes. 


REMINDER 


Give the states of your 
Turing machine big 


PRACTICE 59 | Design a Turing machine to recognize the set of all strings of Os and 15 ending in 00. (This 
set can be described by the regular expression (0 V 1)*00, so you should be able to use a 
Turing machine that changes no tape symbols and always moves to the right.) Be sure to 
include comments. 


PRACTICE 60 | Modify the Turing machine of Example 41 to recognize {0"17"|n = 0}. = 


Turing Machines as Function Computers 


The second job for which we will use the Turing machine is to compute functions. 
Given a particular Turing machine 7 and a string a of tape symbols, we begin T 
in standard initial configuration on a tape containing о. If T eventually halts with 
a string В on the tape, we may consider $ as the value of a function evaluated at 
a. Using function notation, 7(o) = В. The domain of the function T consists of 
all strings a for which Т eventually halts. We can also think of T as computing 
number-theoretic functions, functions from a subset of № into № for any k = 1. 
We will think of a string of 1s of length n + 1 as the unary representation of the 
nonnegative integer п; we'll denote this encoding of n by n. (The extra 1 in the 
encoding enables us to distinguish 0 from a blank tape.) Then a tape containing 
the string лу * n, *...* n, сап be thought of as the representation of the &-tuple 
(пу, No, ... , п) of nonnegative integers. If T begun in the standard initial configu- 
ration on such a tape eventually halts with a final tape that is the representation 
m of a nonnegative integer т, then T has acted as a k-variable function Т“, where 
T(n,, nj, ... , nj) = m. If T begun in standard initial configuration on such a tape 
either fails to halt or halts with the final tape not a representation of a nonnegative 
integer, then the function 7* is undefined at (n,, n, ... , п). There is no need to 
identify final states when using the Turing machine as a function computer. 

There is thus an infinite sequence 7', 7?, ... , T^,... of number-theoretic 
functions computed by Т associated with each Turing machine 7. For each k, the 
function 7* is a partial function on №“, meaning that its domain may be a proper 
subset of №. A special case of a partial function on №“ is a total function on №“, 
where the function is defined for all k-tuples of nonnegative integers. 


| EXAMPLE% | Let a Turing machine T be given by the quintuples 
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If T is begun in standard initial configuration on the tape 


Therefore, T defines a one-variable function T! that maps 2 to 3. In general, T 
maps 7 to n + 1,so T'(n) = n + 1, a total function of one variable. e 


In Example 42, we began with a Turing machine and observed a particular 
function it computed, but we can also begin with a number-theoretic function and 
try to find a Turing machine to compute it. 


* DEFINITION TURING-COMPUTABLE FUNCTION 
A Turing-computable function is a number-theoretic function computed by 
some Turing machine. 


A Turing-computable function f'can in fact be computed by an infinite number of 
Turing machines. Once a machine Tis found to compute f, we can always include 
extraneous quintuples in T, producing other machines that also compute f. 


| EXANPLE4S | We want to find a Turing machine that computes the function f defined as follows: 


a ity — il if п» = 0 
fim, m) = M RE ifn, = 0 


Thus fis a partial function of two variables. Let’s consider the Turing machine 
given by the following quintuples. 


(@, te, tl, ©, £8) | m 
ОЕ) Passes right over n, to n, 
Gl, il, 31, 25. 9) Counts first 1 in m. 

(2, b, b, 3, R) Ny = 0; halts. 

(2, 1, 1, 4, R) 

(4, 1, 1, 4, R) Finds the right end of m. 
(4, b, b, 5, L) 

(G, I, @, 6,25) Erases last 1 in m. 

(6, 1, 1, 6, L) 2 Е 
Орт Passes left to л), erasing *. 
(Hy 1L. By 21, 35) Erases л}. 


(7, b, b, 8, L) n, erased; halts with n; — 1 on tape. 


PRACTICE 61 
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If T is begun on the tape 


then 7 will halt with final configuration 


ip iibi 


| 


8 


This configuration agrees with the requirement that f(1, 3) = 2. If T is begun on 
the tape 


then 7 will halt with final configuration 


i) X i к il 5 @ 


Т 


3 


Because the final tape is not m for any nonnegative integer m, the function com- 
puted by T is undefined at (1, 0)—just as we want. It is easy to see that this Turing 
machine computes fand that fis therefore a Turing-computable function. ө 


Design a Turing machine to compute the function 


nc ifn =2 
1 ifn <2 п 


Ап) = | 


Church-Turing Thesis 


In this chapter we have talked about models of “computation” or of “computa- 
tional procedures." Although we have not defined the term, by a computational 
procedure we mean an algorithm. We have talked about algorithms often in this 
book and have given a number of algorithms for various tasks. Recall that our 
(somewhat intuitive) definition of an algorithm is a set of instructions that can be 
mechanically executed in a finite amount of time in order to solve some problem. 
Given input appropriate to the task, the algorithm must eventually stop (halt) and 
produce the correct answer if an answer exists. (If no answer exists, let us agree 
that the algorithm can either halt and declare that no answer exists, or it can go on 
indefinitely searching for an answer.) 

Now we ask: Is the Turing machine a better model of a computational proce- 
dure than the finite-state machine? We are quite likely to agree that any Turing 
computable function fis a function whose values can be found by a computational 
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procedure or algorithm. In fact, if fis computed by the Turing machine 7} then 
the set of quintuples of 7 is itself the algorithm; as a list of instructions that can 
be carried out mechanically, it satisfies the various properties in our notion of an 
algorithm. Therefore, we are probably willing to accept the proposal illustrated by 
Figure 9.19. The figure shows “computable by algorithm" as a “cloudy,” intuitive 
idea and “Turing computable” as a mathematically precise, well-defined idea. The 
arrow asserts that any Turing-computable function is computable by an algorithm. 

Given the simplicity of the definition of a Turing machine, it is a little startling 
to contemplate Figure 9.20, which asserts that any function computable by any 
means we might consider to be an algorithm is also Turing computable. Combin- 
ing Figures 9.19 and 9.20, we get the Church- Turing thesis (Figure 9.21), named 
after Turing and another well-known mathematician, Alonzo Church. 


Computable by 
algorithm 


Computable by 


Computable by 


algorithm algorithm 


| 
| 
| 
Y 


Turing Turing Turing 
computable computable computable 
Figure 9.19 Figure 9.20 Figure 9.21 


* THESIS CHURCH-TURING THESIS 
A number-theoretic function is computable by an algorithm if and only if it is 
Turing computable. 


Because the Church- Turing thesis equates an intuitive idea with a mathemati- 
cal idea, it can never be formally proved and must remain a thesis, not a theorem. 
What, then, is its justification? 

One piece of evidence is that whenever a procedure generally agreed to be 
an algorithm has been proposed to compute a function, someone has been able to 
design a Turing machine to compute that function. (Of course, there is always the 
nagging thought that someday this might not happen.) 

Another piece of evidence is that other mathematicians, several of them at 
about the same time Turing developed the Turing machine, proposed other models 
of a computational procedure. On the surface, each proposed model seemed quite 
unrelated to any of the others. However, because all the models were formally 
defined, just as Turing computability is, it was possible to determine on a formal, 
mathematical basis whether any of them were equivalent. All the models, as well 
as Turing computability, were proved equivalent; that is, they all defined the same 
class of functions, which suggests that Turing computability embodies everyone's 
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concept of an algorithm. Figure 9.22 illustrates what has been done; here solid 
lines represent mathematical proofs and dashed lines correspond to the Church— 
Turing thesis. The dates indicate when the various models were proposed. 


Computable by 


= : 
E ^ algorithm A - 
7 in: 
ш i 
P i 
"ad У 
go ^ N S 
^ \ \ M 
P \ N 9 
Рай А Y м У 
Turing Church, Kleene Post Markov Smullyan 
1936 Late 1930s Early 1940s 1954 1961 

Figure 9.22 


The Church-Turing thesis is now widely accepted as a working tool by re- 
searchers dealing with computational procedures. If, in a research paper, a method 
is set forth for computing a function and the method intuitively seems to be an 
algorithm, then the Church- Turing thesis is invoked and the function is declared 
to be Turing computable (or one of the names associated with one of the equiva- 
lent formulations of Turing computability). This invocation means that the author 
presumably could, if pressed, produce a Turing machine to compute the function, 
but again, the Church- Turing thesis is so universally accepted that no one bothers 
with these details anymore. 

Although the Church- Turing thesis is stated in terms of number-theoretic 
functions, it can be interpreted more broadly. Any algorithm in which a finite set 
of symbols is manipulated can be translated into a number-theoretic function by 
a suitable encoding of the symbols as nonnegative integers, much as input to a 
real computer is encoded and stored in binary form. Thus, by the Church- Turing 
thesis we can say that if there is an algorithm to do a symbol manipulation task, 
there is a Turing machine to do it. 

By accepting the Church- Turing thesis, we have accepted the Turing machine 
as the ultimate model of a computational procedure. Turing machine capabilities 
exceed those of any actual computer, which, after all, does not have the unlim- 
ited tape storage of a Turing machine. It is remarkable that Turing proposed this 
concept in 1936, well before the advent of the modern computer. 


Decision Problems and Uncomputability 


We have spent quite a bit of time discussing what Turing machines can do. By 
the Church—Turing thesis, they can do a great deal indeed, although not very ef- 
ficiently. It is even more important, however, to consider what Turing machines 
cannot do. Because a Turing machine's abilities to perform tasks exceed those of 
an actual computer, if we find something no Turing machine can do, then a real 
computer cannot do it either. In fact, by invoking the Church- Turing thesis, no 
algorithm exists to do it, and the task is not computable. The type of task we have 
in mind here is generally that of determining the truth value of each of a number 
of related statements. 
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T DEFINITION DECISION PROBLEM 
| A decision problem asks if an algorithm exists to decide whether individual 


statements from some large class of statements are true. 


The solution to a decision problem answers the question of whether an 
algorithm exists. A positive solution consists of proving that an algorithm ex- 
ists, and it is generally given by actually producing an algorithm that works. A 
negative solution consists of proving that no algorithm exists. Note that this 
statement is much stronger than simply saying that a lot of people have tried 
but no one has come up with an algorithm—this might simply mean that the 
algorithm is hard. It must be shown that it is impossible for anyone ever to come 
up with an algorithm. When a negative solution to a decision problem is found, 
the problem is said to be unsolvable, uncomputable, or undecidable. This ter- 
minology can be confusing because the decision problem itself—the question 
of whether an algorithm exists to do a task—has been solved; what must forever 
be unsolvable is the task itself. 


Examples of Decision Problems 
We will look at some decision problems that have been answered. 


ЭХХ ЫЗ Does an algorithm exist to decide, given integers a, b, and c, whether a? = b? + c?? 
Clearly, this question is a solvable decision problem. The algorithm consists of 
multiplying b by itself, multiplying с by itself, adding the two results, and comparing 
the sum with the result of multiplying a by itself. e 


Obviously, Example 44 is a rather trivial decision problem. Historically, much 
of mathematics has concerned itself at least indirectly with finding positive solu- 
tions to decision problems, that is, producing algorithms. Negative solutions to 
decision problems arose only in the twentieth century. 


One of the earliest decision problems to be formulated was Hilbert's tenth prob- 
lem, tenth in a list of problems David Hilbert posed to the International Congress 
of Mathematicians in 1900. The problem is: Does an algorithm exist to decide for 
any polynomial equation P(x;, x», ... , x,) = 0 with integral coefficients whether 
it has integral solutions? For polynomial equations of the form ax + by + c = 0, 
where a, b, and c are integers, it is known that integer solutions exist if and only 
if the greatest common divisor of a and 5 also divides c. Thus, for particular sub- 
classes of polynomial equations, there might be algorithms to decide whether in- 
teger solutions exist, but the decision problem as stated applies to the whole class 
of polynomial equations with integer coefficients. When this problem was posed 
and for some time after, the general belief was that surely an algorithm existed and 
the fact that no one had found such an algorithm merely implied that it must be 
difficult. In the mid-1930s, a startling result by Kurt Gódel, described in the next 
example, began to cast doubt on this view. It was not until 1970, however, that this 
problem was finally shown to be unsolvable. ө 


EXAMPLE 46 
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The decision problem for propositional wffs asks whether an algorithm exists to 
decide whether any given propositional wff is a tautology. This is a solvable de- 
cision problem; the solution algorithm consists of constructing and examining the 
truth table for the wff. The decision problem for predicate wffs asks whether an 
algorithm exists to decide the validity of any given predicate wff. This is an undecid- 
able problem; such an algorithm does not exist, which is exactly why we resorted to 
the formal derivation rules of predicate logic to help establish validity for any given 
wff. Because of the completeness and correctness of predicate logic, a wff is valid 
if and only if we can produce a proof sequence for it. However, this has only shifted 
one decision problem to an equivalent one—there is no algorithm to decide whether 
a proof sequence exists for any given predicate wff, much less a mechanical way to 
know what steps to use to produce a proof sequence if one does exist. 

An alternative formulation to the logic systems we discussed in Chapter 1 
is to identify certain strings of symbols as axioms and to give rules of inference 
whereby a new string can be obtained from old strings. Any string that is the last 
one in a finite list of strings consisting of either axioms or strings obtainable by 
the rules of inference from earlier strings in the list is said to be a theorem. The 
decision problem for such a formal theory is: Does an algorithm exist to decide 
whether a given string in the formal theory is a theorem of the theory? 

The work of Church and the famous twentieth-century logician Kurt Gédel 
showed that any formal theory that axiomatizes properties of arithmetic (making 
commutativity of addition an axiom, for example) and is not completely trivial (not 
everything is a theorem) is undecidable. Their work can be considered good news 
for working mathematicians because it means that ingenuity in answering questions 
in number theory will never be replaced by a mechanical procedure. ө 


A particular Turing machine Т begun on а tape containing a string о will either 
eventually halt or never halt. The halting problem for Turing machines is a deci- 
sion problem: Does an algorithm exist to decide, given a Turing machine 7 апа 
string a, whether T begun on a tape containing o will eventually halt? Turing 
proved the unsolvability of the halting problem in the late 1930s. ө 


Halting Problem 

We will prove the unsolvability of the halting problem after two observations. 
First, it might occur to us that “run T on o" would constitute an algorithm to see 
whether T halts on a. If within 25 steps of T’s computation T has halted, then 
we know T halts on a. But if within 25,000 steps T has not halted, what can we 
conclude? 7 may still eventually halt. How long should we wait? This so-called 
algorithm will not always give us the answer to our question. 

A second observation is that the halting problem asks for one algorithm to be 
applied to a large class of statements. The halting problem asks, Does an algorithm 
exist to decide, for any given (T, а) pair, whether T halts when begun on a tape 
containing o? The algorithm comes first, and that single algorithm has to give the 
correct answer for all (T, a) pairs. In the notation of predicate logic, the halting 
problem asks about the truth value of a statement in the form 


(3 algorithm)(V(T, а))( ...) 


© DEFINITION 


© THEOREM 
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Consider the following statement, which seems very similar: Given a par- 
ticular (T, o) pair, does an algorithm exist to decide whether T halts when begun 
on a tape containing a? Here, the (T, œ) pair comes first and an algorithm is cho- 
sen based on the particular (T, a); for a different (T, œ), there can be a different 
algorithm. The statement has become (V(T, o))(3 algorithm)(...). This problem 
is solvable. Suppose someone gives us a (T, а). Two algorithms are (1) “say yes" 
and (2) “say no." Since Г acting on а either does or does not halt, one of these 
two algorithms correctly answers the question. This solution may seem trivial 
or even sneaky, but consider again the problem statement: Given a particular 
(T, а) pair, does an algorithm exist to decide, and so forth. Such an algorithm 
does exist; it is either to say yes or to say no—we are not required to choose 
which one is correct! 

This turnabout of words changes the unsolvable halting problem into a trivi- 
ally solvable problem. It also points out the character ofa decision problem, asking 
whether a single algorithm exists to solve a large class of problems. An unsolvable 
problem has both a good side and a bad side. That no algorithm exists to solve a 
large class of problems guarantees jobs for creative thinkers who cannot be re- 
placed by Turing machines. But that the class of problems considered is so large 
might make the result too general to be of interest. 

We will state the halting problem again and then prove its unsolvability. 


HALTING PROBLEM 
The halting problem asks: Does an algorithm exist to decide, given any Turing ma- 
chine Гапа string a, whether T begun on a tape containing о will eventually halt? 


ON THE HALTING PROBLEM 
The halting problem is unsolvable. 


Proof. We want to prove that something does not exist, a situation made to order 
for proof by contradiction. Therefore we assume that the halting problem is solv- 
able and that a single algorithm exists that can act on any (T, œ) pair as input and 
eventually decide whether 7 running оп о halts. We are asking this algorithm to 
solve a task of symbol manipulation, since we can imagine the set of quintuples 
of T encoded as some unique string sr of symbols; we'll use (sr, œ) to denote the 
string sy concatenated with the string a. The task then becomes transforming the 
string (sr, а) into a string representing a yes (the Turing machine with descrip- 
tion s; halts when begun on a tape containing а) or a no (the Turing machine 
with description sy does not halt when begun on a tape containing o). Ву the 
Church-Turing thesis, because we have assumed the existence of an algorithm 
that performs this task, we can assume the existence of a single Turing machine 
X that performs this task. Thus X acts on a tape containing (sr, a) for any T and 
а and eventually halts, at the same time telling us whether Т on о halts. To be 
definite, suppose that X begun on (sy, a) halts with a 1 left on the tape if and only 
if T begun on a halts, and X begun on (sy, œ) halts with a 0 left on the tape if and 
only if T begun on a fails to halt; these are the only two possibilities. Figure 9.23 
illustrates Turing machine X. 
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Input Output 


Halts with 0 on tape 
exactly when Т never 


> halts ona 
(s,, a) 


> Halts with 1 on tape 
exactly when T 


eventually halts on a 


Turing machine X 


Figure 9.23 


Now we add to_X’s quintuples to create a new machine Y. Machine Y modifies 
X’s behavior so that whenever X halts with a 1 on its tape, Y goes to a state that 
moves Y endlessly to the right so that it never halts. If X halts with a 0 on its tape, 
so does Y. Figure 9.24 illustrates the behavior of Y. 


Input Output 


Halts with 0 on tape 
exactly when T never 
> halts on o 


(sp a) ——>| 


> Never halts 
exactly when T 


eventually halts on a 


Turing machine Y 


Figure 9.24 


Finally, we modify Y to get a new machine Z that acts on any input В by first 
copying В (see, for example, Exercise 16) and then turning the computation over 
to Y so that Y acts on (B, B). What happens if we run Z on its own description, sz? 
This situation is shown in Figure 9.25. 


Input Output 


Halts with 0 on tape 
exactly when Z never 


> halts on s; 


$,—($,5) — 


* Never halts 
exactly when Z 
\ ) eventually halts on s, 


i 


Turing machine Z 


Figure 9.25 
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By the way Z is constructed, if Z acting on sz halts, it is because Y acting on 
(sz, Sz) halts, and that happens because X acting on (sz, sz) halts with a 0 on the 
tape; but if this happens, it implies that Z begun on sz fails to halt! Therefore, 


Z on sz halts > Z on sz fails to halt (1) 


This implication is very strange; let's see what happens if Z on s; does not 
halt. By the way Z is constructed, if Z acting on sz does not halt, neither does Y act- 
ing on (sz, 5). Now Y acting on (sz, sz) fails to halt exactly when X acting on (sz, 5) 
halts with a 1 on the tape; but this result implies that Z begun on sz halts! Therefore 


Z on =; fails to halt > Z on szhalts (2) 


Together, implications (1) and (2) provide an airtight contradiction, so our assump- 
tion that the halting problem is solvable is incorrect. End of Proof. 

The proof of the unsolvability of the halting problem depends on two ideas. 
One is that of encoding a Turing machine into a string description, and the other 
is that of having a machine look at and act on its own description. Notice also that 
neither (1) nor (2) alone in the proof is sufficient to prove the result. Both are need- 
ed to contradict the original assumption of the solvability of the halting problem. 

We have previously encountered another proof of this nature, where the 
observation that makes the proof work is self-contradictory. You might want to 
review here the proof of Cantor's theorem in Chapter 5. 


Computational Complexity 


As a model of computation, the Turing machine has provided us with a way to 
prove the existence of unsolvable (uncomputable) problems. Not only does the 
Turing machine help us find the limits of computability, but it can also help us 
classify problems that are computable—that have an algorithm for their solu- 
tion—by the amount of work required to carry out the algorithm. 

Finding the amount of work required to carry out an algorithm sounds like anal- 
ysis of algorithms. We have analyzed a number of real algorithms in this book and 
classified them as O(log n), O(n), O(n’), or what have you. By the Church-Turing 
thesis, any algorithm can be expressed in Turing machine form. In this form, the 
amount of work is the number of Turing machine steps (one per clock pulse) required 
before the Turing machine halts. (We assume here that we are considering only tasks 
that “have answers" so that the Turing machine halts on all appropriate input.) 

Turing machine computations are quite inefficient. Therefore if algorithms А 
and A' both solve the same problem, but А is expressed as a description of a Turing 
machine and J’ as pseudocode for instructions in a high-level programming lan- 
guage, then comparing the number of operations each algorithm performs is rather 
meaningless. Therefore we will assume that all algorithms are expressed in Turing 
machine form so that we can readily compare the efficiency of different algorithms. 

Rather than discuss whether a Turing machine algorithm is O(n) or O(n’), 
let us simply note whether it is a polynomial-time algorithm. (Only quite trivial 
algorithms can be better than polynomial time, because it takes a Turing ma- 
chine и steps just to examine its tape.) Problems for which no polynomial-time 
algorithms exist are called intractable. Such problems may be solvable, but only 
by inefficient algorithms. 


© DEFINITION 


Ф DEFINITION 
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Р 
P is the collection of all sets recognizable by Turing machines in polynomial time. 


Consideration of set recognition in our definition of P is not as restrictive as it 
may seem. Because the Turing machine halts on all appropriate input, it actually 
decides, by halting in a final or nonfinal state, whether the initial string was or was 
not a member of the set. Many problems can be posed as set decision problems by 
suitably encoding the objects involved in the problem. 

For example, consider the Hamiltonian circuit problem (Section 7.2) of 
whether a graph has a cycle that uses every node of the graph. We may define 
some encoding process to represent any graph as a string of symbols. Strings that 
are the representations of graphs become appropriate input, and we want to de- 
cide, given such a string, whether it belongs to the set of strings whose associated 
graphs have Hamiltonian circuits. If we can build a Turing machine to make this 
decision in polynomial time, then the Hamiltonian circuit problem belongs to P. 

We noted in Section 6.2 that the Hamiltonian circuit problem is solvable by the 
brute-force approach of tracing all possible paths, but this is an exponential solution 
because of the number of paths. We said that there is no known efficient (polynomial) 
algorithm to solve the Hamiltonian circuit problem, so we have no proof that the 
Hamiltonian circuit problem belongs to P. But there is also no proof that the Hamil- 
tonian circuit problem does not belong to P. Might a clever, efficient algorithm some- 
day be found? To see why this is unlikely, we'll consider a new kind of Turing machine. 

Ordinary Turing machines act deterministically, due to our restriction that no 
two quintuples begin with the same present state/present symbol pair. A relaxation 
of this requirement results in a nondeterministic Turing machine, which may 
have a choice of actions at any step. А nondeterministic Turing machine recogniz- 
es a string on its tape if some sequence of actions leads to halting in a final state. 


NP 
NP is the collection of all sets recognizable by nondeterministic Turing machines 
in polynomial time. (NP comes from nondeterministic polynomial time.) 


Although a set in P requires that a deterministic Turing machine be able to 
make a decision (in polynomial time) about whether some string on its tape does 
or does not belong to the set, a set in NP requires only that a nondeterministic 
Turing machine be able to verify (in polynomial time) by a fortuitous choice of ac- 
tions that an input string is in the set. Given a graph that has a Hamiltonian circuit, 
for example, this fact can be confirmed in polynomial time by a nondeterministic 
Turing machine that picks the correct path, so the Hamiltonian circuit problem 
belongs to NP. Another way to think about a nondeterministic Turing machine is 
to imagine “parallel processing." At every clock pulse where there is a choice of 
action, new versions of the machine are created, one for each possible action. For 
the Hamiltonian circuit problem, there may be an exponential number of versions 
created, but the one version tracing the Hamiltonian circuit can complete its path 
through the graph in polynomial time. 

Ifa Turing machine can decide in polynomial time whether an arbitrary string 
belongs to a set, it can surely use the same process to verify a member of the set 
in polynomial time. Therefore P € NP. However, it is not known whether this 
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inclusion is proper, that is, whether P C NP so that there could be NP problems— 
including perhaps the Hamiltonian circuit problem—that are intractable. 

The Hamiltonian circuit problem belongs to a third class of problems known 
as NP-complete problems, meaning that not only are they in NP, but if a poly- 
nomial-time decision algorithm were ever found for any one of them, that is, if 
any of them were ever found to be in Р, then indeed we would have P = NP. 
A large number of problems from many different fields have been found to be 
NP-complete since this idea was formulated in 1971. 


I4 INPS The problem of deciding, for an arbitrary propositional wff, whether it is а tau- 
tology is NP-complete. No efficient algorithm has been found for its solution. A 
brute-force algorithm would explore each of the possible truth assignments to the 
statement letters. Like the Hamiltonian circuit problem, we see the exponential 
(nonpolynomial) nature of this inefficient approach—a wff with n statement letters 
has a truth table with 2" rows. A related problem called the satisfiability problem 
asks for a decision as to whether there exists any truth assignment that can “sat- 
isfy” the wff—make it true. This is also an NP-complete problem, and in fact was 
the first problem discovered to be NP-complete. Its brute-force solution algorithm 
also relies on testing all truth value assignments. 

The graph-coloring problem (given an arbitrary graph and a positive integer k, 
color the nodes of the graph using k colors so that all adjacent nodes are different 
colors) is NP-complete. Again, no efficient algorithm has been found; the brute- 
force approach says to assign colors to nodes so that adjacent nodes are different 
colors, and if you run into a place where this becomes impossible, backtrack and 
modify your color assignments. Basically, this says to try all possible color assign- 
ments (similar to trying all truth assignments, but with А" possibilities). 

The general Sudoku puzzle consists of an и? X п” grid made up of n X n 
blocks where each row, each column, and each block must contain exactly one 
of the digits 1 through n”. (See Exercise 53 in Section 2.4 for an example of the 
popular п = 3 version.) This is an NP-complete problem. No efficient solution 
algorithm is known; a brute-force solution tries all possible number assignments. 

Once again, no polynomial-time decision algorithm has been found for any 
of these or the many other NP-complete problems, and if an efficient procedure 
could be found to solve any one of them, such a procedure would exist for all 
other problems in NP. Therefore it is now suspected that P C NP and that all these 
problems are intractable, but to prove this remains a tantalizing goal in computer 
science research. e 


SECTION 9.4 REVIEW 


TECHNIQUES MAIN IDEAS 

* Describe the action of a given Turing machine опа * Turing machines have a deterministic mode of 
given initial tape. operation, the ability to reread and rewrite input, 

(2) Construct a Turing machine to recognize a given set. and an unbounded auxiliary memory. 


( Construct a Turing machine to compute a given ° A finite-state machine is a special case of a Turing 
number-theoretic function. machine. 


| Section 9.4 Turing Machines 


779 


* Turing machines can be used as set recognizers and 


as function computers. 


* The Church-Turing thesis equates a function 
computable by an algorithm with a Turing- 
computable function. Because this thesis expresses 
a relationship between an intuitive idea and a 


* A decision problem asks if an algorithm exists to 


decide whether individual statements from a large 


formally defined one, it can never be proved but 


has nonetheless been widely accepted. 


EXERCISES 9.4 


For Exercises 3—26, be sure to include comments with any Turing machine definition. 


1. Consider the Turing machine 


(0, 0, 0, 0, L) 
(0, 1, 0, 1, R) 
(0, b, b, 0, L) 
(1, 0, 0, 1, А) 
(1,1, 0, 1, А) 


a. What is its behavior when started on the tape 


ү е 


b. What is its behavior when started on the tape 


2. Consider the Turing machine 


ТРЕ 


(0, 1, 1,0, А) 
(0, 0, 0, 1, А) 
(1, 1, 1; 1 R) 
(1, 5, 1, 2, L) 
(2,1, 1, 2, L) 
(2, 0, 0, 2, L) 
(2, b, 1, 0, R) 


a. What is its behavior when started on the tape 


[elele] 


b. What is its behavior when started on the tape 


PTEE 


class of statements are true; if no algorithm exists, 
the decision problem is unsolvable. 

* The halting problem is unsolvable. 

e P C NP, but it is unknown whether P C NP. 


3. Find a Turing machine that recognizes the set of all unary strings consisting of an even number of 15 (this 


includes the empty string). 


780 Modeling Arithmetic, Computation, and Languages 


. Find a Turing machine that recognizes the set of all strings of 0s and 15 containing at least one 1. 
. Find a Turing machine that recognizes 0*10*1. 
. Find a Turing machine to accept the set of nonempty strings of well-balanced parentheses. (Note that 


(C)(€))) is well balanced and (( )(( )) is not.) 


7. Find a Turing machine that recognizes {07"1"27"|n = 0}. 


15. 


16. 


17. 


. Find a Turing machine that recognizes {w * w^|w є (0, 1}* and w is the reverse of the string w}. 
. Find a Turing machine that recognizes {уу * w;|w;, w, € (0, 1}* and м; # wo}. 


. Find a Turing machine that recognizes the set of palindromes on (0, 1}*, that is, the set of all strings in 


10, 1}* that read the same forward and backward, such as 101. 


. Find a bit-inverter Turing machine that replaces every 0 in a string of 0s and 1s with a 1 and every 1 with a 0. 
. Find a Turing machine that changes a unary string to a string of the same length with alternating 1s and Os. 
. Find a nonhalting Turing machine that begins with a single 1 on its tape and successively generates strings 


of the form 0"10", n = 1, that is, such strings appear every so often on the tape. 


. Find a Turing machine that, given an initial tape containing a (possibly empty) string of 15, adds a single 


0 to the left end of the string if the number of 15s is even and adds two Os to the left end of the string if the 
number of 15 is odd. 


Find a Turing machine that converts a string of 0s and 15s representing a nonzero binary number into a 
string of that number of 15. As an example, the machine should, when started on a tape containing 


SD0000E 


halt on a tape containing 


Find a Turing machine that, given an initial tape containing a nonempty string of 15, marks the right end 
of the string with a * and puts a copy of the string to the right of the *. As an example, the machine should, 


when started on a tape containing 
ИПППППЕ 
IDBBHRHHHHDE 


What number-theoretic function of three variables is computed by the following Turing machine? 


halt on a tape containing 


(0, 1, b, 0, А) 
(0, *, b, 1, А) 
(1, 1, 1, 2, R) 
(2, *, *, 3, R) 
(3.1.1.252) 
(2, 1, 1, 4, R) 
(4, 1, 1, 4, R) 
(4, *, 1, 5, А) 
(5, 1, 5, 5, R) 
(5, b, b, 6, R) 
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18. What number-theoretic function of one variable is computed by the following Turing machine? 


(0, 1, 1, 1, А) 
(1, b, b, 9, К) 
(1, 1, 1, 2, А) 
(2, b, b, 3, L) 
(3, 1, 5, 9, L) 
(2, 1, 1, 4, R) 
(4, b, b, 5, L) 
(5, 1, 5, 3, L) 
(4, 1, 1, 6, L) 
(6, 1, 1, 6, L) 
(6, b, 1, 7, L) 
(7, b, 1, 8, L) 
(8, b, 1, 9, L) 


19. Find a Turing machine to compute the function 


_ fl ifn-0 
Л) =, itn £0 


20. Find a Turing machine to compute the function 


п ifn is even 
n+ 1 ifnisodd 


fn) = | 

21. Find a Turing machine to compute the function 
f(n) = 2n 

22. Find a Turing machine to compute the function 


n/3 if 3 divides n 
f(n) = ; 
undefined otherwise 
23. Find a Turing machine to compute the function 
fn, т) = т tnm 


24. Find a Turing machine to compute the function 


n, ifn is even 


fni, m) = | 


п; ifn, is odd 
25. Find a Turing machine to compute the function 


m Ny if лп, Em Ny 


fn, m) = || 


otherwise 
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26. Find a Turing machine to compute the function 


ті, т) = шах(л\, n) 


27. Do Exercise 21 again, this time making use of the machines T, and T, of Exercises 16 and 23, respective- 
ly, as "functions." (Formally, the states of these machines would have to be renumbered as the quintuples 
are inserted into the “main program," but you may omit this tiresome detail and merely “invoke Т,” or 


"invoke 75.") 


28. Describe verbally the actions of a Turing machine that computes the function /(л\, т) = л; n», that 
is, design the algorithm but do not bother to create all the necessary quintuples. You may make use of 


Exercises 16 and 23. 


SECTION 9.5 


© DEFINITIONS 


FORMAL LANGUAGES 


Suppose we come upon the English language sentence, “The walrus talks loudly.” 
Although we might be surprised at the meaning, or semantics, of the sentence, 
we accept its form, or syntax, as valid in the language, meaning that the various 
parts of speech (noun, verb, and so on) are strung together in a reasonable way. In 
contrast, we reject, “Loudly walrus the talks,” as an illegal combination of parts 
of speech, or as syntactically incorrect and not part of the language. We must 
also worry about correct syntax in programming languages, but in these, unlike 
natural languages (English, French, and so on), legal combinations of symbols are 
specified in detail. Let’s give a formal definition of /anguage; the definition will 
be general enough to include both natural languages and programming languages. 


ALPHABET, VOCABULARY, WORD, LANGUAGE 

An alphabet or vocabulary V is a finite, nonempty set of symbols. A word over 
V is a finite-length string of symbols from V. The set V* is the set of all words 
over V. (See Example 6 in Chapter 3 for a recursive definition of V*.) A language 
over V is any subset of V*. 


Viewing syntactically correct English language as a subset L of the set of all 
strings over the usual alphabet, we feel that, “Тһе walrus talks loudly,” belongs to 
L while, *Loudly walrus the talks,” does not. 

For any given language L, how can we describe L, that is, specify exactly 
those words belonging to L? If L is finite, we can just list its members, but if L is 
infinite, can we find a finite description of L? Not always—there are many more 
languages than possible finite descriptions. Although we will consider only lan- 
guages that can be finitely described, we can still think of two possibilities. We 
may be able to describe an algorithm to decide membership in L; that is, given 
any word in V*, we could apply our algorithm and receive a yes or no answer as 
to whether the word belongs to L. Or we may be able to describe a procedure al- 
lowing us only to generate members of L, that is, crank out one at a time a list of 
all the members of L. We will settle for languages for which this second option is 
possible and describe such a language by defining its generative process, or giving 
a grammar for the language. 


© DEFINITION 
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Before we give a formal definition of what constitutes a grammar, let’s look 
again at why, “The walrus talks loudly," seems to be an acceptable sentence by 
seeing how it could be generated. Starting from the notion of sentence, we would 
agree that one legitimate form for a sentence is a noun phrase followed by a verb 
phrase. Symbolically, 


sentence — noun-phrase verb-phrase 
A legitimate form of noun phrase is an article followed by a noun, 
noun-phrase — article noun 
and a legitimate form of verb phrase is a verb followed by an adverb, 
verb-phrase — verb adverb 
We would also agree with the substitutions 


article — the 
noun — walrus 
verb — talks 

adverb — loudly 


Thus we can generate the sentence, “The walrus talks loudly,” by making succes- 
sive substitutions: 


sentence => noun-phrase verb-phrase 
= article noun verb-phrase 
=> the noun verb-phrase 
= the walrus verb-phrase 
= the walrus verb adverb 
= the walrus talks adverb 
= the walrus talks loudly 


The foregoing boldface terms are those for which further substitutions can be 
made. The nonboldface terms stop or terminate the substitution process. These 
ideas are incorporated in the next definition. 


PHRASE-STRUCTURE (TYPE 0) GRAMMAR 
A phrase-structure grammar (type 0 grammar) С is a 4-tuple, С = (V, V7, S, P), 
where 
V = vocabulary 
Vr = nonempty subset of V called the set of terminals 
S = element of V — V; called the start symbol 
P = finite set of productions of the form a — В where a is a word over V 
containing at least one nonterminal symbol and В is a word over V 


EXAMPLE 49 


© DEFINITION 
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Here is a very simple grammar: С = (V, Vr, 5, P) where V = {0,1,5}, 
Vr = 10, 1}, and P = {S > 05,5 1). e 


The productions of a grammar allow us to transform some words over V into 
others; the productions can be called rewriting rules. 


GENERATIONS (DERIVATIONS) IN A LANGUAGE 

Let С be a grammar, С = (V, Vr, S, P), and let у and w, be words over V. 
Then у; directly generates (directly derives) w;, written ул => у», if a — B 
is a production of G, w, contains an instance of о, and w, is obtained from у! 
by replacing that instance of o with B. If у, w2, ... , у, are words over V and 
Wi F Wa, W2 => Wa, ... , Wa-1 P4 w,, then w; generates (derives) w,, written 
W; => w,. (By convention, у = w.) 


PRACTICE 62 | Show that in the grammar of Example 49, 0S = 00001. 


In the grammar of Example 49, 005 = 0005 because the production S — 0S has 
been used to replace the 5 in 00S with 0S. Also 005 = 000005. ө 


© DEFINITION 


LANGUAGE GENERATED BY A GRAMMAR 
Given a grammar G, the language L generated by G, sometimes denoted L(G), 
is the set 


L={wev;|S>w} 


In other words, L is the set of all strings of terminals generated from the start 
symbol. 


REMINDER 


Productions may be used 
in any order. The only 
requirement is that the 
left side of the production 
must appear in the string 
you are processing. 


Notice that once a string w of terminals has been obtained, no productions can 
be applied to w, and w cannot generate any other words. 

The following procedure generates a list of the members of L: Begin with 
the start symbol $ and systematically apply some sequence of productions until 
a string w, of terminals has been o btained; then w, € L. Go back to S and re- 
peat this procedure using a different sequence of productions to generate another 
word w, € L, and so forth. Actually, this procedure doesn’t quite work because we 
might start on an infinite sequence of direct derivations that never leads to a string 
of terminals and thus never contributes a word to our list. Instead, we need to run 
a number of derivations from S simultaneously (parallel processing), checking on 
each one after each step and adding the final word to the list of members of L for 
any that terminate. That way we cannot get stuck waiting indefinitely while un- 
able to do anything else. 
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PRACTICE 63 | Describe the language generated by the grammar G of Example 49. 


Languages derived from grammars such as we have defined are called formal 
languages. If the grammar is defined first, the language will follow as an outcome 
of the definition. Alternatively, the language, as a well-defined set of strings, may 
be given first, and we then seek a grammar that generates it. 


Let L be the set of all nonempty strings consisting of an even number of 15. Then 
І is generated by the grammar С = (V, Vr, S, P) where V = {1, S}, Ут = {1}, 
and P = {S > SS, S 11}. A language can be generated by more than one 
grammar. L is also generated by the grammar С’ = (V', Vr, S', P) where 
p spo hana = S ПП. SINUS ө 


PRACTICE 64 


a. Find a grammar that generates the language L = (0"10"| n = 0}. 
b. Find a grammar that generates the language L = (0"10"| л = 1}. и 


Trying to describe concisely the language generated by a given grammar and 
defining a grammar to generate a given language can both be quite difficult tasks. 
We'll look at another example where the grammar is a bit more complicated than 
any we've seen so far. Don't worry about how you might think up this grammar; 
just convince yourself that it works. 


Let L = (a"b"c"|n = 1}. A grammar generating L is С = (V, Vr, S, P) where 
V = {a, b, c, S, B, C}, Vr = (a, b, с}, and P consists of the following productions: 


09096 3. (CIR => BC 5. bB — bb I. CC — е 
DESC. 4. aB — ab @ (XC — (е 


It is fairly easy to see how to generate any particular member of L using these pro- 
ductions. Thus, a derivation of the string a7b’c’ is 


S — aSBC 
— aaBCBC 
— aaBBCC 
— aabBCC 
— aabbCC 
— aabbcC 
= aabbcc 


In general, L C L(G) where the outline of a derivation for any a"b"c" is given be- 
low; the numbers refer to the productions used. 


REMINDER 


“Throwing in” produc- 
tions to get the language 
L you want often means 
that you generate more 
than L. The productions 
must generate exactly the 
strings in L. 
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S= a" (ВС)! 
= a"(BC)" 
= a"B"C” 
Sd E 
SE 
= ар се 


ж 
== npyn,.n 
7 @ (e 


We must also show that L(G) C L, which involves arguing that some productions 
must be used before others and that the general derivation shown above is the only 
sort that will lead to a string of terminals. ө 


In trying to invent a grammar to generate the L of Example 52, we might 
first try to use productions of the form B — b and C > c instead of productions 
4 through 7. Then we would indeed have L C L(G), but L(G) would also include 
words such as a"(bc)". 

Formal languages were developed in the 1950s by linguist Noam Chomsky 
in an attempt to model natural languages, such as English, with an eye toward 
automatic translation. However, since a natural language already exists and is 
quite complex, defining a formal grammar to generate a natural language is very 
difficult. Attempts to do this for English have been only partially successful. 


We can describe a formal grammar that will generate a very restricted class of 
English sentences. The terminals in the grammar аге the words “the,” “a,” “river,” 
“walrus,” “talks,” “flows,” “loudly,” and “swiftly,” and the nonterminals are the 
words sentence, noun-phrase, verb-phrase, article, noun, verb, and adverb. 


The start symbol is sentence and the productions are 


sentence — noun-phrase verb-phrase 
noun-phrase — article noun 
verb-phrase — verb adverb 
article — the 

article > a 

noun — river 

noun — walrus 

verb — talks 

verb — flows 

adverb — loudly 

adverb — swiftly 


We know how to derive, “The walrus talks loudly,” in this grammar. Here is a 
derivation of, “A river flows swiftly”: 
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sentence => noun-phrase verb-phrase 
= article noun verb-phrase 
= a noun verb-phrase 
= ariver verb-phrase 
= ariver verb adverb 
= ariver flows adverb 
= ariver flows swiftly 


A few other sentences making various degrees of sense, such as, “a walrus 
flows loudly,” are also part of the language defined by this grammar. The difficulty 
of specifying a grammar for English as a whole becomes more apparent when we 
consider that a phrase such as, “time flies,” can be an instance of either a noun fol- 
lowed by a verb or of a verb followed by a noun. This phrase is “ambiguous” (see 
Exercises 35 and 36 in this section). ө 


Programming languages are less complex than natural languages, and their 
syntax often can be described successfully using formal language notation. 


| EXAMPLE 54 | A section of formal grammar to generate identifiers in some programming lan- 


guage could be presented as follows: 


identifier — letter 
identifier — identifier letter 
identifier — identifier digit 
letter — a 

letter — b 


letter — z 
digit — 0 
digit — 1 


digit > 9 


Here the set of terminals is (a, b, ... , z, 0, 1, ... , 9) and identifier is the start 
symbol. e 


A shorthand that can avoid a long listing of productions is called Backus-Naur 
form (BNF). The productions of Example 54 can be given in BNF by three lines 
(as in Example 7 of Chapter 3): 


<identifier> = «letter | cidentifier- «letter | identifier «digit 
«leder: т | |@ |. |ж 
digit: оа 
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In BNF, nonterminals are identified by < >, the production arrow becomes 
::=, and | stands for “or,” identifying various productions having the same left- 
hand symbol. 

In modern times, BNF notation was originally used to define the programming 
language Algol (ALGOrithmic Language) in the early 1960s. However, it appears 
that a similar notation was used between 400 все and 200 все to describe the rules 
of Sanskrit grammar.” ө 


The markup language HTML (Hypertext Markup Language) is a specific language 
for writing documents that will be transmitted across a network and displayed by a 
Web browser. The browser parses and interprets the various tags that identify legal 
elements of an HTML document and displays the elements accordingly. The struc- 
ture of each HTML element, such as a TABLE element, is fixed; someone writing 
an HTML document using TABLE elements must follow the structure of a TABLE 
element as defined in HTML. XML (Extensible Markup Language) is a gener- 
alized language for defining specific markup languages, that is, it’s a language 
for defining languages. The elements of an XML document can have whatever 
structure the user chooses; the user defines these elements in an associated DTD 
(Document Type Definition). The DTD can be part of the XML document, or it can 
be in a separate file referenced by the XML document (and as such, available for 
a number of XML documents to use). The following DTD gives a grammar for an 
XML document about manufactured parts, and defines the structure of a “PART- 
SLIST" element, an “ITEM” element, and so forth. 


<!DOCTYPE PARTSLIST { 
<!ELEMENT PARTSLIST (ITEM*)> 
<!ELEMENT ITEM (PARTNUMBER, USEDINLIST)> 
<!ELEMENT PARTNUMBER (#PCDATA)> 
<!ELEMENT USEDINLIST (USEDIN*)> 
<!ELEMENT USEDIN (MAKE, MODEL)> 
<!ELEMENT MAKE (#PCDATA)> 
<!ELEMENT MODEL (#PCDATA)> 

ls 


The DTD uses notation similar to that for regular expressions, where * means 
zero or more instances, + means one or more instances, and ? means zero instances 
or one instance. So in the preceding DTD, a PARTSLIST consists of one or more 
ITEMs, and a USEDINLIST consists of zero or more USEDINs. The notation #PC- 
DATA stands for a “parsed character string”. In BNF notation, the grammar becomes 


<PARTSLIST> ::= <ITEM>|<ITEM><PARTSLIST> 
«ITEM? ::= <PARTNUMBER> <USEDINLIST> 
<PARTNUMBER® ::— <identifier> 

<USEDINLIST>::=A| <USEDIN>|<USEDIN><USEDINLIST> 
<USEDIN> ::= <MAKE><MODEL> 

<MAKE>::= <identifier> 

<MODEL ::= <identifier> 

<identifier> ::= [as defined in Example 55] 


?“Panini-Backus Form Suggested,” Ingerman, Р. Z ., Communications of the ACM, vol. 10, No. 3, 1967. 


REMINDER 


The erasing convention 
says how to do erasing 

IF erasing is to be done. 

It does not require the 
production S — А in every 
grammar. 
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A Web browser parses the XML document to determine whether it is a valid in- 
stance of the language defined by its associated DTD, and if so, displays it. ө 


Classes of Grammars 


Before we identify some types of grammars, let’s look at one more example. 


Let L be the empty string A together with the set of all strings consisting of an 
odd number n of 05, п = 3. The grammar С = (V, Vr, S, P) generates L where 
V = {0, A, В, Е, Е, W, X, Y, Z, S}, Ут = {0}, and the productions are 


S — FA 0X > X0 0Z > 70 
S — FBA JA => OW WBZ — EB 
FB — FOEBO FX — FOW Fou. 

ЕВ —0 YA — Z0A Ал 

ЕВ — ХВҮ WO = OW 


ЕУ 
The derivation 5 => FA = ЛА = A produces Л. The derivation 


5 = ЕВА 
=> FOEBOA 
= FOXBYOA 
Š FXOBOYA 
=> FOWOBOZOA 
= FOOWBZO0A 
=> FOOEBOOA 
= F000004 


=; 00000 


produces five 05. Notice how X and Y, and also W and Z, march back and forth 
across the strings of 05, adding one more 0 on each side. This activity is highly 
reminiscent of a Turing machine read-write head sweeping back and forth across 
its tape and enlarging the printed portion. e 


The preceding grammar allows the erasing productions F > A and A > A. 
To generate any language containing A, we have to be able to erase somewhere. In 
the following grammar types, we will limit erasing, if it occurs at all, to a single 
production of the form S — A, where S is the start symbol; if this production oc- 
curs, we will not allow S to appear on the right-hand side of any other productions. 
This restriction allows us to crank out A from S as a special case and then get on 
with other derivations, none of which allow any erasing. Let's call this the erasing 
convention. The following definition defines three special types of grammars by 
further restricting the productions allowed. 


© DEFINITIONS 
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CONTEXT-SENSITIVE, CONTEXT-FREE, AND REGULAR GRAMMARS; 
CHOMSKY HIERARCHY 

A grammar С is context-sensitive (type 1) if it obeys the erasing convention and 
if, for every production a — f (except S — A), the word £ is at least as long as 
the word a. A grammar G is context-free (type 2) if it obeys the erasing conven- 
tion and for every production a > f, a is a single nonterminal. A grammar С 

is regular (type 3) if it obeys the erasing convention and for every production 

a — p (except S — A), a is a single nonterminal and £ is of the form t or tW, 
where t is a terminal symbol and Wis a nonterminal symbol. This hierarchy of 
grammars, from type 0 to type 3, is called the Chomsky hierarchy. 


© DEFINITION 


In a context-free grammar, a single nonterminal symbol on the left of a 
production can be replaced wherever it appears by the right side of the pro- 
duction. In a context-sensitive grammar, a given nonterminal symbol can per- 
haps be replaced only if it is part of a particular string (context)—hence the 
names context-free and context-sensitive. It is clear that any regular grammar 
is also context-free, and any context-free grammar is also context-sensitive. 
The grammar of Example 49 is regular (the two productions have the single 
nonterminal S on the left, and on the right either 1—a terminal—or 0S—a ter- 
minal followed by a nonterminal). Both grammars of Example 51 are context- 
free but not regular (again the single nonterminal S appears on the left of all 
productions, but the right sides consist of three symbols or two nonterminals 
or two terminals, respectively). The grammar of Example 52 is context-sensi- 
tive but not context-free (the productions do not shrink any strings, but some 
left sides have multiple symbols). The grammars of Example 53 and Example 
54 are context-free but not regular (for example, the first three productions of 
Example 54 violate the requirement for a regular grammar). Finally, the gram- 
mar of Example 57 is a type 0 grammar, but it is not context-sensitive (for 
example, the production EB — 0 is a "shrinking" production; also, the erasing 
convention is violated). 


LANGUAGE TYPES 
A language is type 0 (context-sensitive, context-free, or regular) if it can be 
generated by a type 0 (context-sensitive, context-free, or regular) grammar. 


Because of the relationships among the four grammar types, we can classify 
languages as shown in Figure 9.26. Thus, any regular language is also context-free 
because any regular grammar is also a context-free grammar, and so on. However, 
although it turns out to be true, we do not know from what we have done that these 
sets are properly contained in one another. For example, the language L described 
in Example 57 was generated in that example by a grammar that was type 0 but 
not context-sensitive, but that does not imply that L itself falls into that category. 
Different grammars can generate the same language. 
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Formal Language Hierarchy 


Context-sensitive 


Context-free 


Regular 


Figure 9.26 


DEFINITION EQUIVALENT GRAMMARS 


Two grammars are equivalent if they generate the same language. 


EXAMPLE 58 


Example 57 gave a grammar G to generate a language L, which is A together with all 
odd-length strings of Os of length at least 3. We will now give three more grammars 
equivalent to G. (See if you agree that each of these grammars also generates L.) 

G, = (V, Vr, S, P) where V = (0, A, B, S}, Vr = {0}, and the productions are 


5 А AB — 00 
S— ABA 0A — 000A 
4А 0 


G, is context-sensitive but not context-free. 
С, = (V, Vr, S, P) where V = (0, A, S}, Vr = {0}, and the productions are 


S => A A — 00А 
S — 00A A -—0 


G, is context-free but not regular. 
G; = (V, Vr, S, P) where V = (0, A, B, C, S}, Vr = {0}, and the productions 
are 


S => Л (В) 
S — 0A 1] — WC 
A — 0B C — 0B 


G; is regular. 
Thus, when all is said and done, L is a regular language. e 
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PRACTICE 65 | Give the derivation of 00000 in Gi, G>, and G3. 


This is a somewhat confusing point and worth repeating. Because a given 
language can be generated by multiple grammars, if you find a grammar G for a 
language L that is type x but not type y, that does not necessarily make L type x 
but not type y. There might be a different grammar that also generates L and is a 
type у grammar. 


Formal Languages and Computational Devices 


The language L of Example 57 can be described by the regular expression 
А V (000)(00)*, so L is a regular set. From Example 58, L is also a regular language. 
It is not coincidental that a regular set turned out to be a regular language. It can be 
shown that for any finite-state machine, the set it recognizes is a regular language. It 
can also be shown that for any regular language, there is a finite-state machine that 
recognizes exactly that language. (In the proofs of these results, the productions of a 
regular grammar correspond to the state transitions ofa finite-state machine.) Hence 
those sets recognized by finite-state machines—the regular sets—correspond to 
regular languages. Therefore the class of sets recognized by a computational device 
of limited capacity coincides with the most restricted class of languages. 

On the other end of the spectrum, the most general computational device is 
the Turing machine and the most general language is a type 0 language. As it 
happens, the sets recognized by Turing machines correspond to type 0 languages. 

There are computational devices with capabilities midway between those of 
finite-state machines and those of Turing machines. These devices recognize ex- 
actly the context-free languages and the context-sensitive languages, respectively. 

The type of device that recognizes the context-free languages is called a 
pushdown automaton, or pda. A pda consists of a finite-state unit that reads 
input from a tape and controls activity in a stack. Symbols from some alphabet 
can be pushed onto or popped off of the top of the stack. The finite-state unit in a 
pda, as a function of the input symbol read, the present state, and the top symbol 
on the stack, has a finite number of possible next moves. The moves are of the 
following types: 


1. Go toa new state, pop the top symbol off the stack, and read the next input 
symbol. 

2. Go to a new state, pop the top symbol off the stack, push a finite number 
of symbols onto the stack, and read the next input symbol. 

3. Ignore the input symbol being read, manipulate the stack as above, but do 
not read the next input symbol. 


A pda has a choice of next moves, and it recognizes the set of all inputs for which 
some sequence of moves exists that causes it to empty its stack. It can be shown 
that any set recognized by a pda is a context-free language, and conversely. 

The type of device that recognizes the context-sensitive languages is called a 
linear bounded automaton, or Iba. An lba is a Turing machine whose read-write 
head is restricted to a portion of the tape that is no longer than a constant multiple 
of the length of the original input; in addition, at each step it has a choice of pos- 
sible next moves. An lba recognizes the set of all inputs for which some sequence 
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of moves exists that causes it to halt in a final state. Any set recognized by an lba 
can be shown to be a context-sensitive language, and conversely. 

Figure 9.27 shows the relationship between the hierarchy of languages and the 
hierarchy of computational devices. 


secognized by Tuy; 
n 
e 
2 


Regular «э 
recognized by 
finite-state machine 


Figure 9.27 


Context-Free Grammars 


Context-free grammars are important for three reasons. Context-free grammars 
seem to be the easiest to work with, since they allow replacing only one symbol at 
a time. Furthermore, many programming languages are defined such that sections 
of syntax, if not the whole language, can be described by context-free grammars. 
Finally, a derivation in a context-free grammar has a nice graphical representation 
called a parse tree. 


The grammar of Example 54 is context-free. The word d2q can be derived 
as follows: identifier — identifier letter — identifier digit letter — letter digit 
letter = d digit letter — d2 letter = d2q. We can represent this derivation as a 
tree with the start symbol for the root. When a production is applied to a node, that 
node is replaced at the next lower level of the tree by the symbols in the right-hand 
side of the production used. A tree for the derivation appears in Figure 9.28. 


identifier 


: as letter 
identifier 


identifier ¢ digit 4 


Figure 9.28 e 


PRACTICE 66 | Draw a parse tree for the word m34s in the grammar of Example 54. 
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EXAMPLE 60 


Suppose that a context-free grammar G describes a programming language. 
The programmer uses the rules of G to generate legitimate strings of symbols, 
that is, words in the language. Here we may think of a word as corresponding to 
a program instruction. Thus, a word consists of various subwords, for example, 
identifiers, operators, and key words for the language. The program instructions 
are fed into the compiler for the language so that the program can be translated 
into machine language code for the computer. The compiler must decide whether 
each program instruction is legitimate in the language. In a sense, the compiler 
has to undo the process that the programmer used to construct the statement; that 
is, the compiler must start with the statement and decompose it to see whether it 
follows the rules of the language. This really entails two questions: Are the sub- 
words themselves legitimate strings? Is the program instruction a legitimate way 
of grouping the subwords together? 

Usually the set of legitimate subwords of a language can be described by a 
regular expression, and then a finite-state machine can be used to detect the sub- 
words; the lexical analysis or scanner portion of the compiler handles this phase of 
compilation. (See Exercise 32 of Section 9.3 for the lexical analysis of ScrubOak.) 
Ifall goes well, the scanner then passes the program instruction, in the form of a 
string of legitimate subwords, to the syntax analyzer. The syntax analyzer deter- 
mines whether the string is correct by trying to parse it (construct its parse tree). 

Various parsing techniques, which we won’t go into, have been devised. 
Given a string to be tested, one approach is to construct a tree by beginning with 
the start symbol, applying productions (keeping an eye on the “goal’—the given 
string) and ending with the goal string. This procedure is called top-down pars- 
ing. The alternative is to begin with the string, see what productions were used 
to create it, apply productions “backwards,” and end with the start symbol. This 
process is called bottom-up parsing. The trick to either approach is to decide 
exactly which productions should be used. 


Consider the context-free grammar G given by С = (V, Vr, S, P) where 
V = (a, b, c, A, B, C, S}, V; = (a, b, с}, and the productions are 


SB BG A — abc 
5 — 4A B — ab С => е 


Suppose we want to test the string abc. А derivation for abc is 5 = A = арс. 
If we try a top-down parse, we might begin with 
S 


| 


В 


Then we have to detect that this will not work and try something else. If we try a 
bottom-up parse, we might begin with 
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BIC 


| 


abc 


Then we have to detect that this will not work and try something else. Parsing tech- 
niques automate this process and attempt to minimize the amount of false starts 


and backtracking required. 


Notice the distinction between generating members of a set, which the pro- 
grammer does, and deciding membership in a set, which the compiler does. Since 
we ask the compiler to decide membership in a set, a decision algorithm must exist 
for the set. It turns out that decision algorithms do exist for context-free languages, 


another point in their favor. 


SECTION 9.5 REVIEW 


TECHNIQUES 


* Describe L(G) for a given grammar G. 


(2) Define a grammar to generate a given language L. 


* Construct parse trees in a context-free grammar. 


MAIN IDEAS 


* Agrammar G is a generating mechanism for its lan- 


guage L(G). 


* Formal languages were developed in an attempt to 
describe correct syntax for natural languages; al- 
though this attempt has largely failed because of the 
complexity of natural languages, it has been quite 


successful for high-level programming languages. 


EXERCISES 9.5 


Special classes of grammars are defined by restrict- 
ing the allowable productions. 

The various types of formal languages correspond 
to the sets recognized by various automata; in par- 
ticular, (1) regular languages are the sets recognized 
by finite-state machines, (2) context-free languages 
are the sets recognized by pushdown automata, 
(3) context-sensitive languages are the sets recog- 
nized by linear bounded automata, and (4) type 0 
languages are the sets recognized by Turing ma- 
chines. 

Derivations in context-free grammars can be illus- 
trated by parse trees. 

A compiler for a context-free programming lan- 
guage checks correct syntax by parsing. 


1. Describe L(G) for the grammar С = (V, Vr, S, P) where V = (a, А, B, C, S], Vy = {a}, and P consists of 


5 А 
A — ВС 


Аа 


What type of grammar is this? 


BoA 
aC эл 


2. Describe L(G) for the grammar С = (V, Уу, S, P) where V = (0, 1, A, B, S}, Vr = (0, 1}, and P consists of 


S — 0A 
S — 1A 


What type of grammar is this? 


A — 1ВВ 


В—›01 
В 11 


© 00-10 л 


10. 


— 
= 


12. 
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. Describe L(G) for the grammar С = (V, Ут, S, Р) where V = (0, 1, A, B, S}, Vr = (0, 1}, and P consists of 


5 0 A 1B B — 0A 
S — 0A B0 


What type of grammar is this? 


. Describe L(G) for the grammar С = (V, Vn, S, P) where V = (0, 1, A, S}, Ит = (0, 1}, and P consists of 


S — 05 A Э 1A 
$—> 114 А1 


What type of grammar is this? 


. Find a regular grammar that generates the language of Exercise 1. 
. Find a regular grammar that generates the language of Exercise 2. 
. Find a regular grammar that generates the language of Exercise 3. 
. Find a regular grammar that generates the language of Exercise 4. 
. Describe L(G) for the grammar С = (V, Vr, S, P) where V = {a, b, A, B, S, Vr = (a, b}, and P consists of 


S — aA 
S Эл 
A — bS 


Describe L(G) for the grammar С = (V, Vj, S, P) where V = (0, 1, А, S}, V; = (0, 1}, and P consists of 
S 0 


S — ASA 
А 1 


. Describe L(G) for the grammar С = (V, Vr, S, P) where V = (a, b, A, В, S}, V; = (a, b), and P consists of 


S— AB AB — AAB 
Aa AB — ABB 
Bb 


What type of grammar is G? Find a regular grammar G' that generates L(G). 


Describe L(G) for the grammar С = (V, Vr, S, P) where V = (a, b, c, A, B, C R, T, S, Vr = (a, b, c), 
and P consists of 


S — ARBT ТОСТ. 
R— ARB Tc 

RB — ACBB Аа 

СВ — BC Bob 

CT >T 


What type of grammar is G? Find a context-free grammar G' that generates L(G). Explain why it is not 
possible to find a regular grammar that generates L(G). 
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13. Write the productions of the following grammars in BNF: 
a. Gin Exercise 2 
b. Gin Exercise 3 
c. Gin Exercise 4 

14. Write the productions of the following grammars in BNF: 
a. Gin Exercise 10 
b. G, in Example 58 

15. A grammar G is described in BNF as 


<S> п= 1/1<S>|<S> 00 


a. Find a regular expression for L(G). 
b. Find a regular grammar for L(G). 
16. A grammar G is described in BNF as 


<S> п= 01/0<S>|<S> 1 


a. Find a regular expression for L(G). 

b. Find a regular grammar for L(G). 
17. Find a grammar that generates the set of all strings of well-balanced parentheses. 
18. English words are translated into "pig latin" by the following two rules: 

1. If a word begins with a vowel, add the suffix “уау”. 


2. If a word begins with a string of one or more consonants, move the string of leading consonants to the 
back of the word and then add the suffix “ау”. 


. What is the pig latin translation of the word “apple”? 
. What is the pig latin translation of the word *monkey"? 
. What is the pig latin translation of the word “chain”? 


ao с gp 


. Find a grammar that generates all legal pig latin words. 
e. Using your grammar from part d, generate the pig latin word that was the answer to part c. 


19. A word w in V* is a palindrome if w = w^, where w^ is the reverse of the string w. A language L is a 
palindrome language if L consists entirely of palindromes. Find a grammar that generates the set of all 
palindromes over the alphabet (a, b}. 


20.a. Let L be a palindrome language (see Exercise 19). Prove that L^ = (w^|w є L} is a palindrome 


language. 
b. Let w be a palindrome. Prove that the language described by the regular expression w* is a palindrome 
language. 
21. Find a regular grammar that generates the language L = 11(0 V 1)*. 


22. Find a regular grammar that generates the language L — (0 V 1)*01. 
23. Find a grammar that generates the language L = {17"|n = 0}. 


24. Find a regular expression for the language of Exercise 23. Is your grammar from Exercise 23 regular? If 
not, write a regular grammar that generates L. 


25. Find a context-free grammar that generates the language L = (0"1"|n > 0}. 


26; 


27. 


28. 


29. 
30. 
31. 
32. 


33: 


34. 


35. 


36. 


3T. 


38. 
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Find a context-free grammar that generates the language L = (ww^|w є (0, 1}* and w^ is the reverse of 
the string wj. 


Find a context-free grammar that generates the language L where L consists of the set of all nonempty 
strings of Os and 1s with an equal number of 0s and 15. 


Find a context-free grammar that generates the language L where L consists of the set of all nonempty 
strings of Os and 1s with twice as many 05 as 15. 


Find a grammar that generates the language L = {0° |i > 0}. 
Find a grammar that generates the language L = (0"1?"0"|n = 0}. 
Find a grammar that generates the language L = {ww|w e (0, 1}*}. 


Find a grammar that generates the language L — (a" |n = 1}. (By Exercise 38, L is not a context-free 
language, so your grammar cannot be too simple.) 


Draw parse trees for the following words: 

a. 111111 in the grammar G of Example 51 
b. 111111 in the grammar С’ of Example 51 
Draw parse trees for the following words: 

a. 011101 in the grammar G of Exercise 2 
b. 00111111 in the grammar С of Exercise 4 


Consider the context-free grammar С = (V, Vr, S, P) where V = (0,1, 4, S}, V; = {0,1}, and Р 
consists of 


S — AIA 
А 0 
A — AIA 


Draw two distinct parse trees for the word 01010 in G. A grammar in which a word has more than one 
parse tree is ambiguous. 


Ambiguity in a grammar (see Exercise 35) that describes a programming language is an undesirable trait 
because the compiler is unable to uniquely parse the programming instruction. Ambiguity in a natural 
language can be similarly confusing. The following two instructions were posted beside an escalator. 


"Shoes must be worn." 
“Dogs must be carried.” 


Give two possible interpretations for each instruction. 

Show that for any context-free grammar G there exists a context-free grammar G' in which for every 
production a — f, B is a longer string than a, L(G’) C L(G) and L(G) — L(G’) is a finite set. 
Following is the pumping lemma for context-free languages. Let L be any context-free language. Then 
there exists some constant k such that for any word w in L with |w| = К, w can be written as the string 
Wi1W3W3Wa4Ws With [изии | = k and уу = 1. Furthermore, the word wjw5w;3w4w; € L for each i = 0. 
a. Use the pumping lemma to show that L = (a"b"c"|n = 1} is not context-free. 

b. Use the pumping lemma to show that L = (a" |n = 1} is not context-free. 
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TERMINOLOGY 


addition modulo л (р. 692) 
alphabet (pp. 694, 782) 
alternating group (p. 701) 
associative binary operation 
(p. 687) 
Backus Naur form (BNF) (p. 787) 
bottom-up parsing (p. 794) 
cancellation laws (p. 696) 
canonical parity-check matrix 
(p. 721) 
check bits in a code word (p. 721) 
Chomsky hierarchy (p. 790) 
coefficient (p. 690) 
commutative binary operation 
(p. 687) 
commutative group (p. 688) 
concatenation (p. 694) 
context-free (type 2) grammar 
(p. 790) 
context-free language (p. 790) 
context-sensitive (type 1) 
grammar (p. 790) 
context-sensitive language 
(p. 790) 
coset leader (p. 723) 
cosets of a subgroup in a group 
(p. 717) 
decision problem (p. 772) 
degree of a polynomial (p. 691) 
delay element (p. 744) 
direct generation (derivation) of a 
word (p. 784) 
double-error detecting code 
(p. 715) 
empty string (p. 694) 
equivalent grammars (p. 791) 
equivalent states (p. 739) 
erasing convention (p. 789) 
even and odd permutations 
(p. 700) 
final state (pp. 734, 765) 
finite-state machine (p. 729) 
formal language (p. 785) 
free monoid generated by a set A 
(p. 695) 


generation (derivation) of a word 
(p. 784) 

group (p. 688) 

group code (p. 719) 

group of permutations on a set 4 
(p. 693) 

halting problem (p. 774) 

Hamming distance (p. 718) 

homomorphism (p. 702) 

identity element (p. 687) 

improper subgroup (p. 700) 

information bits in a code word 
(p. 721) 

input alphabet (p. 729) 

intractable problem (p. 776) 

inverse element (p. 687) 

isomorphism (p. 702) 

k-equivalent states (p. 739) 

kernel of a homomorphism 
(p. 716) 

language (p. 782) 

language generated by a 
grammar G (p. 784) 

left cancellation law (p. 696) 

left cosets (p. 717) 

length of a string (p. 694) 

linear bounded automaton (Iba) 
(p. 792) 

maximum likelihood decoding 
(p. 715) 

minimum distance of a code 
(p. 718) 

monoid (p. 689) 

multiplication modulo n (p. 692) 

negative solution to a decision 
problem (p. 772) 

next-state function (p. 729) 

nondeterministic Turing machine 
(p. 777) 

NP (p. 777) 

NP-complete problem (p. 778) 

number-theoretic function (p. 767) 

order of a group (p. 698) 

output alphabet (p. 729) 

output function (p. 729) 


Р (р. 777) 

parse tree (р. 793) 

partial function (p. 767) 

partition refinement (p. 741) 

perfect code (p. 722) 

permutation group (p. 700) 

phrase-structure (type 0) grammar 
(p. 783) 

polynomial in x with real number 
coefficients (polynomial in x 
over R) (p. 690) 

polynomial of zero degree (p. 691) 

positive solution to a decision 
problem (p. 772) 

production (p.783) 

proper subgroup (p. 700) 

pushdown automaton (pda) 
(p. 792) 

recognition by a finite-state 
machine (p. 734) 

recognition (acceptance) by a 
Turing machine (p. 765) 

refinement (p.741) 

regular expression (p. 735) 

regular (type 3) grammar (p. 790) 

regular language (p. 790) 

regular set (p. 736) 

right cancellation law (p. 696) 

right cosets (p. 717) 

semigroup (p. 689) 

semigroup of transformations on a 
set A (p. 693) 

sequential network (p. 745) 

single-error correcting code 
(p. 715) 

start symbol (p. 783) 

state graph (p. 730) 

state table (p. 730) 

string (p. 694) 

subgroup (p. 699) 

symbol (p. 694) 

symmetric group of degree n 
(p. 693) 

syndrome of a binary n-tuple in a 
group code (p. 724) 


tape alphabet (р. 762) 
terminal (p. 783) 
top-down parsing (p. 794) 
total function (p. 767) 
transposition (p. 700) 
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SELF-TEST 


Answer the following true—false questions. 


Section 9.1 


1. 


2. 


A binary operation is associative if the order of the 
elements being operated upon does not matter. 
The identity i in a group [G, +] has the property that 


x!l-i=i x! = x! forall xin G. 


. Every group is also a monoid. 
. A group of order 10 cannot have a subgroup of 


order 6. 


. If [S,-] and [T, +] are two groups, then a function 


А5 T for which f(x: у) = f(x) + f(y) is an 
isomorphism. 


Section 9.2 


1. 


2. 


3. 


A binary single-error correcting code must have 
minimum distance at least 3. 

In a canonical n X r parity-check matrix H, the 
bottom r rows form the r X r identity matrix. 

A canonical п X r parity-check matrix H maps all 
of Z2 to code words in Z5. 


. If the syndrome of a received word X in Z5 is 0,, 


then X is assumed to be a code word. 


. A perfect code is "perfect" because no bit errors 


will occur during transmission. 


Section 9.3 


1. 


2. 


The next state ofa finite-state machine is determined 
by its present state and the present input symbol. 
The set of all binary strings ending in two Os is 
regular. 


. A finite-state machine cannot get to a state from 


which there is no exit. 


. According to Kleene’s theorem, a set that cannot be 


described by a regular expression cannot be recog- 
nized by a finite-state machine. 


Turing-computable function 
Turing machine (p. 762) 


type 0 language (p. 790) 
unreachable state (p. 737) 


5. 


unsolvable (undecidable) decision 
problem (p. 772) 

vocabulary (p. 782) 

weight of a code word (p. 719) 

word (pp. 694, 782) 


In a finite-state machine, k-equivalent states are 
also (A + 1)-equivalent. 


Section 9.4 


1. 


2. 


A Turing machine halts if and only if it enters a 
final state. 

A Turing machine that computes the function 
f(n) = n + 1, given input л, will halt with (л + 1) 
15 on its tape. 


. Church’s thesis says that the halting problem is 


unsolvable. 


. The halting problem says that, given a Turing ma- 


chine and its input, there is no algorithm to decide 
whether the Turing machine halts when run on that 
input. 


. A set in P is recognizable by a Turing machine in 


no more than a polynomial number of steps. 


Section 9.5 


1. 


The language generated by a type 0 grammar G is 
the set of all strings of terminals generated from the 
start symbol by applying G’s productions a finite 
number of times. 


. Beginning at the start symbol and applying the 


productions of a grammar G eventually leads to a 
string of terminals. 


. Alanguage generated by a grammar that is context- 


sensitive but not context-free is a context-sensitive 
but not context-free language. 


. Any regular set is a regular language. 
. A parse tree will have as many leaves as terminal 


symbols in the word being derived. 
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ON THE COMPUTER 


For Exercises 1—11, write a computer program that 
produces the desired output from the given input. 


1. Input: Two words from an alphabet A 


Output: Their concatenation 


2. Input: Positive integer n and finite alphabet A 


Output: All words over A of length =n 


3. Input: Positive integer n 


Output: Tables for addition and multiplication 
modulo n 


Input: Positive integer n 

Output: The n! elements of S, expressed both in 
array form and in cycle notation, the group table for 
[S,, °], and the group table for [A,, °] 


Input: n X п array, п = 10, that purports to repre- 
sent a binary operation on the finite set of integers 
from 1 to n 

Output: Determination of whether the set under this 
operation is a commutative group 


6. Input: Two n X n arrays, n = 10, that represent 


two groups and an array that represents a function 
from the first group to the second 

Output: Determination of whether the function is 
an isomorphism 


7. Input: A canonical n X r parity-check matrix H 


for a single-error correcting code with r = 4 and 
n = 27—11 

Output: The set of binary m-tuples H encodes апа 
the code word for each one 


Input: A canonical parity-check matrix for a perfect 
single-error correcting code where r = 5 and any 
binary n-tuple X 

Output: The binary n-tuple to which X is decoded 


Input: Positive integer n, n = 50, representing the 
number of states of a finite-state machine with in- 


put alphabet = output alphabet = {0,1} and an 
n X 3 array representing the state table description 
of such a machine 

Output: List of any states unreachable from the 
start state s, 


10. Input: Positive integer n, n = 50, representing 


the number of states of a finite state machine with 
input alphabet = output alphabet = (0, 1} and an 
n X 3 array representing the state table descrip- 
tion of such a machine 

Output: m X 3 array representing the state table 
of a minimized version of M 


11. Input: Set of terminals in a grammar and a de- 


scription of the productions in a grammar; ability 
for the user to set a maximum number of steps for 
any derivation 

Output: List of words in the language that can be 
derived within that maximum 


12. Write a finite-state machine simulator. That is, 


given 


* a positive integer п, п = 50, representing the 
number of states of a finite state machine with 
input alphabet = output alphabet = {0,1} 

* an n X 3 array representing the state table 
description of such a machine 


your program should request input strings and 
write the corresponding output strings as long as 
the user wishes. 


Write a Turing machine simulator. That is, given 
a set of quintuples describing a Turing machine, 
your program should request the initial tape con- 
tents and write out a sequence of successive tape 
configurations. Assume that there are at most 100 
quintuples and that the number of cells used on 
the tape is at most 70, and allow the user to set a 
maximum number of steps in case the computa- 
tion does not halt before then 
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Appendix 


Derivation Rules for Propositional 
and Predicate Logic 


EQUIVALENCE RULES 

Expression Equivalent to Name/Abbreviation for Rule 
РУ О QVP Commutative —comm 

PAQ QAP 

(РМО УА PV (QV R) Associative—ass 
(PAQ)AR PA\ (QAR) 

(PV 9)’ P'AQ' De Morgan's laws— De Morgan 
(P^Q) РУО’ 

PQ P'VQ Implication — imp 

P (P^) Double negation — dn 
[EVA] (ух)[А(х)]' Negation—neg 

INFERENCE RULES 

From Can Derive Name/Abbreviation for Rule 
P,P>Q Q Modus ponens—mp 

P>Q, Q' P Modus tollens — mt 

P,Q PAQ Conjunction—con 

PAQ P,Q Simplification—sim 

Р PVQ Addition—add 

Рэ 09,9 ә В P—HR Hypothetical syllogism —hs 
РМО, Р' Q Disjunctive syllogism—ds 
PQ Q' > P' Contraposition—cont 
Q'—P' PQ Contraposition — cont 

P PAP Self-reference—self 

PVP P Self-reference — self 

(PAQ > В Р (О > В) Exportation—exp 

Р, Р' О Inconsistency — inc 

P ^(QVv ВБ, (P ^ О) V (P ^ В) | Distributive—dist 

PV (QAR) (PV Q)A (PVR) |Distribution—dist 
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Derivation Rules for Propositional and Predicate Logic 


INFERENCE RULES (CONTINUED) 
From Can Derive Name/Abbreviation for Rule | Restrictions on Use 
(Vx)P(x) P(t), where t Universal instantiation — ui If tis a variable, it 
is a variable must not fall within 
or constant the scope of a 
symbol quantifier for t. 
(3x)P(x) P(a) where a Existential instantiation —ei Must be the first rule 
is a constant used that introduces 
symbol not a. 
previously 
used in proof 
sequence 
P(X) (Vx)P(x) Universal generalization—ug _| P(x) has not been 
deduced from any 
hypotheses in which 
X is a free variable 
nor has P(x) been 
deduced by ei from 
any wff in which x is 
a free variable. 
P(x) or P(a) | (3x) P(x) Existential generalization—eg | To go from P(a) to 
where a is (3x)P(x), x must not 
a constant appear in P(a). 
symbol 


Appendix 


Summation and Product Notation 


Summation notation is a shorthand way of writing certain expressions that are 
sums of terms. As an example, consider the sum of the integers from | to 5: 


1+2+3+4+ 5 


This expression can be thought of in the following way: Suppose we have some 
quantity 7 that initially has the value 1 and then takes on successive values of 2, 3, 
4, and 5. The expression is the sum of i at all its different values. The summation 
notation is 


t=1 


The uppercase Greek letter sigma, X, denotes summation. Here the number 1 is 
the lower limit of summation, and the number 5 is the upper limit of summation. 
The variable i is called the index of summation. The index of summation is ini- 
tially set equal to the lower limit and then keeps increasing its value by 1 until it 
reaches the upper limit. All the values that the index of summation takes on are 
added together. Thus 


5 
і=1+2+3+4+5 = 15 
1 


is 


Similarly 


3 
Уі=1+2+3=6 


i=l 
And 


8 


>т=4+5+6+7+8=30 


i=4 


In these examples, the expression after the summation sign is just 7, the in- 
dex of summation. However, what appears after the summation sign can be any 
expression, and the successive values of the index are simply substituted into the 
expression. Thus 


5 
SP = 12+ 22 + 32 + 4 + 5 = 55 
S 805 
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A way to symbolize summation in general is 


Here the lower limit, upper limit, and expression behind the summation are not 
specifically given but merely symbolized. The notation a; is a reminder that the 
expression will be evaluated at the different values 7 takes on in going from the 
lower to the upper limit. 

There are three special cases to consider: 


Here the expression behind the summation is the constant 0, which has the value 
0 no matter what the value of the index of summation. The sum of any number of 
Os is 0. 


Here again the expression behind the summation is a constant, and the summation 
says to add л copies of 1, which results in л. 


Here the upper limit is smaller than the lower limit; the usual interpretation of sum- 
mation does not apply, but by convention the summation is assigned the value 0. 

The index of summation is a dummy variable, meaning that it merely acts as a 
placeholder and that a different variable could be used without changing the value 
of the summation. Thus 


It may be convenient to change the limits on a summation, which is legitimate 
as long as the final value of the summation remains the same. For example, 


3 2 
У і = У, (i+ 1) 
i=1 ї=0 
since both have the value 


1+2+3=6 


Finally, the following three rules hold, as we will see shortly. 
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Rules of Summation 


4 4 4 
l. ¥@+b)= S a+ УЬ, 
i=p i=p i=p 
4 4 4 
2. > (а; = bj) = 2 d = 2 
i=p i=p i=p 
3. Уса = суа where с is a constant 


To prove rule 1, note that 


| | | eee 
ap + bp t ap+1 + Буя Fu +t b; 
| | 
Т Т 


= а +а t+ ta 


gor Dy РВ аре Б, 
because of the commutative property of addition. The proof of rule 2 is similar. 
To prove rule 3, note that 


| | eee = eee 
cap + 60544 T Fog, = dd, + аруу ® + а) 


because of the distributive property. This rule allows a constant to be “moved 
through” a summation. 

Sometimes a summation can be represented by an even shorter expression 
that does not involve adding separate terms. For example, according to Exercise 7 
of Section 2.2, 


Ls. п(п+1)(2п +1) 
2 6 (1) 


5 
so that the value of У 2 can be found by substituting the upper limit, 5, into the 
i=l 


right side of (1), giving 


5(5 + 1)(2*5 + 1 
( X La 


as before. Section 2.2 and its exercises give a number of other “closed form” ex- 
pressions for certain summations, all of them provable by mathematical induction. 

Product notation is a shorthand way of writing certain expressions that are 
products of factors. Product notation is very similar to summation notation, except 
it uses the uppercase Greek letter pi, П and the various items are multiplied rather 
than added. There is an index of multiplication, a lower limit, and an upper limit. 
For example, 


7 
i = (3)(4)(5)(6)(7) = 2520 
i=3 
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and 


2 
Цо + 5) = (2*1 + 5)2*2 + 5) = (7)(9) = 63 


Product notation occurs less often than summation notation, but there аге still 
special cases to note: 


0 (the product of all 0s equals 0) 


ы = 
— о 
ll ll 
_ 


(the product of all 1s equals 1) 


Appendix 


The Logarithm Function 


The logarithm function is closely related to the exponential function 
y= 
where b, the base, is a constant greater than 1. (Actually b can be any positive 


number, but the only interesting cases occur when b > 1.) Recall the following 
rules of exponents: 


| pig (when you multiply, you add exponents) 

2. Б/Б" = p^" (when you divide, you subtract exponents) 

3. (p^)" = pr (when you raise a power to a power, you multiply 
exponents) 


If we select a specific base, b = 2 for example, we can plot у = 2* for various 
values of x and fill in the remaining values, getting the graph 


In this function, x can take on any real value, and y will always be positive. An- 
other way to say this is that the domain ofthe function is the set К of real numbers, 
and the range is the set [R* of positive real numbers. 


809 
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A related function (in fact, the inverse function) is the /ogarithm function, 
defined by 


y = log,x meaning by =x 


Therefore log,16 = 4, for example, because 2^ = 16. These two equations are 
the logarithmic form and exponential form of the same fact. Similarly, log; 8 — 3 
and log;2 = 1. 

Following is a graph of y = log, x. 


>< 


Because the logarithm function у = log;x is the inverse of the exponential func- 
tion, its domain (the values x can take on) is the set R* of positive real numbers 
and the range (the values y can take on) is the set R of real numbers. The logarithm 
function for any base b > 1 has a domain, range, and shape similar to the case 
for b — 2. 

Certain properties about the logarithm function are true either because 
of its definition or because of corresponding properties about the exponential 
function, We'll list all the properties of the logarithm function, and then prove 
them. 


Properties of the Logarithm Function y = log,x 


1. If p < q then log,p < log;q (the log function is strictly increasing) 

2. Iflog,p = log,g then p = q (the log function is one to one) 

3. log, 1 = 0 

4. log,b = 1 

5. log, (b?) = p 

6. Б = р 

7. log, (pq) = log,p + 109,9 (the log of a product equals the sum of 
the logs) 

8. log, (p/q) = log,p — 1089 (the log of a quotient equals the 
difference of the logs) 

9. log;( p^) = q(log,p) (the log of something to a power 
equals the power times the log) 

10. log,p = log, p/log,a (change-of-base formula) 


11. pied 2 4% 
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In proving the properties of the logarithm function, we first note that many of 
the properties involve the quantities log, p and log, q. Let us name these quantities 
r and s, respectively, so that 


log, p = rand log, q = s 
This in turn means that 
b = рапа ° = д 


1. Because b > 1, the larger ће power to which Р is raised, the larger the result. 
Thus if p < q, then b” < b5, sor < s and therefore log,p < log,q. 

. If log,p = log, q, then r = s, so b" = b' and p = q. 

. log, 1 = 0 because Б? = 1. 

. log, b = 1 because b! = b. 

. log,(b”) = p because (translating this equation to its exponential form) 


л > чо го 


6. Ь°®Р = p because (translating this equation to its logarithmic form) 
log, p = log,p. 

7. log,(pq) = log,p + 109,9 =r + s because it is true that 
b’** = УЗ? = pq, which is the exponential form of the equation we are 
trying to prove. 

8. log; (p/q) = log,p — log,g = г — s because it is true that 
Ь 5 = Б/Б = p/q which is the exponential form of the equation we are 
trying to prove. 

9. log;( p^) = q(log,p) = qr because it is true that b” = (5^)? = рї, which 
is the exponential form of the equation we are trying to prove. 

10. log,p = log, p/log,a 


Let log, p = w. Then a” = p. Now take the logarithm to the base b of both 
sides of this equation: 


log,(a”) = w(log, a) = log,p 


or 
w = log, p/log,a 
which is the desired result. 
11. р°®4 = gis 
log,(p'°®"") = (log,q)(log,p) by Property 9 
= (108,р)(108,4) 


= log, (q'°*”) by Property 9 


Therefore р°®4 = qV% by Property 2 
The three most useful bases for logarithms are 
b = 10 (common logarithm) 

b = e,e ~ 2.7183 (natural logarithm) 

b = 2 (what we use throughout this book) 
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Common logarithms were used as computational aids before calculators and 
computers became commonplace. Property 7 of the logarithm function says that 
to multiply two numbers, one can take the logarithm of each number, add the re- 
sults, and then find the number with that logarithm value. Addition was easier than 
multiplication, and tables of common logarithms allowed one to look up a number 
and find its logarithm, or vice versa. 

Natural logarithms are useful in calculus and are often written “In p" rather than 
“log.p.” Base 2 logarithms are sometimes denoted by “lg p" rather than “log, р.” 
In this book, all logarithms are base 2, so we use log p to denote log, p. 

A final inequality involving base 2 logarithms (used in Section 3.3) is 

1+ logn <n ги = 3 
To prove this, note that 
n < 27 фори 3 
so by property 1 of logarithms, 
logn < log2"! 
By property 5 of logarithms, log2" ! = n — 1. Therefore 
logn«cn-1 


or 


1+ logn <n forn = 3 


Answers to Practice Problems 


Note to student: Finish all parts of a practice problem before turning to the 
answers. 


CHAPTER 1 


1. False, false, false 


2. AVB 3. 


түт| | |> 

т 1 п | | © 

njala 

ninaa 

тр a | | Ф 

alalama} 
тп 


. Antecedent: The rain continues Consequent: The river will flood 

. Antecedent: The central switch goes down Consequent: Network failure 

. Antecedent: The avocados are ripe Consequent: They are dark and soft 

. Antecedent: A healthy cat Consequent: A good diet 

6. Answer d. This is negation of А ^ B, the same as the negation of "Peter 15 tall and thin." 


aot & 


А (А> В) < (В > А) 


А 
T 
T 
F 
F 


n[d|m|auQ 


A’ | B |AVA' | BAB' | (АУА) > (B^B?) 
T F F 


ninla 
miadaimiaH | © 
чуч т| тп 
чіт | 


Т Е Е 
T F F 
T F F 
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Cc.) A B [e B' | AVB'|C' (A^B')C' [(A^B)C'J 
T T T F F F T F 
T T F F F T T F 
T F T T T F F T 
T F F if T T T F 
F T T F F F T F 
F T F F F T T F 
F F T T F F T F 
F F F T F T T F 
d. A B A' B' AB В' > А' | (A—5B)o(B'A?) 
T T F F T T T 
T F F T F F T 
F T T F T T T 
F F T T T T T 


9. To prove (P > Q) e (P' 


V О), just construct a truth table: 


Р | О 'P>Q| P | PPva| (Р а) < (Р У 0) 
TIT т Е т т 
т | Е Е Е Е т 
F | Т т т т т 
FF T т т т 
10. (4 ^ B^)! 1,2, mt 


11. 


— 


1. (А М В) э С һур 
2.C—D hyp 

3. А hyp 
4.4 V B' 3, add 
5. С 1, 4, mp 
6. D 2,5, mp 


12.(4А—> В) Л (В > С) > 
l.A В һур 
2. BiG: һур 
3. A hyp 
4. B 1,3, mp 
5. С 2,4, mp 


(4 2 C) 


(4VB)—C|^(C—^D)^A—D 
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13.(4 > ВӘ Л (СУД) ЛС В 
1. А >B һур 
2.C' VA hyp 
3. C hyp 
4. CA 2, imp 
S.C —B 1, 4, hs 
6. B 3,5, mp 
14. The argument is (S — R) ^ (S' — В) > (R' — B). A proof sequence is 


.S—R hyp 
5° —B hyp 
R' hyp 
S' 1, 3, mt 
B 2, 4, mp 


15. a. True (all daffodils are yellow) 
. False (not true that all flowers are yellow) 


. True (all flowers are plants) 


Be TP AR WN > 


. False (zero is neither positive nor negative) 
16. For example: 


a. The domain is the collection of licensed drivers in the United States; P(x) is the property that x is older 
than 14. 


b. The domain is the collection of all fish; P(x) is the property that x weighs more than 3 pounds. 


c. No; if all objects in the domain have property P, then (since the domain must contain objects) there is 
an object in the domain with property P. 


d. The domain is all the people who live in Boston; P(x) is the property that x is a male. (Not every person 
who lives in Boston is a male, but someone is.) 


17. Let x = 1; then x is positive and any integer less than x is = 0, so the truth value of the statement is true. 
For the second interpretation, let A(x) be “х is even,” B(x, y) be “x < у,” and C( y) be “y is odd”; the 
statement is false because no even integer has the property that all larger integers are odd. 


18. a. (Vx)[S(x) > I(x)] 
b. (Эх) (х) ^ (х) ^ М(х)] 
‚ VAMA) > SQ) ^ UW) 
‚ (VM) > (х) ^ Kx) 
‚ EDA) ^ (МУ (ЕС у) > Sœ, у))] 
‚ (МӘ[К@) > WVO) > 80, у) 
‚ (V9) Рох) > (AVM) ^ SC, у))] 
‚ VALO) A Sœ, y) > FQ)] 
20. Answer d. If L(x, y, t) means “x loves y at time ¢,” the original statement is 


19. 


a © c » бс o 


(vx)(3y)(G0LG; y, f) 


2 


— 


22. 


23. 


24. 


25. 


26. 
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and the negation is 


(WENED, у, DT <> (G3X)[GY) (OLG; у, д] 
€ (3x Vy)LGOLG, у, 0] 
€ (Ax)(Vy)(VOILG, у, D] 


or, “There is some person who, for all persons and all times, dislikes those persons at those times" or “Some- 
body hates everybody all the time." Answers (a) and (c) can be eliminated because they begin with a univer- 
sal quantifier instead of an existential quantifier; answer (b) is wrong because “loves” has not been negated. 


. Invalid. In the interpretation where the domain consists of the integers, P(x) is “x is odd" and Q(x) is “x 


is even,” the antecedent is true (every integer is even or odd), but the consequent is false (it is not the case 
that every integer is even or that every integer is odd). 


(Мх)[ Р(х) > RA] ^ ROT > [POY] 
1. (vx)[P() > К(х)] hyp 

2. [RC y)]' hyp 

3. Р(у) > RCy) 1, ui 

4. [PC y)]' 2,3, mt 
(Vx)[P@) ^ 969] > (VLOG) ^ Р(х)] 
1. (Vx) LP) ^ О()] hyp 

2. P(x) ^ Q(x) l,ui 

3. Q(x) ^ P(x) 2, comm 


4. (VX)|Q(x) ^ Р(х)] 3, ug 
(Vy)LPG) > Оф, »)] > P > WE, »)] 
1. (WIPE) > OG, у)] hyp 


2. P(x) hyp 

3. P(x) > Q(x, у) 1, ш 

4. Q(x, y) 2,3, mp 

5. (Vy)O(x, y) 4, ug 

OBA V CE > 46)] > (Ф) Вх) > 409] 

1. (LB V Cœ > 40)] hyp 

2. (B(x) V C(x)) > A(x) 1, ui 

3. B(x) temporary hyp 
4. B(x) V C(x) 3, add 
5. A(x) 2,4, mp 

6. B(x) > A(x) temporary hyp discharged 

7. (Vx)[B(x) > А(х)| 6, ug 

(Эх)А(х) ^ [GIx)LRGO) ^ S62] ]' > GV] 


The argument is valid. If something has property Ё but nothing has both property Ё and property S, then 
something fails to have property S. A proof sequence is 


1. (GVR) hyp 
2. [EVRE ^ sw] hyp 
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3. (V)[R() A S] 2, neg 
4. R(a) l, ei 
5. [R(a) ^ S(a)]' 3, ui 
6. [RAA] V [S(a)]' 5, De Morgan 
7. [ER] |! 4, dn 
8. [S(a)]' 6, 7, ds 
9. (3x)[S(x) ]' 8, eg 
27. The argument is (Vx)[R(x) > L(x)] ^ (3x)R(x) > GVL) 
1. (Vx)[R(x)  L(x)] hyp 
2. (3x)R(x) hyp 
3. R(a) 2,el 
4. R(a) > L(a) l, ui 
5. L(a) 3, 4, mp 
6. (3x)L(x) 5, eg 


28. deer grass (deer eat grass and grass is a plant) 
29.a. predator(X ) <= eat(X, Y) and animal(Y) 
b. bear 

fish 

raccoon 

bear 

bear 

fox 

bear 

wildcat 


30. Responses 7—9 result from in-food-chain(raccoon, Y); responses 10 and 11 result from 
in-food-chain(fox, Y); response 12 results from in-food-chain(deer, Y). 


31x —2=y,orx =y+2 
32. Working backwards from the postcondition using the assignment rule, 


{x + 4 = 7} 
у= 4 
{х +у= 7} 
а=х+у 
{= = 7} 


The first assertion, x + 4 = 7, is equivalent to the precondition, х = 3. The assignment гше, applied 
twice, proves the program segment correct. 


33. The two implications to prove are 


{x = 4andx < 5}y =x — 1{y = 3} 
{x = 4andx = 5}y = 7{y = 3} 
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The first implication is true by the assignment rule. Working backwards from the postcondition, 
{x 1 = 3} 
у=х— 1 
{у= 3) 
х— 1 =3З‹+›х = 4<х = 4апіх < 5 


The second implication is true because the antecedent is false. The program segment is correct by the 
conditional rule. 


CHAPTER 2 


1. Possible answers: 

a. A whale 

b. The integer 4. Four is less than 10, but it is not bigger than 5. 
2. a. Show that the conjecture is true for all cases: 


п | п? | 10-5n 
1 1 15 
2 | 4 20 
з | 9 25 
4 | 16 30 
5 | 25 35 


b. Forn = 7, п” is 49 but 10 + 5n is only 45. 


3. Let x be divisible by 6. Then х = 6k where k is an integer, and 2x = 2(6k) 12k = 4(3k). Since 3k is 
an integer, 2x is divisible by 4. 


4. a. If the river will not flood, then the rain will not continue. 
b. If there is not a network failure, then the central switch does not go down. 
. If the avocados are not dark or not soft, then they are not ripe. 
. If the diet is not good, the cat is not healthy. 
. If the river will flood, then the rain will continue. 
. If there is network failure, then the central switch goes down. 
. If the avocados are dark and soft, then they are ripe. 
. If the diet is good, then the cat is healthy. 
6. Let x = 2m + land y = 2n + 1 where m and n are integers, and assume that xy is even. Then 


co c» ao 


xy = 2k for some integer k 
or 

(2m + l)2n + 1) = 2k 
Multiplying out the left side, 


4mn + 2m + 2n + 1 = 2k 


10. 
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Rearranging terms in the equation, 
1 = 2k — 4mn — 2m — 2n 
Factoring out 2 on the right side, 
] = 2(k — 2mn — m — n) where k — 2mn — m — nisaninteger 


This is a contradiction since | is not even. 

P(1): 1 = 1(1 + 1)/2, true 

Assume Р(Ю: 1 + 2 +--+ k= kk + 1)/2 

ЗЗР D 1) +1] 
2 


Show P(k + D): 1-2 4 t (k4- 1) 


1+2+9++(&+1)=1+2+9 = +Е+(К+1) 
kk + 1) 


k 
-iD +@+у-@+(т+1) 


=e Теза 


2 2 


. The base case is n = 2. 


Р(2): 227! < 32, or 8 < 9, true 
Assume P(k): 2**! < 3* and k > 1 


€) 
Show P(k + 1): 24+? < 34+! 


2*+? 2. РА 1) 


< 2(3*) (by е inductive hypothesis) 
< 3(3*) (since 2 < 3) 
= 3k+1 


a. To verify P(A + 1) in implication 2’, we subtract 3 from k + 1. For the inductive hypothesis to hold, it 
must be the case that (k + 1) — 3 = 8, so k + 1 must be = 11. Therefore implication 2’ cannot be 
used to verify P(9) or P(10). 


b. The truth of P(A + 1) cannot be verified from the truth of P(A). For example, in trying to express 11 
as a sum of 3’s, and 5’s, knowing that 10 = 5 + 5 is no help. However, knowing that 8 = 3 + 5 is 
helpful because adding one more 3 gives 11 = 2 * 3 + 5. 


Q(0): jg = x t iy true since j = x, i = 0 before the loop is entered 
Assume Q(k): j = x ti, 


9 
Show O(k + 1): "n =x+ ips | 


Жы = Л + 1 (by the assignment j = j + 1) 
=(x+i)+1 (by inductive hypothesis) 
— x + (i, + 1) 
=x + iggy (by the assignment і = 7 + 1) 


Upon loop termination, і = y and j = х + y. 
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11. a. If d|a then a = та where n, is a positive integer. If d|b then b = n;d where n, is a positive integer. 
Therefore 
c = ia + jb = i(nd) + j(md) = (in,  jn;)d 
where in, + jn; is an integer and d|c. 

b. If d|c then с = nd where п = 1 because both c and d are positive, so c = d. 

12. From the Euclidean algorithm to find gcd(21, 16), 
1216-3-.5 
5-—21-— 1:16 
from which 
1716—3-(21— 1:16) = 4:16 — 3:21 

soi = —3,j = 4. 
13. Do a proof by mathematical induction. 

Base case: (k = 1). If p|a, then р|а 

Assume that if p|aja)...a, then p|a, for some j,1 <j = К 

Let p|aja, ... акц = (ааз... аа. Using the theorem on division by prime numbers, 

either playa... a, or play... If р|ауа»... a, then by the assumption р |а; for some j, 1 =] = k. 

Therefore p|a; for some j, 1 Sj =k + 1. 
14. 1176 = 2:3. 7? 
15. 420 = 2?-3-5-7 and 66 = 2:3 · 11, so рс1(420, 66) = 2:3 = 6 
16. Because р is a prime number, it has no factors other than itself and 1. Therefore every positive integer less 

than p has only the factor 1 in common with p, so it is relatively prime to p. Therefore g(p) = p — 1. 
17. ф(п) = 3%. 7[ф(3)Ф(5)Ф(7)| = 27 :7:2:4:6 = 9072 

СНАРТЕВЗ 

1. 1,4, 7, 10, 13 
2. 1,1,2, 3, 5, 8, 13, 21 


3. In proving ће k + 1 case, the terms F(k — 1) and F(k) are used. If k + 1 = 2, then the value at 2 posi- 


tions back, F(k — 1), is undefined. Therefore in the inductive step, we must have Ё + 1 = 3 and the case 
n — 2 must be done separately. Put another way, the inductive step does not demonstrate the truth of the 
n = 2 case from the truth ofthe n = 1 case. 


. A, B, and C are wffs by rule 1. By rule 2, (B') is a wff, and so then is (А V (B’)) and (A V (В')) > C). 


This can be written as (4 V B') C. 


5. Every parent of an ancestor of James is an ancestor of James. 
6. 1011001, 0011011, 00110111011 
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N 


NO — N н 


. А, 0, and 1 are binary palindromes. 


. If x is a binary palindrome, so are 0х0 and 1х1 


х= х 


eo 


х" = x'xforn1 
9. if n = 1 then 
return 1 
else 
return T(n — 1) + 3 
end if 
10. 10, 7, 8 
11. T(n) = T(n — 1) + 3 
= [Tn — 2) +3] -32 Tín – 2) + 2*3 
= [Tn —3) + 3] - 2*3 = T(n — 3) + 3«3 


In saree we guess that 
T(n) = T(n — К) + k«3 
When n — k = 1, thatis,k =n — 1, 
Тт) = TA) + (n — 1)*3 =14+ (n — 1)*3 


Now prove by induction that Т(л) = 1 + (n — 1)*3. 
T(1): T(1) = 1+ (1 — 1)*3 = 1, true 
Assume 7(k): Nk) = 1 + (k — 1)*3 


9 
Show T(k + 1): T(k + 1) = 1+ k«3 


T(k + 1) = ТЮ + 3 (by the recurrence relation) 
=1+(k- 1)*3 +3 (by the inductive hypothesis) 
= ] + k*3 


12. The recurrence relation matches equation (6) with c = 1 and g(n) = 3. From equation (8), the closed-form 
solution is 


T) = 170) + SB) 
i=2 
ме 
ї=2 
-1-4(n- 18 


13. a. From the base cases and the recurrence relation, the first five terms of the sequence are 


S(1) = 3, 50) = 1, SG) = 282) + 350) = 11, 
S(4) = 25(3) + 35(2) = 25, S(5) = 25(4) + 35(3) = 83 


b. The formula S(n) = 3"! + 2(—1)" ! generates 3, 1, 11, 25, 83 for n = 1,2, 3,4, 5. 
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14. су = 6 and с, = —5, so the characteristic equation is 


?—6r+5=0 
which has roots r, = 1, r; = 5. The solution has the form 
Т(л) = p + 4(5)" 
where 
pt+q=5 
р + q(5) = 13 
Solving this system of equations, p = 3, g = 2 and the solution formula is 
T(n) = 3 + 2(5)""! 
15. This is in the form of equation (16) with с = 2 and g(n) = 1. By equation (21), the solution is 


logn 
2®”(1) + У 2099711) = glogn + 2(овл)—1 + 2(овл)—2 acer: 29 
i=1 


= 20108 n) +1 =i] 


= p — 1 = 2n — 1 


16. n Sequential Binary 
Search Search 
64 64 Т 
1024 1024 11 
32768 32768 16 
CHAPTER 4 
1. a. {4,5, 6, 7} 
b. {April, June, September, November} 
c. {Washington, D.C.} 
2. a. {х|х is one of the first four perfect squares} 
b. {хх is one of the Three Men in a Tub in the children’s nursery rhyme} 
c. {x|x is a prime number} 
3. а. А = (xx eNandx = 5} 
b. В = {3, 4, 5} 
4.xcB 


5. AC B means (V)x E A > хє В) A (Ay lve BA^(y e Ay] 
6.a,b,d,e,h,il 
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7 Let x € A. Then x € К and x? — 4x + 3 = 0, or (x — 1)(х — 3) = 0, which gives x = 1 or x = 3. In 
either case, x € N and 1 = x = 4, so x € B. Therefore А С B. The value 4 belongs to B but not to A, so 
A C B. 


8. &(4) = (2, {1}, {2}, {3}, 0, 2}, 41,3}, {2, 3}, 0,2, 39 
9. 2” 
10. By the definition of equality for ordered pairs, 


2х-у=7апйх+у=—1 
Solving the system of equations, 


x=2,y=—-3 
11. (3, 3), (3, 4), (4, 3), (4, 4) 
12. a. Sis not closed under division. (3 + 4 is not a positive integer) 
c. 0? is not defined. 
f. x” is not unique for, say, x = 4 (2? = 4 and (—2) = 4). 
13. Yes; if x € А N B, then x € A (and x є B, but we don't need this fact), so x € AUB. 
14. A’ = {x|x e Sandx € A} 15.4 — B = {x|x € Aand x ¢ B} 


S S 


A A-B 
. {1, 2,3, 4, 5, 7, 8, 9, 10} 
. {1,2,3} 
. {1,3, 5, 10} 
. A X B= {(1, 3), (1, 4), (2, 3), (2, 4)} 
. B X A = {(3, 1), (3, 2), (4, 1), (4, 2)} 
. А? = {(1, 1), (1, 2), (2, 1), (2, 2)} 
La = {(1,1,1), (1,1,2), (1,2, 1), (1,2,2), (2, 1, D, (2, 1,2), (2, 2, 1), (2,2, 2)} 


18. Show set inclusion in each direction. To show A U Ø C A, let x € AU Ø. Then x є A or x e Ø, but since 
Ø has no elements, x є A. To show 4 C A U Ø, let x є A. Then x € A or x є Ø, so x e A U Ø. 


19.a. [(CN(AUB)]U[(AUB)NC’] 
= [(AUB)NC]U[AUB)NC] (b 


16. 


17. 


© c 9p o c g 


с. 


= (АОВ)П uc: (3b) 
=(AUB)NS (5а) 
=AUB (4b) 


b. [CU(ANB)IN[(ANB)UC']=ANB 
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20. An enumeration of the even positive integers is 2, 4, 6, 8, 10, 12,.... 
21. 1/5, 5/1 22. 4(8)(5) = 160 23. 7(5) + 9 — 44 


24. Although the problem consists of successive events—the five tosses—the number of outcomes of each 
event is not constant but varies between one and two depending on the outcome of the preceding event. 


Y Y Y 
X X X X X X X XY YYY Y Z zzz 2 2 Z Z 
X X X Y YZ ZZX X XY Y X XXY YZ Z Z 
X Y Z X YX Y ZX Y ZX Y X Y ZX Y X Y Z 
21 strings 
26.AU B 


27. Equation (2) gives the result of Example 31 because if А and В are disjoint, then 
IAN B| = 0. 


28. The reasons for the equalities are 
set union is associative 


equation (2) 
equation (2) and set identity 3b (distributive property) 
equation (2) 
rearranging terms 
! 
29. 7 (The bins are the 6 possible values.) 32. C(12, 3) = x = 220 
20! Е 
30. Р(20, 2) = 18 > 380 33. a. 18 b. 24 
9! 
31. 6! — 720 34. — 
312! 
8! 


35. Неге r = 6 and n = 3, with repetitions. C(r + n — 1,r) = C(8, 6) = ET = 28 


36. cysar, scary, scrva, scyra, yarcs, yarsc 
37. Here five digits are being permuted, so и = 5. At this point 


di =5,d, = 1,4; = 4, d4 = 3,4; = 2 


In the for loop that generates all permutations after the first, set i = 4,7 = 5. Consider pairs of adjacent 
values from right to left as long as d; > d;: 

а, > d; 3 >2 true 

d; > d, 473 true 

d, > d 1>4 false 


38. 


39. 


40. 


41. 
42. 


— 


43. 
44. 


45. 


46. 
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The value of i at this point is 2, and d; — d; — 1. Set j — 5; consider d; values from right to left as long 
as d; > d; 


d, > d; 1>2 false 


The value ofj at this point is 5. Swap d, and d;, giving 52431. Take the descending sequence right of 4, 
namely 431, and swap pairs of values from the outside in to reverse the sequence. Swap 4 and 1, giving 
52134. The indices i from the left and j from the right meet in the middle, at which point the reversal —and 
the new permutation—is complete. 

The rightmost non-max element is 5, which becomes incremented to 6. The two digits to its right are reset 
to their minimum values of 78. The next combination is therefore 24678. 


(a + by = а? + ЗаЬ + Зар + DP 
Coefficients: 1 3 3 1, which is row n = 3 in Pascal’s triangle 


(a + Ьу = а + 4a°b + ба?Ь? + да? + bt 


Coefficients: 1 4 6 4 1, which is row n = 4 in Pascal’s triangle 


(x + D = C6, Oe + CG, 1)х* + CG, 2 + C(5, 3) + C(5, 4)х + CG, 5) 
= x + 5x4 + 10? + 10x? + 5x4 1 
C(7, 4x7 y4 
The sample space consists of all the cards in the deck, so |S| = 52. The event of interest is the set of aces, 


so |E| = 4. P(E) = 4/52 = 1/13. 

16/42 = 8/21; 28/42 = 14/21 

a. p(c) = 1 — (p(a) + p(b)) = 1 — (0.2 + 0.3) = 0.5 
b. p(a) + p(c) = 0.2 + 0.5 = 0.7 


E, = (HH, HT, TH}, Е, = {HH}, Е, ПЕ, = {HH} 
Р(Е ПЕ) 1/4 
Р(Е21Е1) = — = = 1/3 
(Е2|Е1) PE) 3/4 / 
Let 
E, be the event that the package came from Supplier A 
E, be the event that the package came from Supplier B 
F be the event that the package was lettuce 
By Bayes’ theorem, 
Р(Е|Е))Р(Е 57/100)(1/2 
P(E||F) = (Е|ЕЈ)Р(Е)) (57/100)(1/2) — 


P(F|E)P(E,) + Р(Е|Е,)Р(Е,) (57/100)(1/2) + (39/100)(1/2) 


47. E(X) = 5(2/8) + 2(3/8) + 32/8) + 7(1/8) = 29/8 = 3.625 


48. 


This is still a Bernoulli trial where n = 200 and р = 1/2. From the binomial distribution, the probability 
of 100 heads is C(200,100)(1/2)'°(1/2)' = 0.056. 
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CHAPTER 5 


© 


сы с бсо с ® бо с ®© бс осы 


. (3,2)єр 

. (2, 4), (2, 6) єр 

. (3, 4), (5, 6) єр 

. (2, 1), (5, 2) єр 

. Many-to-one 

. One-to-one 

. Many-to-many 
_xpUa)yoxsy 
хр yex-zy 


Xg'yexzy 


.рПс = ӧд 
. (1,1), (2, 2), (3, 3) 
. Knowing that a relation is symmetric does пої by itself give information about any of the ordered pairs 


that might belong to p. If we know that a relation is symmetric and we know some ordered pairs that 
belong to the relation, then we know certain other pairs that must belong to the relation (see part (c)). 


‚ (b, a) 


а. а= Б 
e. The transitive property says (x, y) є p / (y, Z) € p > (x, 2) Є р. In this case (1, 2) is the only element 


a 
b 

с 

d 
е. 
f. 
g 
h 


of p and (2, z) € p for any z in S. Therefore the antecedent of the implication is always false, and the 
implication is true; p is transitive. 


. Reflexive, symmetric transitive 

. Reflexive, antisymmetric, transitive 
. Reflexive, symmetric, transitive 

. Antisymmetric 


Reflexive, symmetric, antisymmetric, transitive 
Antisymmetric (recall the truth table for implication), transitive 


. Reflexive, symmetric, transitive 
. Reflexive, symmetric, transitive 


6. No. If the relation has the antisymmetry property, then it is its own antisymmetric closure. If the rela- 
tion is not antisymmetric, there must be two ordered pairs (x, y) and (y, x) in the relation with x = y. 
Extending the relation by adding more ordered pairs will not change this situation, so no extension will 
be antisymmetric. 


т. Reflexive closure: {(a, a), (b, Б), (c, c), (a, c), (a, d), (b, d), (c, a), (d, a), (d, а)} 
Symmetric closure: {(a, a), (b, b), (c, c), (а, c), (a, d), (b, d), (c, a), (d, a), (d, b)} 
Transitive closure: ((a, a), (b, Б), (c, c), (a, с), (a, d), (b, d), (c, a), (d, a), (d, а), (d, c), (b, a), (b, с), (с, 4)} 


8. a. 


b. 


с. 


(1, D), (1, 2), (2, 2), (1, 3), (3, 3), (1, 6), (6, 6), (1, 12), (12, 12), (1, 18), (18, 18), (2, 6), (2, 12), 
(2, 18), (3, 6), (3, 12), (3, 18), (6, 12), (6, 18) 

1, 2,3 

2,3 
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Є 5 


һу 


9. 12 18 10. y є S is a greatest element if x < y forall x € S. 
y E S is a maximal element if there is no x 
4 with y < x. 
2 3 
1 
11. 12.Let q € [x]. Then xpq. Because x pz, 


symmetry z p x. By transitivity, z p x together 


with x p q gives z p q. Therefore, q є [z]. 


13. Reflexive: For any x є S, x is in the same block as itself, so x p x. 


Symmetric: If x p y, then x is in the same block as y, so y is in the same block as x, or y p x and 
symmetric. 


p is 


Transitive: — If x p y and y pz, then x is in the same block as y and y is in the same block as z, so x is in 


the same block as z, or x p z. 
Therefore p is an equivalence relation. 
14. a. The equivalence classes are sets consisting of lines in the plane with the same slope. 
b. [n] = (n); the equivalence classes are all of the singleton sets of elements of №. 


c. [1] = [2] = {1,2}, [3] = {3} 


15. [0] = {..., —15, —10, —5, 0, 5, 10, 15, ...} 
[1] 2 {..., —14, —9, —4, 1, 6, 11, 16, ...} 
[2] = {..., 7-13, 2-8, —3, 2, 7, 12, 17, ...} 
[3] = {..., 7-12, 77, —2, 3, 8, 13, 18, ...] 
[4] = {..., -11, ^6, —1, 4, 9, 14, 19, ...} 
16. 2 17. 4(6) 


1(4) 2(3) 3(7) 9(5) 


7(5) 


18. Minimum time to completion is 36 days. Critical path is 1, 2, 3, 4, 6, 8, 9. 
19. For example: 1, 3, 2, 6, 7, 5, 4, 8, 9, 10, 11, 12 
20. For example: 1, 2, 3, 7, 5, 4, 6, 8, 9 
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21. 22.a. project(join(restrict PetOwner where Pet- 
Locale c » 
Type = *Dog") and Person over Name) over 
Name State City giving Result 
Patrick, Tom FL b. SELECT City 
Smith, Mary IL FROM Person, PetOwner WHERE Person. 
Collier, Jon IL Name = PetOwner.Name AND PetType = 
Jones, Kate OH “Dog” 
Smith, Bob MA c. Range of x is Person 
White, Janet GA Range of y is PetOwner 


{x.City | exists y( y. Name = x.Name and 


Garcia, Maria NY y.PetType = “Dog”)} 


23. а. Nota function; 2 є S has two values associated with it. 
b. Function 


c. Not a function; for values 0, 1, 2, 3 of the domain, the corresponding A(x) values fall outside the 
codomain. 


. Not a function; not every member of S owns an automobile. 
. Function (not every value in the codomain need be used) 
. Function 


. Nota function; 5 є N has two values associated with it. 


d 
e 
f. 
g. Function 
h 
a. 16 b. +3 23 E 


26. fx) = |х] f(x) = [x] 


27. fand g have the same domain and codomain, so we must show that each function has the same effect on 
each member of the domain. 


1 
S (4k — 2) 
Е = 4-1-2 2 
JU) 1 80) = 2 amm а ш. 
2 
Y (4k - 2) 
1 === 


(4:1-2)*(4:2-2) 2+6 
2 Е 2 Ta 7 


ЈО) =4 е0) = 4 
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3 
ы. 4:1-2) + (4-:2-2) + (4:3 – 2 
ЛЗ) = 9 2(3) = = =! = = l 


2 2 


22+6+10 | 
== 
Therefore f = g. 
28. b, f, g 
29. If P is either a tautology or a contradiction. 
30. e, g 
31. (g ° f X2.3) = 2( (2.3) = #((2.3)”) = 265.29) = |5.29] = 
(Fe D-3) = (2:3) = A23) = Д2) = 2^ = 4 


32. Let (g ° f (s1) = (g ° f (sz). Then g( f(s1)) = g( f(s2)) and because g is one-to-one, f(s;) = / (5). Because 
fis one-to-one, sı = s2. 


33. Let t € T. Then (f° g)(t) = Fe) = f(s) = t. 
34. f RA R, fœ = (x — 4)/3 
35.a. (1,4, 5) = (4, 5, 1) = (5, 1,4) 
Я ( 2 3 4 | 
U 42 52 
36.a. gof = (1,3, 5,2,4) = (3, 5,2,4, 1) =... 
fog = (1,5,2,3, 4) = (5,2,3,4, 1) =... 


12345 
»asf- (à 25 1 `) 
12 34 5 
fac (3 1 3 5 i) 
012 3 4-5 
s asf- fa - (i 5 1 4 >) 


37. (1, 2, 4) ° (3, 5) or (3, 5) ° (1, 2, 4) 


vov DO 


39. One possibility: {(0, 0), (1, 1), (C1, 2), (2, 3, (C2, 4), (3, 5), (C3, 6), . 
40.a. 10.87 < 12 x 1087 
22.27 = 27 = 2227 
37.67 = 48 = 3767 
57.07 = 75 = 5707 
b. No 
с. по = 1, су = 1/200, c = 1 
41. a. Let fp g. Then there are positive constants ле, сі, and c; with c,g(x) = f(x) = cog(x) for x = по. Then 
for x = no, it is true that (1/с) (х) = g(x) = (1/c))f(x), so g p f. 
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b. Let fp g and g p h. Then there are positive constants no, n1, C1, c5, dj, and d; with суох) = f(x) = csg(x) 
forx = ngandd,h(x) = g(x) = djh(x)forx = n,.Thenforx = тах(по, nı), cid; A(x) = fix) = cod;h(x) 
so fph. 

42. Зх? = O(x)) using constants ny = 1, c, = c; = 3. 

200x? + 140x + 7 = O(x)) using constants ny = 2, c, = 1, с = 300. 

43. a. If x = y (mod n) then x — y = kn for some integer k, so x — y = kn + 0 and (x — y) mod n = 0. 


Conversely, if (x — y) mod n = 0 then x — y = kn + 0 for some integer К, or x — y = kn so 
x = y (mod n). 

b. Let x = дүп +r, 0 r, < n and у= qon t rj, 0 = р < n, so x mod n =r, and y mod n = n. 
Then x — y = (дүп + rj) — (дәп + n) = (д, — qn + (ri — rj) with -n «€ rj — rn € n. 
If x mod n = y mod n, then r; = r so r, — r, = 0 and x — y = (9; — q;)n where q, — q is an 
integer, so x = y (mod n). 
Conversely, if x = у(тойл), then x — y = kn for some integer k. Because x — y = 
(41 — qn + (ry — r) with —n < ri — r; < n, ri — ra = 0and x mod n = y mod n. 


44. 

0 | 48 o| e 

1| 28 1| 

2 2| © 

3| 23 3| — 23 |e 
4 4| o 

5 s| o 

6 6| • 

А: 71| — 7 | 
8 | 158 8| — 28 | 4—4 48 | 4—4 158 lo 
9 | 59 9| Ы 59 |e 


(а) (b) 


45. THE CAT IN THE HAT 
46. x = 1011, p = x mod 2? = 0011, g = р: 2 = 0110,s = x Op = 1000, = s -27° = 0001, 
у= 4+і= 0111 
47. (166) той 221 = (166?)!' · 166 mod 221 = (27556)! · 166 mod 221 
= (152) - 166 mod 221 = (1522) · 152 - 166 mod 221 = (23104) · 152 · 166 mod 221 
= (120) · 152 · 166 mod 221 = (1202)* · 152 · 166 mod 221 
= (14400)* · 152 · 166 mod 221 = (35)* · 152 · 166 mod 221 
= 352.352. 152: 166 mod 221 = 120 · 120 - 152 · 166 mod 221 
14400 - 25232 mod 221 = 35 · 38 mod 221 = 4 


48. а. 3 b. X 


49. 


50. 
51. 


53. 


54. 


55. 
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temp = 375 

ones = temp mod 10 = 5 

temp = (375 — 5)/10 = 370/10 = 37 
tens = temp mod 10 = 37 mod 10 = 7 
temp = (37 — 7)/10 = 30/10 = 3 
hundreds = temp = 3 


аз = l, a4 = —7, аз = —6 
52. 
6 14 Аве | 
А +В= |3 10 12 28 
9 16 B-A= p d 
27 4 
I-A = Цап) + Olan) Ца) + ‚| E К a =A 
0a) + Каз) Olan) + 1an) а d» 
Similarly, А. I = A. 
ep 2 3] |] 4 —3] 1 0 0| 
А.В = 2 1 0 10 -7 =10 1 0 
4 -2 SJL 8 —6 5 ] 0 0 1 
-5 4 -3|[-1 2 3] 1 0 0] 
Вв-А=| 10 -7 6 2 1 =|0 1 0 
8 —6 SJL 4 -2 5j 0 0 1j 
The augmented matrix is 
[| =5 ` 
7 1 37 
Multiply row 1 by 1/3: 
| —5 ; 
7 1 37 


Then multiply row 1 by —7 and add it to row 2: 


AC TAA | resultin in|! ык OP 
7 Do Б 38/3 76/3 


Using the second row, 
(38/3)y = 76/3 or y = 2 
Using the first row, 
x — (5/3)у = 5/3 or x — (5/3)(2) = 5/3 orx = 5. 


The solution is x = 5, y = 2. 
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NI х | T AY 

1 |1 1 

1 |0 0 

0 |1 0 

olo 0 
57. 


x y xVy 
1 1 1 
1 0 1 
0 1 1 
0 0 0 
58. 
ВХА = 


CHAPTER 6 


1. One possible picture: 


10. 


11. 
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. Possible answers: 
a. 2 and 3 Ё 2, а), l, do, 3, dy, 4, a4, 3, a3, 4 
b. 5 g. 3, аз, 4, ay, З 
с. dg h. no 
d. a4 and a, 1. yes 
e253 
5. 
Куз 
Ks 


a. In a complete graph, any two distinct nodes are adjacent, so there is a path of length 1 from any node 
to any other node; hence the graph is connected. 


b. For example, the graph of Figure 6.10b. 


f: d4 е; 8. 1—4 
ds — eg 2e 

а €; 3f 
аз — e; (Or eg) 4 с 
dg е, (ог es) 5 ә Б 
6a 


. The graph on the left in Figure 6.19 has two nodes of degree 2, but the graph on the right does not; or the 


graph on the left has parallel arcs, but the graph on the right does not. 


K, can be represented as 


Making 1—3 and 1—4 exterior arcs leads to the graph below, where it is still impossible to make 3 and 5 
adjacent while preserving planarity. 
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12. An attempt to construct К; ; as a planar graph leads to the graph below; there is no way to connect nodes 
3 and 5. Any other construction leads to a similar difficulty. 


13.n=6,a=7,r=3,and6—-—7+3=2 

14. Without this condition on the arc, a figure such as the one below could result. Then the graph would be 
split into two disconnected subgraphs and the inductive hypothesis would not apply. Also the number of 
regions would not change. 


15. In K55, a = 9, n = 6,and9 = 3(6) — 6. 


16. 


1 
0 
2 
0 


_ O =. m. 
or oF 
оо н © 


20. 


21. 


22. 


23. 


24. 
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a. Left child | Right child b. ESESN 


0 2 


a BR ON + 
о|о|о| о 
о|о|[о| A 


a, b, e, f,c, d, g, і, h 
e, b, f,a,c,i, g,d,h 
OD Cyl, Beh й„@ 


+ Prefix notation: + а— *bcd 
Postfix notation: abc*d— + 


For the base case, n = 1, the tree consists of a single node and no arcs, therefore no arc ends. The number 
of arc ends is 0 — 2(1) — 2. Assume that any tree with k nodes has a total number of arc ends of 2k — 2. 
Consider a tree with k + 1 nodes, and show that the number of arc ends is 2(k + 1) — 2. In this tree, 
remove a leaf node and the arc to that node's parent. This leaves a tree with k nodes and, by the induc- 
tive hypothesis, 2k — 2 arc ends. The original graph had one more arc, and two more arc ends, so it had 
2k — 2 + 2 = 2k = 2(k + 1) — 2 arc ends. 


The base case is the same as in Practice 23. Assume that tree T is constructed from subtrees 7), ... , T, and 
that any subtree 7; with n; nodes has (2n; — 2) arc ends. Let п equal the number of nodes in 7. Then, as in 
Example 30, 


P t 
n-ltnso2n-2-t2»»mn, 
e = 


= = 


t 


The number N of arc ends in T is 2t + У (number of arc ends in 7;). (The extra 2t counts the number of 
i=l 


arc ends contributed by the ¢ arcs from the root of T to the ¢ subtrees.) Then 


t t t 
N= 2t+ Yn — 2) 22t t 2Y n 2t 2 2Y n = 2n- 2 
i-1 i=] і=1 


| 836 Answers to Practice Problems 


L[1] «x « L[2] ЦА] <х< Ц5] x»LI5] 


L[2] « x « L[3] 


b. Depth oftree = 3 = 1 + |log5] 
26. a. 12 b.d=3 


27. a. 11:10] 


1021: L[3] L[1]: L[3] 


Ц < L[2] < L[3]  L[1]: L[3] 


< 


L[1] < L[3] < L[2] L[2] < L[3] < L[1] 
L[3]< L[1]< L[2]  L[3]< L[2] < L[1] 
28. a. ppca? b. cagak c. ?kac? 


29. 100 


42 58(е) 


20(h) 22 


10(w) 12(q) 


30. w: 010 
4: 011 
h: 00 
е: 1 
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1. {(2, 1),(2, 2,G, 1),G, 4)} 1 2 


3 4 


2. There are two distinct nodes, 3 and 4, with 3 p 4 and 4 p 3. 


3. 0 


0 
0 
1 


A®[2, 1] = 1 because there is a path from 2 to 1 of length 2 (2—4—1). 
4. There is a length-4 path (2-4—1—4-1) from 2 to 1, so A®[2, 1] should be 1. 


0 


0 
0 
0 


0 
0 
0 
1 
АЗ) = | 


— о о о 


оо о c 


0 
0 
0 
0 


0 


1 
0 
0 


[« о н н 


1 
1 
0 
0 


A® = 


оо о c 


о о 


0 


0 


0 
0 0 
1 


5. В = A V AO УА V А), so performing the Boolean or of the four matrices from Practice 4 gives 


6. [0 
0 

M, = 
^ [0 
1 
0 
0 

М, = 
> 10 
1 


0 


ooo o occ ое 


0 1 
1 1 
0 0 
0 1 


ш о н н 


0 1 1 0 0 0 
1 1 1 0 0 0 
AO — 
0 0 0.00 0 
0 0 0 0 0 1 
0 1 1 0 0 0 
0 1 1 0 0 0 
AY = 
0 0 0 0 0 
0 0 0 0 0 1 
A 
1 0 
R= 1 0 
0 0 
1 0 
Column 2 is all Os, so 2 is not reachable from any node. 
0 1] 0 0 0 1 
1 1 0 0 1 1 
M = 
0 0 ' 10 0 0 0 
0 0 1 0 0 1 
0 1 1 0 0 
1 1 1 0 1 
М; = Е = 
ooj * 0 0 0 
0 1 1 0 0 
b. Yes 


7. a. No 


— о о oc 


ооо oc 


оо н oc 


к* о к» н 
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8. a. No, four odd nodes b. Yes, no odd nodes 
9. No, all four nodes are odd nodes. 
10. A4 B C D 


AJO 2 2 1 
B2 0 0 1 
C|2 0 0 1 
D|1 1 1 0 
After row C, total — 3, the loop terminates, and there is no path. 
11. a. No b. Yes 
12. р = 1 
IN = {x} IN = {x, 1} 
x 1 2 3 у x 1 2 3 y 
8 |. T ж 4 & d/o 1 4 2 6 
s| =- x x x x sl- x A 1 1 
p=3 p=2 
IN = {x, 1,3} IN = {x, 1, 3, 2} 
X 1 2 3 y X 1 2 3 y 
d 0 1 4 2 5 d 0 1 4 2 5 
$ = x 1 1 3 $ X 1 1 3 
р=у 
IN = {x, 1, 3, 2, y} 
X 1 2 3 y 
d 0 1 4 2 5 
S — X 1 1 3 


14. a, e, d, b, c, i, f, g, h, L, k, m, j 
15. a, e, f, d, i, b, c, g, h, j, k, m, 1 
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17. Choosing node c (arbitrarily) as the start node, we visit e, then have to back up to c. There's nowhere else 
to go, so start over from node a (again, an arbitrary choice from the unvisited nodes). Traveling from a to 
b to d and then backing up produces the rest of the node numbers. 


a (5) c (2) 


e (1) 
b (4) 


d (3) 


Topological sort: а, b, d, c, e 

18. The depth-first search progresses from node a to b and then c, with TreeNumbers and BackNumbers as- 
signed in sequence. At node c, the back arc to node a causes BackNumber of c to be changed to TreeNum- 
ber of a. The search progresses to d, which has a sequential TreeNumber and BackNumber assigned. The 
search then backs up to node c. Because BackNumber(d) > TreeNumber(c), c is recognized as an articula- 
tion point. The search backs up to node b, and b’s BackNumber is reduced to that of c. The search backs 
up to a, but a is not an articulation point because it is the starting node with only one tree arc. 


CHAPTER 8 
1. 0:120 2. а. АМА = А 
1:121 b AUA=A 
3. а х+1=х + (х +х) (5а, complement property) 
= (х+х)+х' (2a, associative property) 
= х + x’ (idempotent property) 
= 1 (5a, complement property) 


b. х:0= 0 
4. То prove that 0’ = 1, show that 1 has the two properties of the complement of 0. 


0+1 = 
0-1 


1 (universal bound) 
1-0 (1b) 
0 (dual of universal bound) 


Therefore 1 = 0’ by the theorem on the uniqueness of complements. 


5. а. fix : y) = fix) fO) 
b. fix’) = [A] 
6. Property 2: TE s -X4y 
f f 


fo fO — t fa + y) = feo & f(y) 


Р 


Р 


£e 


р о о с 
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roperty 3: x, y ———— x -y 


f(x), fü) —M—— f(x + y) = FOO * FO) 
roperty 4: х ү! 


f J 


f(x) — e f(x) = (ў0х))” 


. fO + a) = fla) = {1} = ØU {1} = f(0) U f(a) 


. fla + a’) = К) = (5,2) = {1} U {2} = Ka) Ufa’) 


«ias = R0) = Ø = 10 N {2 = Да) Nfa’) 


JU) = fO =Ø = {1,27 = (FO 


. A single truth function on (0, 1)" must map each of the elements in the domain to a 0 or a 1, and there 


are 2" n-tuples in the domain (0, 1)". Hence the table for the function will have 2" rows. 


. Any truth function must fill 4 “slots” (corresponding to the 2? = 4 domain elements) with one of 2 
values, a 0 or a 1. There are 2^ = 16 different ways to do this. 


. Any truth function must fill 2" *slots" (corresponding to the 2" domain elements) with one of 2 values, 


a 0 ora 1. There are 2” different ways to do this. 


X Xi + x} 
X5 


хх + X4) 


Xi Хо | Xs | (ху + XX3 
1 1 1 0 
1 1 0 1 
1 0 1 0 
1 0 0 0 
0 1 1 0 
0 1 0 1 
0 0 1 0 
0 0 0 1 
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11. а. xxx; + XXX;  xQxoxa4 + xqxox4 + ХүХәХ3 


b. х 


Xi p X4X3X3 + X4X5X3 + X4X3X5 + X4X3X3 + X1X3X3 


12. 101 
ut 
(1)100 


13. a. 


b. xps + x; = (œx) xX) X 


А ((х,х3) x3) 
X3 
х; 
14. а 

Хү х2 f(X1, X2) 

1 1 0 

1 0 1 

0 1 1 

0 0 0 
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Here x; = 0 when the switch is in the “off” position and /(х, x.) = 0 when the light is off. The last 
row of the truth table says that when both switches are off, the light is off. Rows 2 and 3 say that when 
one or the other switch is turned on, the light goes on. But the top row says that if the light is on (be- 
cause one of the switches has been turned on) then flipping the second switch turns the light off. 


b. One possibility is the canonical sum-of-products form, xix; + x{xz 


C. Xi 
X2 
Xi 
X2 
15. а. хух + хүр = xx, + xx, b. x, + xix; = ху 1 + xix; 
= (х + x") = x,(1 + x) + xix; (universal bound) 
= x1 =X, + ХХ) + XX 
— — ГА 
= X = x, + x6x + xi) 
= X1 EB Хә E 1 
= Хут 5 
16. x Х| 


The reduced expression is x/ 
17. хухз (4 squares) and x|x5x4 (2 squares) 


18. xyx4x4 + Xix4x4 + 525 


Number of 1s | x, Xo Xs Number of 15 | x, Xo X3 
Three 1 1 1 1 Two 1 1 = 
Two 1 1 0 12 One 1 - 0 
Опе 1 0 0 E None - 0 0 

Oo dw 1 + о 0 - 
None 0 0 0 3a 
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The comparison table follows. 


111 | 110 | 100 | 001 | 000 
11- "4 "4 
1-0 V / 
-00 V 4 
00- v А 


Essential terms are 11— апа 00—. Either 1—0 or —00 can be used as the third reduced term. The minimal 
sum-of-products form is 


XX + xqxj + XX; or XX t x(xj + xjxi 


CHAPTER 9 


. Multiplication in R is associative and commutative, and 1 is an identity. But [К, ·] is not a commutative 


group because 0 є К does not have an inverse with respect to multiplication; there is no real number у 
such that 0: y = y-0 = 1. 


2. See Practice 52 of Chapter 5. 
3. Many elements of M;(Z) do not have inverses under matrix multiplication. For example, if 


2 
. No, 5 is not closed under multiplication; for example, = · 


a ud 
[0 2] 
15 to have an inverse 
a Б] 

c d 


under multiplication, then 
2 0 


o able ave ail allo i 


By the definition of matrix multiplication, the only matrix that satisfies this equation would have 2a = 1 
ога = 5 and thus would not be a member of M;(Z). 


. Subtraction is not associative; for example, 5 — (3 — 1) = 3 but (5 — 3) – 1 = 1. 


3 


== 1, 
ee 


. All except [IR*, +] (which has no identity) are monoids; the identities are, respectively, 0, 1, 1, 0, Ø, S. 


[R, +] 
a. f(x) + g(x) = g(x) + f(x) 

[ f(x) + g(x)] + A(x) = Дх) + lg) + ^69] 
b. the zero polynomial, 0 


c. —7x* + 2x3 — 4 
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9%. a +, | 0 1 2 3 4 5 0 1 2 3 4 b. 0; 1 
0 0 1 2 3 4 0 0 о о 0 0 с. 3 
1 1 2 з 4 0 1 0 1 2 3 4 d. allexSento 
> | 2 з 4 0 1 2 0 2 4 1 3 
3 |3 4 0 1 2 3 0 3 1 4 2 
414 0 1 2 3 4 Ü 4 3 2 1 
10.а. 4 0 1 2 3 4 5 b. 1 and 5 
0 0 0 0 o 0 0 
1 0 1 2. 4 4 5 
$ 0 2 4 0 2 4 
3 0 3 o 4 0 3 
4 0 4 2 0 4 2 
5 0 5 4 3 2 1 


11. a. 011015 11011 = 10110 
b. —10100 = 10100 because 10100 +, 10100 = 00000 


12. Let f, g, h є S. Then for any x є A, (f° g) e Ala) = (F° DAW) = felh) and [7° (е » A) |x) = 
Slee WE] = felha). Hence, (f° g) ° A = f» (g» А). 


13.a. , а оу 3 о Gs а b. No, because œ, ° аз = ag but a3 ° a» = as. 
а 04 ap a3 O4 as ae 
Qo 02 ay ag as ay a3 
a3 a3 as а Og ao a4 
о | о Qe — Og ay аз: 65 
as as a3 ay Oto Og ay 
ae a6 a4 Ao a3 04 as 


14. a. No, because ab: a = aba but a: ab = aab. 


b. No, because no nonempty string has an inverse; for example, there is no string to concatenate with a to 
get A. 


15. ij = i47 in because i, is an identity i, ‘iy = iy because i, is an identity 
16. Let y and z both be inverses of x. Let i be the identity. Then y = y: i = у: (х:2) = (у.х) :2 = iczz. 
17 77! = 5,37! = 9; so 107! = (7 +43) 23447! 29455-2 


18. Z'X—z'y implies 20. x 1 a b c d 

z 5, (zex) =z (zy) 1 1 a b c d 

-l.Aa.—(.Là.A. a a b С а 1 

>л ш. blb c d 1 a 

d m mE сіс d 1 a b 

х=ўу ара 1 A b C 
19.x = 1 +,(3) 1 = 1+5 =6 21. a. [Zig, +18] 


b. [S3, °] 


22. 
23. 


24 


26. 


27. 


28. 


29. 


30. 


31. 
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Requirement 2, i € A, ensures that А 7 Ø. 
a. Closure holds: 
+,|0 2 4 6 
0 0 2 4 6 
2 2 4 6 0 
4 4 6 0 2 
6 6 0 2 4 
0 e {0,2,4,6}; 07! = 0,47! = 4, and 2 and 6 are inverses of each other. 
. Closure holds: 
+ |1 2 4 
1 1 2 4 
2 2 4 1 
4 4 1 2 
1 є {1,2,4}; 17! = 1, and 2 and 4 are inverses of each other. 


. ал, as, as}, o] Кал, ay}, °] ЕСА a3}, °] [{a,, o4], °] 


25. 


Toshow that f is one-to-one, let œ and З belongto A, andsuppose f(a) = /(8).Тһепе ° (1, 2) = B » (1, 2). 
By the cancellation law available in the group 5,, œ = В. To show that fis onto, let y € O,. Then 
у ° (1,2) eA, and fly ° (1,2) = y ° (1, 2) ° (1,2) = уі =i. 

For nz, nz, E€ NZ, пд + nz, = n(z, + 2) EnZ, so closure holds; О=л.0єл#, for nz € nZ, 
—nz = n(—z) є nZ. 


x,y „===== == пыз y 
/ f 
Хб), О) = f(x у) 2 fe) +f) 
f(s) + f(s ) = f(s:s = fis = ip. Similarly, f(s )) + f(s) = ip. Therefore f(s !) acts like the in- 
verse of f(s) in T, and since inverses are unique, f(s !) = —f(s). 
Let t, and t; be members of T. Because f is onto, ¢; = f(s,) and t; = f(s;) for some s;, s; € S. Then 


t + h = f(s) + f(s) = Asi s) = Asp + 51) (because [S, -] is commutative) 
= f(s.) + f(s)) = + ty 


so [T, + ] is commutative. 

Clearly f is onto. f is also on-to-one: Let f(x) = f(y). Then 5x = 5y and x = y. f is a homomorphism: 
For x, y € Z, f(x + y) = 5(x + y) = 5х + 5y = f(x) + Ду). 

a. Composition of bijections is a bijection, and for х,ує 5, (gef)x:y) = g(f(x:y)) = 


alfa) + fü) = sf) * Cf) = (g *f£)6)*( 9 AVY) 


b. S = S by the identity mapping. If f is an isomorphism from S to T, then f~" is an isomorphism from 
T to S. If S = T and T = V, then by part (a), S = V. 
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32. To show that a, is an onto function, let y € G. Then g !:y belongs to С and a,g™'-y) = 
g(g '-y) = (g-g Эу = y. To show that a, is one-to-one, let a(x) = a,(y). Then g-x = gy, and by 
cancellation, x = y. 

33. a. Pore, E P, Q, ° Q) = 0,4 = d, and oy ° Qy = dp, = Q 


gl E $ 


b. о„° Qy = Aye = ay and a,49 A, = oa, = 
34. a. Let f(g) = f(A). Then a, = o, and, in particular, a,(1) = а,(1), org: 1 =A-landg = А. 
b. For g, h € G, f(g: h) = dyn = a, ° о, = f(g) ° fh). 
35. x p x because f(x) = f(x). 
xpy > у px because if f(x) = f(y) then f(y) = f(x). 
x p y and y pz — x pz because if f(x) = f(y) and f(y) = f(z), then f(x) = f(z). 
36. К = {хє Z|f(x) = x - 41 = 0}. Therefore K = (0, + 3, + 6, + 9,...) = 37. 
37 a. 1 +S = {1+,0, 1 942, 1 +84, 1 +,6} = {1, 3, 5, 7) 
3 +S = {3 +20, 3 942,3 +34, 3 +6} = {3, 5, 7, 1} 
7+8 = {7 +0,7 442,7 +24, 7 +36} = {7, 1, 3, 5} 
b. 2 +S = {2 +,0,2 +,2,2 +,4,2 +6} = {2, 4, 6, 0} 
2 +5 =0+;S = 4 4.8 =643S 


a 


38. H(X, Y) = 2 
39. Straightforward matrix multiplication using addition modulo 2. For example, 
1 0 1 
1 1 1 
(01111) 1 0 0 | = (000) 
0 1 0 
0 0 1 
40. 00 — 00000 
01 — 01111 
11 — 11010 


Together with 10 — 10101, these are the four code words given in Example 21. 
41. a. For example, 


1 0 1 
1 1 1 
0 1 1 
H=]1 1 0 
1 0 0 
0 1 0 
0 0 1 


(There are other possibilities because the order of the rows in B does not matter.) 
b. 0000 — 0000000 

0001 — 0001110 

0010 — 0010011 
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0011 — 0011101 

0100 — 0100111 

0101 > 0101001 

0110 — 0110100 

0111 > 0111010 

1000 — 1000101 

1001 — 1001011 

1010 — 1010110 

1011 — 1011000 

1100 — 1100010 

1101 — 1101100 

1110 > 1110001 

1111 1111111 
42. (11011)H = 001 
(10100)H = 001 
(01110)H = 001 
(00001)H = 001 


43. 000110 
44. Present state Next state Output 
Present input 
0 1 
So So 53 0 
S4 So Sp 1 
Sp 53 53 1 
53 S4 53 2 
45. а. b. 01011 
0 0 
- o 
1,2 
46. а. S; b. s, 


47. 11001011 


| 848 Answers to Practice Problems 


49.a Set consisting of a single 0 
b. Set consisting of any number of 0s (including none) followed by 10 
c. Set consisting of a single 0 or a single 1 
d. Set consisting of any number (including none) of pairs of 1s 
50. The string in part (b) does not belong. 
51. a. 0 b. 0*10 c. OVI 
52. 5, 53 


53. A state s produces the same output as itself for any input. If s; produces the same output аз s;, then s; pro- 
duces the same output as s;. Transitivity is equally clear. 

54. Property 1 is satisfied because all states in the same class have the same output strings for any input 
string, including the empty input string. To see that property 2 is satisfied, assume s; and s; are equivalent 
states proceeding under the input symbol i to states s;' and s; that are not equivalent. Then there is an in- 
put string a such that f5(s;, o) 7 fo(s;, а). Thus, for the input string ia, s; and s; produce different output 
strings, contradicting the equivalence of s; and s;. 


55. Equivalent states of M in Table 9.9 are А = (0,1,3), B = {2}, and C = {4}. The reduced machine is 


d. (11)* 


Present state Next state Output 
Present input 
0 1 
A B A 1 
B C A 0 
C A A 0 


Equivalent states of M in Table 9.10 are {0}, {1}, {2}, and {3}. M is already minimal. 


56. First, write the state table: 


Present state Next state Output 
Present input 
0 1 
So So S4 1 
S4 Sy So 0 


The states can be encoded by a single delay element, as shown: 


d 
So | 0 
Sı | 1 


The truth functions are 


x(t) d(t) y(t) d(t + 1) 
0 0 1 0 
1 0 1 1 
0 1 0 1 
1 1 0 0 
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The canonical sum-of-products forms are 


y(t) = d' 
d(t + 1) = ха + x'd 


and the sequential network is 


57. a. ...b 00b... 

b. The machine cycles endlessly over the two nonblank tape squares. 

c. The machine changes the two nonblank squares to 0 1 and then moves endlessly to the right. 
58.a. bXX1XXb halts without accepting 


T 


5 
b. ЬХХХХХЬ halts without accepting 


| 


2 
c. bXXXOXXb halts without accepting 


| 


2 
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59. State 3 is the only final state. 


(0, 1, 1, 0, R) move right, ignoring 15 

(0, 0, 0, 1, R) a 0 has been read, change state 

(1, 0, 0, 2, R) a second 0 in succession has been read 

(1, 1, 1, 0, R) no second 0, start over 

(2, 0, 0, 2, R) 05 continue to be read 

(2, 1, 1, 0, R) string of 0s broken before end of input, start over 
(2, b, b, 3, R) end of input follows string of at least two 05, accept 


If we were not constrained to move only to the right, we could use the following machine, where state 4 
is the only final state. 


(0, 0, 0, 1, = 


(0. 1, 1, 1, В) 0 reads the first symbol so the tape is not blank 


(1, 1,1, 1, R) 

(1, 0, 0, 1, ah state 1 reads to end of input, moves left 
(1, b, b, 2, L) 

(2, 0, 0, 3, L) reads one 0 

(3, 0, 0, 4, L) reads second 0, accepts 


60. Change (2, 1, X, 3, L) to (2, 1, X, 7, L) and add (7, 1, X, 3, L). 


61. One machine that works, together with a description of its actions: 
(0, 1, 1, 1, R) reads first 1 
(1, b, 1, 6, R) n — 0, changes to 1 and halts 
(151, 1, 2;.R) reads second 1 
(2, b, b, 6, R) n — ], halts 
(2, 1, 1, 3, А) n=2 
1,1,3,R . = 
e b b 4. | } finds right end of л 
(4, 1, 5, 5, L) Е 
(5, 1, b, 6, L) erases two 15 from n and halts 


62. 08 = 00S => 000S = 0000S = 00001 
63. L = (0"1|n = 0} 
64. For example: 
a. G(V, Vr, S, P) where V = (0,1, S, V; = {0, 1}, and = {S —> 1, S > 050} 
b. G = (V, Vn, S, P) where V = (0,1, S, М}, V; = (0,1), and P = {S —> 0M0, M > 0M0, M — 1} 
65. In Су: S = ABA = 00A = 00004 = 00000 66. 
In Œ: S = 004 = 00004 = 00000 
In G4: S = 0A = 008 = 000C = 00005 = 00000 


Identifier 


Letter 


m 


Answers to Odd-Numbered 
Exercises 


(Note that these are answers, not necessarily complete solutions; your instructor 
may require more explanation or justification, as well as a different format, for 


some of these exercises.) 


CHAPTER 1 


EXERCISES 1.1 


1. a, с, d, е, f 
3. а. T Ь. Т е. Т а. Е 
5. а. If there is sufficient water, then there is healthy plant growth. 


11. 


13. 


15. 


17. 


. If there are further technological advances, then there is increased availability of information. 
. If errors were introduced, then there was a modification of the program. 
. If there is fuel savings, then there is good insulation or storm windows throughout. 


.AVB b. A’ A B' 
. l and3 b. 2 с. 4 


b 
с 
а 
а 
а 
а. The food is good but the service is poor. 

b. The food is poor and so is the service. 

c. Either the food is poor or the service is poor, but the price is low. 
d. Either the food is good or the service is excellent. 

e. The price is high but either the food is poor or the service is poor. 
а. [A> BAC]A(C'>B) 

b. 
с 
а 
е 
а 
b 
c 
d 
e 
a 
b 
c 


[(4 V B) > C] ^A (C > By 


. (АУ B) A (4 ABY 

. (АУ В) э С 

. AV(B С) 

. A^B 

. A^(BV C) 

. B—(A^C) 

. A (B' V C?) 

. AA[C = (В' ve 

. Violets are blue or sugar is sour. 


Violets are not blue or, if roses are red, then sugar is sweet. 


Sugar is sweet and roses are not red, if and only if violets are blue. 
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. Sugar is sweet, and roses are not red if and only if violets are blue. 

. If it is false that both violets are blue and sugar is sour, then roses are red. 
. Roses are red, or violets are blue and sugar is sour. 

. Roses are red or violets are blue, and sugar is sour. 


_.H->K b. K —^ (HA А) c. КЭ Н а. K& 4A e (AVH)5K 
. F>B b. B> Е c. B' > (F' AS) а. S— B' e Ser 
A B | A>B | A’ | A'VB | (А> В) OA VB 
T T T F T T 
T F F F Е T 
F T T T T T 
F F T T T T 
Tautology 
А | В | С | АЛВ | (АЛВ)/С | ВУС | A^(BVO) |( A^B)VCOA^A(BVO) 
T T T T T T T T 
T T F T T T T T 
Т Е T F T T T T 
T F F F F F F T 
F T T F T T Е F 
F T F F F T F T 
F F T Е T T F F 
F F F F F F F T 
A B A' B | ANB | (А УВ) | Ал(А' УВ”) 
Т T F F F T T 
T F F T T F F 
F T T F T F F 
F F T T T Е Е 
А В A’ АЛВ | A\B-<A' 
T T F T F 
T F F F T 
F T T F T 
F F T F T 
А | В | С | А-В | АУС | ВУС | (АУС) = (ВУС) | (А В) > (АУС) > (BV C) 
T T T T T T T T 
T T F T T T T T 
T F T F T T T T 
T F F F T F F T 
F T T T T T T T 
F T F T F T T T 
F F T T T T T T 
F F F T F F T T 


Tautology 
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ВЛА |A^BoB^A 


A^B 


25. ]b. 


(AV B)VCGAV(BVC) 


АУ (BV C) 


вус 


(АУ В) УС 


AVB 


2a. 


AAB)AC| BAC | AAN(B^C)|(A^BJ^C A^(B^C) 


A^B 


[e 


2b. 


БББ 


т 
Т 
Е 
Е 
T 
T 
F 
F 


A|BICIBACIAV(BAOJAVB|AVC|(AVB)A(AVO)|AV(BA^O)e (AV B) л (АУС) 


3a. 
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ЗЬ. А | В |С | ВУС АЛ (ВУС) | АЛВ|АлС(АЛВ) У (АЛС) | Ал (ВУС) < (АЛ В) У (А Л С) 
т[т|т| т т т т т т 
Т|Т|Е| T т т Е т 
Y up] T T F т т т 
Т|Е|Е| Е Е Е Е Е т 
Е|т|т| T Е Е Е Е т 
FI|T|F| T F F F F T 
F|F|T| T F F F F т 
FIFIFI Е Е Е Е Е т 

4a. А | 0 | AVO|AVOGA 5b. A | А | АЛА | 0 | АлА' +0 
т | Е т т т | Е Е Е т 
Е | Е | Е т Е | т | Е | Е | т 


27. a. (AANB')ACSAA(B' AC) by 2р Б A ^ (C ^ B") by 1b € (A ^ C) ^ B' by 2b 
b. (AV B)A(AV B') = АУ (BA B') by 3a = А V 0 by 5b & A by 4a 
c AV(BAA') & (A V B) ^ (A V A") by 3a & (А V B) ^ 1 by 5a & A V B by 4b 

29. If A is F and both B and C are T, then (А ^ B) V C is T but А ^ (B V C) is F. These two wffs are not 
equivalent. 


31. dogs AND NOT retrievers 
33. (novels OR plays) AND AIDS 
35. 1.0, 2.4, 7.2, 5.3 
37. if not (Valuel < Value2) then statement] else statement2 end if 
39. A | B | A>B| A | AVB| А5 В6А'ҮВ 
T T T F T T 
T|F F F F T 
F T T T T T 
F F T T T T 


41. a. Assign B’ ^ (А — В) true and JA’ false. From the second assignment, А is true. From the first assign- 
ment, B’ is true (so В is false), and A — В is true. If A — B is true and А is true, then B is true. В is 
thus both true and false, and [B’ ^ (A > B)] > A’ is a tautology. 

b. Assign (А — B) ^ A true and В false. From the first assignment, А is true and A — B is true. If A > B 
is true and А is true, then B is true. B is thus both true and false, and [(4 > B) ^ A] > B isa tautology. 

c. Assign (А V B) ^ A’ true and B false. From the first assignment, A’ is true (and A is false), and A V B 
is true. If A V Bis true and А is false, then В is true. В is thus both true and false, and (A V B) Л A’ > B 
is a tautology. 


аз. 27 = 232 


45. 


тот ч o/s 
тыч чт 34/0 
т ч җа т® 
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47. a. A | B | AVB | A’ | В | AAB | AABY | AVBe(A'^By 
T T T Е Е F T T 
T F T F T F T T 
F T T T F F T T 
F F F T T T F T 
b. A | B | B | ANB (A\B') | A>B A>~Bx(A\B')' 
T T F F T T T 
T F T T F F T 
Е T F F T T Ji 
F F T F T T T 


49. А ^ B is equivalent to (A — В’) 


A | B | A^B| В | А-В | A>BY | AABo(AOB 
T T T F F T T 
T F F T T F T 
F T F F T F T 
F F F T т Е т 
А V В is equivalent to A’ > B 
A | В | AVB | A | АВ | AVBoA'SB 
T T T F T T 
T F T F T T 
F T T T T T 
F F F T F T 
51. A ^ B is equivalent to (A|B)|(A|B) 
А | В | A^B | AB | (AB)(AB) | A^ В‹©(АВ)(АВ) 
T T T F T T 
T F F T F T 
F T F T F T 
F F F Т Е T 


A' is equivalent to А|А 


А | A | AA | ААА 
T | F | F | T 
F T T T 


53. 


55. 
57. 


59. 


6 


— 


63. 
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a. For A ^ B to be true, we would want to know that both parts are true; if one part has an unknown truth 
value then it is unknown whether this is the case. For А V B to be true, we would want at least one part 
to be true; if one part is false and the other part has an unknown truth value, then it is unknown whether 
this is the case. Finally, if the truth value of А is unknown, then the truth value of A' is also unknown. 

b. N c. F d. T 

2 

Machine D is either clean or infected. In either case, by statements 3 and 1, respectively, C is infected. 

Because C is infected, then A is infected by statement 2. By statement 4, B is infected (because C is not 

clean). By statement 3, because B is infected, D is not clean. The conclusion is that all four machines are 

infected. 

This could include everyone because members are welcome and nonmembers are welcome. Or it could 

include no one because no one is both a member and a nonmember. 


. If Percival is a liar, then his statement is false. Therefore it is false that there is at least one liar, and both 


Percival and Llewellyn must be truth-tellers. But this is impossible because we assumed Percival is a liar. 
Therefore Percival is a truth-teller, and his statement is true. Because he said, “At least one of us is a liar," 
Llewellyn must be a liar. Therefore Percival is a truth-teller and Llewellyn is a liar. 


Rothwold’s statement is of the form A V B, where A stands for “I am a liar" and B stands for “Grymlin 
is a truth-teller." If Rothwold is a liar, then his statement А V В is false, and the statement (А V B)' must 
be true. By De Morgan's laws, A’ and B’ must both be true. But A’ is the statement that Rothwold is a 
truth-teller, which is not true. Therefore Rothwold must be a truth-teller, and his statement А V B is true. 
Statement A, however, is false because it says that Rothwold is a liar. So statement B must be true, and 
Grymlin is a truth-teller. Both are truth-tellers. 


EXERCISES 1.2 


11. 


13. 


15. 


G N Ø O gw 


sim 


. By mt, the conclusion is that the car was not involved in the hit-and-run. 
. By simplification, the conclusion is that you will be paid tomorrow. 


hyp 

hyp 

. hyp (deduction method) 
2,3, mp 

1, 4, con 


hyp 

hyp 

hyp 

. 2, 3, con 

. 4, De Morgan 

1,5, mt 

.(AV B^) hyp 

‚ B > C). hyp 
. A’ A (B') 1, DeMorgan 
.A' NB 3, dn 
.AB hyp 
2. A (B C) hyp 
3.A hyp 


A’ 4, sim 
B 4, sim 
C 2, 6 mp 
5,7, con 
B 1,3, mp 
BoC 2,3, mp 
С 4, 5, тар 


PRUNE сш өю л юм 


9, ^R LO 
Е 
> 
o 


17. 


19. 


21. 


23. 


25. 


27. 


29. 


31. 


33. 
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Exercise 28 
1, De Morgan 


1. A’ hyp 

2.AVB hyp 

3. (4) VB 2, dn 

. 4 > В һур 

2. B hyp 

з. Аэ С һур 

4. (В')' 2, dn 

1. A> (BC) hyp 

2. B hyp 

3. A hyp (using deduction 
method again) 

l.AC hyp 

2.C B hyp 

3. B hyp 

ог 

1. А—ә С һур 

2. С В' hyp 

3. B hyp 

Il. РУО hyp 

2. Р hyp 

3. (P) VQ 1, dn 

1.Q' > P' hyp 

2..Р һур 

3. (P’)’ 2, dn 

1. PSP ^P 

2. РЭ (РУР)! 

3. [P > (PV P ] (РУР) > P] 

4. PVP—P 

1. P hyp 

2. Р hyp 

3. PVO 1, add 

4. QVP 3, comm 

Prove 


Exercise 27 
2,3, mp 


Uus (uu de 


COSY ON 


(0) УР 
О ЭР 
(07у 

Q 


PV(QAR)> (PV О) 


Rewriting the conclusion, the argument is 


P V (О ^ R) (P^ V Q) by dn 


or 


‚ РУ(ОЛЕ) 
P' 


UP INE) 
P' 2 (Q ^R) 
ОЛЕ 

Q 


Фром 


P V (О ^A R) > (P' > О) by imp 


һур 

һур 

1, dn 

3, imp 
2, 4, mp 
5, sim 


35. 


37. 


39. 


41. 


43. 


45. 


47. 


The proof for 


is similar. 


1. РЭО 
2. P =O 
3. О —P' 
4. Q'9 
1. A4! В 
2. B>C 
3. CD 


l УЭ" 
2. YOZ 
3. Y 


1. (4 ^ By 
2. (C^ KAY 
3. (C ^ B^ 
4. A' V B' 
5. B' V A! 
6. B—4' 
7. (CY Va 


The argument is (Е > О) A (E V B) A Q' >B 


A proof sequence is 


1. E^ О 
2. EVB 


3. Q' 
4. О'—Е' 


The argument is 


A proof sequence is 


1. Co Е 
2 FVS 
3. С 


The argument is 


A proof sequence is 


1. 4S 
2. АМС 
325) 
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PV(QAR)> (PVR) 


hyp 
hyp 
1, cont 
2, 3, hs 


hyp 
hyp 
hyp 


hyp 
hyp 
hyp 


hyp 

hyp 

hyp 

1, De Morgan 
4, comm 

5, imp 

2, De Morgan 


hyp 
hyp 
hyp 
1, cont 


tA 
Q 

< 
tO 


8. C —4' 
9. C' V (B) 
10. (B) V C' 
11. B'> C 
12. B'>A' 


5. Е 

6. (E) VB 
7. E' В 
8. В 


(СЭ Е) А (FV 5) э (CS) 


һур 
һур 
һур 


4. F' 
9-5 


[(А—> S) (АУ С)Л 5']— C 


һур 
һур 
һур 


1,3, mp 
2,3, mp 
4, 5, inc 


7, imp 

3, De Morgan 
9, comm 

10, imp 

8, 11, hs 


13. (B > A') ^(B' ^ A") 6, 12, con 
A' 


Exercise 35 
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49. The argument is 


5 


-— 


53. 


55. 


[RA^A(F'VN)AN ^(4' > Р)| (А Л Р) 


A proof sequence is: 


1 RA(F' VN) hyp 7. Е 2, 6, ds 
2.N' hyp 8. F' > (А”)' 3, cont 
3. А 5 Е hyp 9. (A'Y 7, 8, mp 
4. R 1, sim 10. A 9, dn 
5. ММ 1, sim П. АЛА 4, 10, соп 
6. NV F' 5, comm 
. The argument is 
(JV L) > OAT A(C>T) э J 
A proof sequence is 
1. UVLO? С hyp 6. C — (JV Ly 1, cont 
2. Т' һур 7. JV Ly 5, 6, mp 
з. CT hyp 8. J'AL' 7, De Morgan 
4. Тә С 3, cont 9. J’ 8, sim 
5. C' 2, 4, mp 
or 
1. (ЈУ ә С hyp 5. (JV Ly 1, 4, mt 
2. T hyp 6. J' AL' 5, De Morgan 
3. CT hyp 7. J’ 6, sim 
4. C' 2,3,mt 
The argument is 
(D> T) ^(D V B) > (Т > B) 
A proof sequence is 
l. DT hyp 4. D' 1, 3, mt 
2. DVB һур 5. В 2, 4, ds 
3. T' hyp 
a. A B [ө B5ClIA-—((B-C)| A^B|(A^B)—^CJ|A-—(B—^C)o(A^B)5C 
T T T T T T T T 
T |T|F F F T F T 
T Е T T T Е T T 
T Е F T T F T T 
F T T T T F T T 
Е | T | F Е т Е т т 
F Е T T T Е T T 
F | Е | Е т т Е т т 


b А > (В > С) елэ (ВМС A V(B'VO © (А VBOVC&e&IDIAB'VCe 


(АЛВ)— С 


с. By part (a) (or (D), [Р, ^ Р, ^ ··:ЛР,| > (R > S) © (Р A Р, A+++ AP, Л К) > S, which 


says to take each of P,, Pa, ..., Pa, R as hypotheses and deduce S. 
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EXERCISES 1.3 


1.a. Т b. F c. F d. T 

3.a. T b. T € T d. F е. Е f. T g. T h. F 

Б.а. F b. T c. Т d. F e. T f. T 

7. а. true: domain is the integers, A(x) is “x is even", B(x) is “х is odd", false: domain is the positive inte- 
gers, A(x) is “x > 0”, B(x) is “х = 1” 

b. true: domain is the collection of lines in the plane, P(x, y) is “х is parallel to y", false: domain is the 
integers, P(x, y) is “x < y" 

c. true: domain is the integers, P(x) is “x is even”, O(x, y) is “y |x” (y divides x), false: domain is the col- 
lection of all people, P(x) 15 “х is male", Q(x, y) is “у is a brother of x” 

a. scope of (Vx) is Р(х) > Обу); y is a free variable 

b. scope of (3x) is A(x) ^ (Vy)B(y); scope of (Vy) is B(y); no free variables 

c. scope of (3x) is (Vy)P(x, y) ^ Q(x, y); scope of (Vy) is P(x, y); y is a free variable 

d. scope of (Ax) is (3y)[ 4(x, y) ^ B(y, z) > A(a, z)]; scope of (Ay) is A(x, y) ^ BO, z) > A(a, z); zisa 
free variable 

11. b and c 


Many parts of Exercises 13—24 have multiple equivalent answers, some of which are shown here. 
13. a. (Vx)(D(x) > S(x)) 

‚ EDDA ^ (R(x))'] or [(Vx)(D@) > RO) 

с. (OLDA) ^ Six) > RE] 

d. (3x)LD(x) ^ S(x) ^ А(х)] 

e. (Vx) Р(х) > (Sœ) ^ RG))'] 

f. (VOLDE) ^ S(x) > D(x) ^ К(х)| 

g (Vx)[D(x) > S] 

h. 5(М)  (Vx)(D(x) SQ) 

i. R(M) ^ R(T) 

J 

a 

b 

с 

d 

e 

f. 


c 


j. (9х)(О(х) ^ R(x)) > VODE) > 5(х)) 

. СУх)(М(х) > T(x)) 

‚ (Ax)(W(x) A T(x)) 

‚ VME) > Te) ^ YOWE > [T] 

‚ бух)(Тх) > W(x)) 

- VMA  (169)'] 

- (ух)(М@х) > Tox) > (Vx)(Wx) > Tx) 

в. (Эх)[ х) ^ (T69)' ] 

h. (Мх) Мх) (169) ] > (Эх) Их) ^ (Тх))'] 
a. (Эх)[Р(х) ^ (WTO) > Fr, yy] 

b. (Vx)[P(x) > (VTO) ^ Fe, у))] 

с. (Эх)(3у)(Р(х) ^ TO) ^ (Fe, у)) 

a. (Vx)(Vy)(M(x) ^ GO) > Р(х, y) 

b. [(30(GG9) ^ (Vy)(M(y) > Fæ, у)))]! or (v:9(GG9 > EVM) ^ Fe, y)]) 
с 

а 

а 

b 

с 


19. 


‚ (“х)(Уу)(М(у) A F(x, у) > С(х)) 

. VAGE) > GMY) ^ FQ, у))) or VOYAGE) > (MO) ^ Fa, у))) 
. EDW A L(x) Л С(х)) 

. VOLKE) > Q0) ^ С(х))'] 

. EVLA ^ (Vy)(AG, у) > JO] or (Эх)(Уу) (х) ^ (465, у) > JY] 


21. 
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d. (VOJ > (WAK, y) > JQ))] or VALUI > AG, у) > ЛО) or 


e 


(Ух)(Уу) Дх) ^ А(х, у) > JO] 
‚ (X X) UG) A AG y)) > JQ)] 


f£. (Vx) WG) ^ L6)] > ELO) ^ AG, y)]) or VAEN ^ Д0) ]  [J6) ^ AG у)]) 


ge 


23. a 
b 
с 
а 
е 


(AMX) ^ (Му) Шу) > (AG, у))')) or AW) ^ (Му)[А(х, у) > (20))' 0) or 
ENWP ^ Шо) > (AG. у))']) 

» (Vx)[BO) > (WFO) > LO, у))] or VABE) A FO) > La, y)] 

‚ (ABQ) ^ (WFO) > LO, у))] 

‚ (Vx)[B@) > ENFO) ^ L(x, у))] 

‚ (Vx)[B@) > WEE,» > FO))] 

‚ (WFO) > LE, y) > В(х))\ or (WAF) ^ Lx, y) > BO] 


f£. (Vx)[BO) > WEK, у) > FO))] 


5 
h 
i 


‚ [EDB A (Vy)LGs у) > FO or (Vv9[BG) > (ALO, у) ^ FON] 
‚ Gx)LBG9 ^ GNEO) ^ LG, yY] or EVEN A FO) ^ Le, y)] 


i. ((Ax)[B@) ^ (YL, у) > FO))] 


1. (VOLB > (FO) ^ LO, у))'] 
x (OLB > (WFO) > LO, »))] or (Ух)(УУ)[(В(х) ^ FO) > LO, y) ] 


25. 


27. 
29. 


31. 


33. 


- [Gx)LBQ) ^ (W)FO) > Ee, »9]] от VBA > GEF) ^ LG. y))] 
. John is handsome and Kathy loves John. 

. All men are handsome. 

. All women love only handsome men. 

. A handsome man loves Kathy. 

. Some pretty woman loves only handsome men. 

. John loves all pretty women. 


i b. 3 c. 3 d. 1 
. No Web site features audio. 


. Some Web site has neither audio nor video. 
. Every Web site has either audio or video. 
. Some Web site does not have text and also either doesn't have audio or doesn't have video. 


. Every farmer grows something besides corn. 
. Some farmer does not grow corn. 
. Someone besides a farmer grows corn. 


. Both sides are true exactly when A(x, y) holds for all x, y pairs. 

. Both sides are true exactly when some x, y pair satisfies the property A(x, y). 

. If there is a single x that is in relation P to all y, then for every y an x exists (this same x) that is in rela- 
tion P to y. 


a 
b 
с 
а 
е 
f. 
a 
a 
b. Some Web site does not have audio or does not have video. 
с 
а 
е 
а 
b 
c 
a 
b 
с 


d. If a has property A, then something іп the domain has property А. 
e. If any member of the domain that has property A also has property B, then if all members of the domain 


have property A, all have property B. 


35. a. valid: there is an x in the domain with property A says it is false that everything in the domain fails to 


have property А. 


b. not valid: domain is the integers, P(x) 15 “х is even", Q(x) is “х is prime". Because there are prime in- 


tegers, (Чх)О(х) and therefore (Vx)P(x) V (3x)Q(x) is true. But it is false that every integer is even or 
prime, so the implication is false. 


37. If something in the domain has either property P or property Q, then something has property P or some- 
thing has property Q, and vice versa. 
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EXERCISES 1.4 


1. The conclusion is that pansies are plants. The hypotheses have the form (Vx)(F(x) — P(x)) ^ F(p). By 
universal instantiation, F(p) — P(p), then by modus ponens, Р(р). 


. The conclusion is that pansies аге red. The hypotheses have the form (Vx)[F(x) > (R(x) V P(x))] ^ 


F(p) ^ [P(p)]'. By universal instantiation, F(p) — (R(p) V P(p)), then by modus ponens, R(p) V P(p), 
and finally by disjunctive syllogism, R(p). 


5. No conclusion is possible. Just because pansies are flowers, it does not make them either red or purple. 
The hypotheses have the form (3x)((x) ^ R(x)), (Ax)(F(x) ^ P(x)), F(p). But existential instantiation 
does not allow us to use p in removing the existential quantifiers, so we can say nothing further about 
pansies. 


11. 


13. 


15. 


17. 


19. 


21. 


23. 


а Кк 


1 
2 
3 
4 
1 
2 
3. 
1 
2 
3 
1 
2 
3 
4 


һур 
lei 
hyp 
3,ui 

2, 4, mp 
5,eg 


. The domain is the set of integers, Р(х, y) is “x « y", and Q(x, y) is “x > y"; for every integer x, there is 


some integer that is larger and there is some integer that is smaller. But it is false that for every integer 
x there is some one integer that is both larger and smaller than x. 


. To get to step 2, ei was performed on two different existential quantifiers, neither of which was in front 


with the whole rest ofthe wff as its scope. Also, both existential quantifiers were removed at once, with 
the same constant a substituted for the variable in each case; this should be done in two steps, and the 
second would then have to introduce a new constant not previously used in the proof. And at step 3, 
the existential quantifier was not inserted at the front of the wff. 


‚ (Vx)P(x) hyp 

. P(x) l, ui 

. P(x) V Q(x) 2, add 

. (Vx)(P(x) V Q(x)) 3, ug (note that P(x) V Q(x) was deduced from (Vx)P(x) in which x is not free 

. GEVV) P, y) hyp 4. (3x)P(x, b) 3, eg 

. (Ay)P(a, y) l,ei 5. (3y)(3x)P(x, y) 4, eg 
P(a, b) 2,el 

. (Vx)P(x) hyp 4. P(a) 1, ui 

. (ag [Poo hyp 5. Q(a) 3, 4, inc 

. [Pla] 2, ei 6. (3x)Q(x) 5, eg 

. (9х)(4(х) ^ В(х)) һур 5. (Ях)А(х) 3, eg 

. A(a) ^ B(a) l, ei 6. (3x)B(x) 4, eg 

. A(a) 2, sim 7. (Ax)A(x) ^ (3x)B(x) 5, 6, con 

. B(a) 2, sim 


Domain is the integers, P(x) is “х is even", Q(x, y) is “x = 2y + 1” (which means that x is odd). Then 
(3x)P(x) is true (x = 2) and (3x)(3y)Q(x, y) is true (3 = 2* 1 + 1), but (3x) (3y)[P(x) ^ Q, y)] is 
false (no x is both even and odd). 


1 


. VP) һур 5. P(x) > Q(x) temporary hyp 
. (Р(х))' l, ui discharged 
. P(x) temporary hyp 6. (Vx)(P(x) > О(х)) 5, ug 
Q(x) 2, 3, inc 
- (Ax)(Vy)OG, у) hyp 4. (Ax)O(, у) 3, eg 
. (Vy)Q(a, у) l, ei 5. (Vy)(3x)OG;, y) 4, ug 
. Oa, y) 2,ui 


33. 
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1, 


ш 


‚ (VG) = [PH] > (Bx (Se) ^ М(х) > DAS ^ [РО] 
. VAME) > [Pœ] hyp 6. [P(a)]' 


1. (Vx)(A(x) > В(х)) һур 4. A(a) > B(a) 

2. (Ax)A(x) hyp 5. B(a) 

3. A(a) 2, ei 6. (3x)B(x) 

1. Р(х) > (Ay)OG, у) hyp 5. Р(х) > O(, a) 
2. P(x) temporary hyp 

3. (Ay)QG, у) 1, 2, mp 6. (IVP) > OG, у)) 
4. О(х, а) 3, el 

1. EVP ^ О(х)] hyp 6. O(a)  R(a) 

2. (Vy)LO0) > R63] hyp 7. R@) 

3. P(a) \ O(a) l, ei 8. P(a) ^ R(a) 

4. P(a) 3, sim 9. (Ax)[P(x) ^ А(х)| 
5. Q(a) 3, sim 
. a. (VAME) > PŒ ^ NSE) > М(х)) (XS) > PŒ) 

1. (Vx)(M(x) > Р(х)) hyp 4. S(x) > M(x) 

2. (ух)(5(х) > M(x)) hyp 5. S(x) > P(x) 

3. M(x) > P(x) 1, ui 6. (VASE) > PŒ) 
b. (ух)(М(х) > [PŒ] ^ SE) > М(х)) > a > [P@)]') 

1. (ух)(М(х) > [P(x)]’) hyp S(x) > M(x) 

2. (Vx)(S(x) > M(x)) hyp 5 S(x) > Po} 

3. М(х) > [Pœ] 1, ui 6. (Vx)(S(x) > [P(x)]’) 
c. (Vx)(M(x) > P(x)) ^ (Ях)(5(х) ^ M(x)) > (3xyY(S(x) ^ P(x)) 

1. (Vx)(M(x) > PŒ) hyp 6. P(a) 

2. (Ax)(S(x) ^ ME) hyp 7. S(a) 

3. S(a) ^ M(a) 2, ei 8. S(a) ^ P(a) 

4. M(a) 3, sim 9. (3x)(S(x) ^ Р(х)) 
5. M(a) > P(a) l, ui 

d 

1 

2. (Ax)(S(x) ^ М(х)) hyp 7. S(a) 

3. S(a) ^ M(a) 2, ei 8. Sla) ^ [Pla] 

4. Mla) > [P(a)]’ 1, ui 9. (Ax)\(S(x) ^ [P@)]’) 
5. M(a) 3, sim 


The argument is 
(Мх) (Му)[ CG) ^ 4G) > BO, y)] ^ Cls) ^ (Эх) (х) ^ Be, х)]) > (ALA) 


A proof sequence is 


кез. рей 
Ne 


— 


‚ (VX)Vy)LCG) ^ AQ) > BO, y)] hyp 


C(s) hyp 
. (WIC) ^ AQ) > BOs, y)] 1, ui 
- EDS) ^ [B(s, х)]) hyp 
. S(a) ^ [B(s, a)]' 4, ei 
C(s) ^ A(a) > B(s, a) 3, ui 
[B(s, a)]' 5, sim 
. [C(s) ^ A(a)]' 6, 7, mt 
ОС)! V [А(а)|! 8, De Morgan 
. [LCG)T'' 2, dn 
- [Aa] 9, 10, ds 
. (Ax)[4@)]' 11, eg 


3,4, mp 

5, eg 
temporary hyp 
discharged 

5, eg 


2, ul 
5, 6, mp 
4, 7, con 
8, eg 


2,ui 
3,4 hs 
5,ug 


2,ui 
3, 4, hs 
5, ug 


4, 5, mp 
3, sim 
6, 7, con 
8, eg 


5,mp 
sim 

7, con 
eg 


Qo DN әз ois 


864 
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35. The argument is 


(VAME) > Ix) V Со) ^ VAG) ^ LG) > К@)) ^ O ALG) > (046) > FO] 


A proof sequence is 
1. (Ух)(М(х) > Дх) V G(x)) 
2. (Vx)(G(x) ^ L(x) > F(x)) 
3. Мо) —> Ij) V GG) 
4. С) ^ L(G) > FQ) 
5. MG) 
6. 10) V GQ) 
37. The argument is 


hyp 7.00))' 

һур 8. GG) 

1, ui 9. L(j) 

2, ui 10.G(j) ^ L(j) 
hyp 11. FQ) 

3,5, mp 


hyp 

6, 7, ds 
hyp 

8, 9, con 
4,10, mp 


(VORE) > FE) ^ (Ax)(RQ) ^ BQ) ^ VAC > BO) > (Ax\(G@) ^ К(х)) 


A proof sequence is 
‚ ANRE) > F) 
‚ EVRE) ^ B(x)) 
‚ AGA)" > B’) 
. R(a) ^ B(a) 
. R(a) > F(a) 
. R(a) 
7. B(a) 
39. The argument is 


х чл Бо мо н 


һур 8. F(a) 

hyp 9. G(a)' — Bia)’ 
hyp 10. B(a) > С(а) 
2,ei 11. G(a) 

1, ui 12. G(a) ^ F(a) 

4, sim 13. (3x)(G(x) ^ F(x)) 
4, sim 


5, 6, mp 
3,ui 

9. cont 

7, 10, mp 
8, 11, con 
12, eg 


(YCE) > (Эу) W(x, y)) ^ (vx (Vy (Wx, y) > S, у)) ^ Ст) > (Ay)S(m. y) 


A proof sequence is 

1. (Vx)(C(x) > (Яу), у)) 

2. Ст) > (Ay)W(m, y) 

3. C(m) 

4. (3y)W(m, y) 

5. (Vx)(Vy)(Wx, у) > 5(х, yy) 
41. The argument is 


hyp 6. (Vy)(W(m, у) > Sim, у)) 
l, ui 7. W(m, a) 

hyp 8. W(m, a) > S(m, a) 

2,3, mp 9. S(m, a) 

hyp 10. (3y)S(m, y) 


5,ui 
4, ei 
6, ui 
7, 8, mp 
9, eg 


GIx)(E(x) ^ YMO) > A, y))) ^ EDME) ^ S) > (Ax)(EQ) ^ (3у)(50) ^ AC, у))) 


A proof sequence is 


1. (Эх)(Е(х) ^ WMO) > A(x, у))) hyp 


2. (IM ^ S) 


3. E(a) ^ (Vy)UM(y) > Ala, у)) 


4. (Vy)M(y) > Ala, у)) 
5. M(b) ^ S(b) 
6. M(b) — A(a, b) 
7. M(b) 
8. A(a, b) 
9. S(b) 
10. S(b) ^ A(a, b) 
П. (9у)(50) ^ AG у)) 
12. Е(а) 
13. E(a) ^ (IVS) ^ AG у)) 


14. (3x) (E(x) ^ GSO) ^ Ax, у))) 13, eg 
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43. [EVLAT T > (У)[А(х)]/' |! neg, using [A(x)]' for A(x) 
LEDAMOT T e (Vx) AQ) dn 
[(Ух)А(х) ]' e (Go L409]' |')' cont (each direction) 
[(Vx)A(x) |’ e Gx)[4€9]' dn 
EXERCISES 1.5 
1. yes 
3. no 
5. fish 
7. fox, deer 
9. herbivore(X) <= eat(X, Y) and plant(Y) 
11. fox 
13. a. anita 


a 
b. mike, kim 

c. judith, sam, mike, kim, joan, hamal, enrique, jefferson 

a. ?authorof(marktwain, houndofthebaskervilles) 

b. ?authorof(williamfaulkner, X) 

c. nonfictionauthor(X) <= authorof(X, Y) and not fiction(Y) 

d. ?nonfictionauthor(X) 

a. fatherof(X, Y) <= parentof(X, Y) and male(X) 

b. daughterof(X, Y) <= parentof(Y, X) and female(X) 

c. ancestorof (X, Y) <= parentof(X, Y), ancestorof(X, Y) <= parentof(X, Z) and ancestorof(Z, Y) 
a. ?dry(X) and ingredientof(X, Y)) 

b. ?perishable(Y) and ingredientof(X, Y) and liquid(X) 

c. foundin(X, Y) <= ingredientof (X, Y), foundin(X, Y) <= ingredientof(X, Z) and foundin(Z, Y) 
21. The results should agree with the results for Exercises 13 and 14. 


EXERCISES 1.6 


1íxtl-y—-leox-2y-2 
3.3x — 1 = 2y — 1=›3х = 2y 
5. Working backward from the postcondition using the assignment rule, 
{x+3=44ox=1 
у=х+3 
{2у=8огу=4} 
y=2*y 
{y = 8} 
7. Working backward from the postcondition using the assignment rule, 
{2x+1=1}ox=0 
z=2x+1 
{z-1=0orz= 1} 
{y = 0} 
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9. Working backward from the postcondition using the assignment rule, 


{x(x — 1) = xx — D} 


y=x- 1 
{xy = x(x — D} 
y=x*y 
{у = xx — 0) 


Because the precondition is always true, so is each subsequent assertion, including the postcondition. 
11. Using the conditional rule, the two implications to prove are 


{у = Oandy < 5} у= у + у = 1} and{y = 0 and y = 5} у = 5 {у = 1} 
The first is true by the assignment rule. Working backward from the postcondition, 


{fy+1=1}oy=0 
у=у+1 
w= 1} 
The second is true because the antecedent is false. 
13. Using the conditional rule, the two implications to prove are 
{х ~ Oandx > 0} y = 2 * x{y > 0} and {x = Oandx = 0} y = (—2) * x {y > 0} 
The first is true by the assignment rule. Working backward from the postcondition, 
{2*x > 0} ox > 0x з Oandx > 0 
y72*x 
{y > 0} 
The second is true by the assignment rule. Working backward from the postcondition, 
{(-2) *х > 0} ox < 00x 7 Oandx = 0 
у = (—2) *х 
{y > 0} 
15. Using the conditional rule and the definition of absolute value for a nonzero number, the two implications 
to prove are 


{x z 0 and x = 0} abs = х ((x > 0 and abs = x) or (x < 0 and abs=—x)} 
{х # 0 and x < 0} abs = —x {(x > 0 and abs = x) or (x < 0 and abs = —x)} 
Using the assignment rule on the first implication gives the precondition 
(x > 0 and x = x) or (x < Oandx = —x)< (x > Oandx = x) < (x = Qand x = 0). 
Using the assignment rule on the second implication gives the precondition 


(x > 0 and —x = x) or (х < Oand —x = —x) (x < 0and -x = —x) e (x = 0andx < 0) 


CHAPTER 2 


EXERCISES 2.1 


1. a. If there is not healthy plant growth, then there is not sufficient water. 
b. If there is not increased availability of information, then there are no further technological advances. 
c. No modification of the program implies that errors will not be introduced. 
d. Poor insulation and some windows not storm windows implies no fuel savings. 
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3. For example: 
a. a nonsquare rectangle 
b. 0 
c. a short, blue-eyed redhead 
d. a redhead who is short 
5. Half of this statement is true. If n is an odd integer, Зи + 5 is an even integer. However, the converse is 
false. Consider the even integer 6. If 3n + 5 = 6, then 3л = 1 and n = 1/3, which is not an integer at all, 
much less an odd integer. See Exercise 25. 
7. a. 4 + 6 = 10; 4 and 6 are even but 10 is not a multiple of 4. 
b. The error lies in choosing both x and y to be equal to 2m. This makes them the same number, which is 
a special case. 


9.25 = 5 = 9 + 16 = 3? + 4,100 = (10? = 36 + 64 = € + 8,169 = (13? = 25 +144 = 
52 + (12)? 

1. и = 1, п! = 1,2" = 2; п = 2, п! = 2, 2" = 4; п = 3, п! = 6, 2" = 8 

13. Let x = 2m, у = 2n, where m and n аге integers. Then х + y = 2m + 2n = 2(т + n), where m + n is 
an integer, so x + y is even. 


15. Let x = 2m + l,y = 2n + 1, where m and n are integers. Then x + y = (2m + 1) + Qn + 1) = 
2m + 2n + 2 = 2(m + n + 1), where m + n + lisan integer, so x + y is even. 


17. Let x = 2m + 1 and у = 2n where m and n are integers. Then x — y = 2m + 1 — 2n = 2(m — п) + 1 
where m — n is an integer, so x — y is odd. 

19. For two consecutive integers, one is even and one is odd. The product of an even integer and an odd inte- 
ger is even by the proof of Example 9. 

21. Let x = 2m where m is an integer. Then x? = (2m} = 4т?, where m? is an integer, so x? is divisible by 4. 

23. The contrapositive is: if x + 1 = 0, then x = 0.Ifx + 1 = 0, then x = —1 < 0,so x < 0 and therefore 
x=0. 

25. If n is odd, then n = 2k + 1 for some integer k. Then 3n + 5 = 3(2k + 1) + 5 = 6k + 8. For the con- 
verse, if 3n + 5 = 6k + 8 for some integer k, then Зи = 6k + 3 or 3n = 3(2k + 1) and n = 2k + 1 for 
some integer k, so п is an odd integer. 

27. If x < y then multiplying both sides of the inequality by the positive numbers x and y in turn gives x? < xy 
and xy < y? and therefore x? < xy < y? or x? < y?. For the other direction, if x? < у? then y? — 3? > 0 
by the definition of <, (у + х)(у — x) > 0 by factoring, (у + x) < 0 and (y — x) <0ог(у+х)> 0 
and (у — x) > 0 because a positive number is the product of two negatives or two positives. But it cannot 
be that (у + x) < 0 because y and x are both positive; therefore (у + х) > 0 and y — x > 0 and y > х. 

29. Let п be a prime number with n = 2k, where К is an integer. Then both 2 and k divide n. Because п is 
prime, п is divisible only by itself and 1, so n = 2 and k = 1. Therefore n = 2. 

31. Let p and q be divisible by и. Then р = kın and q = kn, where k, and k, are integers, and 
р + 4 = Күп + Ёп = (Ку + kn, where Кү + k is an integer. Therefore p + q is divisible by n. 

33. Because n|m,m = kın for some integer kı. Because m|p,p = т for some integer k,. Then 
р = km = (Куп) = (kk)n where kk, is an integer, so n|p. 

35. Let x = 2n + 1. Then x? = (2n + 1)? = 4n? + 4n + 1 = 4п(п + 1) + 1. But n(n + 1) is even (Exer- 
cise 19), so n(n + 1) = 2k for some integer k. Therefore x? = 4(2k) + 1 = 8k + 1. 

37. т?п? = (тп)? 

39. Proof by cases, depending on whether х and y are negative. Case 1: х = 0, y = 0. Then |x| = х,у = y. 
Also, x + y > 0 and |x + y| = x + y. Therefore |x + y| = x + y = |x| + |y. Case 2: х = 0, у < 0. 
Then |x| =х,|у| = —y. Subcase а: х + y 20. Then |х + у| = х + y. Therefore |x + у = 


4 


— 


43. 


45. 


47. 
49. 
51. 
53. 


55. 


м 
— 


75. 
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x +y € x + (—y) (remember that y is negative, so —y is positive) = |x| + |y|. Subcase b: x + y < 0. 
Then |x + y| = —(х + y). Therefore |x + y| = —(х + y) = (—х) + (—у) € x + (—y) (remember that 
x = 0,so —x = 0) = |x| + |y]. Case 3: x < 0, y = 0. Similar to Case 2 with the roles of x and y reversed. 
Case 4: x < 0, y < 0. Then |x| = —x, |y| = —y. Also, x + y < 0 and |x + y| = —(x + y). Therefore 
lx + y| = —@ + y) = (~x) + (~y) = ll + bl. 
. Proof by contradiction. If x, < 4, x; < 4,::-, and x, < А, then x, +x, +++ x,<At+At 

+++ A= nA, and (xy + x; +: + x,)/n < A, which contradicts the definition of A as the average of 
pec ae 


Assume that Мз is rational. Then Мз = p/q where р and q are integers, = 0, and p and q have no 
common factors (other than + 1). If Мз = p/q then 3 = p?/q? or 3q? = p?. Then 3 divides p? so 3 
divides p. Thus 3 is a factor of p or 9 is a factor of р? , and the equation 342 = p? can be written 342 = 9х 
or 4° = Зх. Then 3 divides q? so 3 divides q. Therefore 3 is a common factor of p and q, a contradiction. 
Assume that V/2 is rational. Then V/2 = p/q where p and д are integers, q # 0, and p and q have no com- 
mon factors (other than +1). If V/2 = p/q then 2 = p?/q! or 24 = p°. Then 2 divides p? so 2 divides 
p. Thus 2 is a factor of p, or 8 is a factor of p°, and the equation 24° = p? can be written 2g? = 8x or 
q^ = 4x. Then 2 divides 4x so 2 divides q? or 2 divides д. Therefore 2 is a common factor of p and q, a 
contradiction. 

0 — (0)2, which is an integral multiple of 2. 

297 is a composite number; 297 = 3*3*3*11 

Counterexample: 9 — 7 = 2 

Proof: Ifxiseven,then х = 2nand x(x + 1)(х + 2) = (2n)Qn + 1)(2n + 2) = 2[(n)(2n + 1)(2n + 2)], 
which is even. If x is odd, then х=2л +1 and x(x + 1)(x + 2) = (Qn + 1)(2n +2) 
(2n + 3) = 2[(2n + 1)(n + 1) (2n + 3), which is even. 
Proof: If x is even, then x = 2л and 2n + (2л) = 2n + 8п? = 2(n + 4m), which is even. If x is odd, then 
x = 2n + 1 апа (2n + 1) + (2n + 1)? = (2n + 1) + (85? + 1202 + 6n + 1) = 8? + 12n? + 8n + 

2 = 2(4т + 6n? + 4n + 1), which is even. 

Counterexample: 3 X 9 = 27 


. Let п be odd and m be even. Then n? is odd by Example 9 and т? is even by Example 5. Then n? + m? is 


the sum of an odd and an even, so it is odd by Exercise 16. 
1 1 


1 1 
. Forn = 1,п + жак 1+ T = 2. For n > 2,n + z >2-+ 5 > 2 because 1/n is a positive number. 


. Counterexample: 5 is prime, but 5 + 4 = 9 is not prime. 
. Proof: n? — 1 = (n + 1)(n — 1) where n — 1 > 1 (because n > 2), which is a nontrivial factorization, 


so the number is not prime. 


. Counterexample: 42 + 4 + 1 = 21 = 3(7), not prime. 
. Proof: Let x and y be rational numbers, x = p/q, у = r/s with p, q, r, s integers and q, s = 0. Then 


X t y = p/q + r/s = (ps + rq)/qs, where ps + rq and qs are integers with qs ~ 0, and any common 
factors between q and s can be removed. Thus x + y is rational. 


. Counterexample: V2 is irrational but V2 x V2 = 2, which is rational. 
73. 


Angle 6 plus Angle 5 plus the right angle sum to 180? by the first fact. The right angle is 90? by the fourth 
fact. Therefore Angle 6 plus Angle 5 sum to 90°. Angle 6 is the same size as Angle 3 by the second fact. 
Therefore Angle 3 plus Angle 5 sum to 90°. 

Assume that Angle 1 and Angle 5 are the same size. As in Exercise 73, Angle 3 plus Angle 5 sum to 90°. 
Because Angle 1 and Angle 5 are the same size, Angle 3 plus Angle 1 sum to 90°. Also, Angle 3 plus Angle 
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1 plus Angle 2 sum to 180° by the first fact. Therefore 90° plus Angle 2 sum to 180°, or Angle 2 = 90°. 
Angle 2 is a right angle by the fourth fact. 


EXERCISES 2.2 
1. а. Р(1): 4(1) — 2 = 2(1)2 or 2 = 2, (true) 
b. P(): 2 6 10 -«- + (4k – 2) = 2 
с. P(k + 1:26 10 + [4k + 1) - 2] = 2k + 1)? 
d. Left side of P(k + 1) 22-6 10 -- + [4k + 1) -2]229 6 10 + (4k 2) + 


[4(k + 1) — 2] (write next-to-last term) = 2/2 + 4(k + 1) — 2 (using P(k)) = 22 + 4k + 2 = 
202 + 2k +1) = > 2k + 1) which is the right side ids + 1). 


3. Base case: A0: 1 = 1(2(1) — 1), (true). Assume P(A): 1 +5 - 9 ---- + (4k — 3) = k(2k — 1). Show 
P(k1x145-494- + [4k 1) – 3] = GI esca] Left side of Pk +1) 1-5 
F9 + [4&*0)-3]214 549 + (4k 3) + [A(k + 1) – 3] = KQk — 1) + A(k + 1) 
— 3 (using P(k)) = 22 — k + 4k + 1 = 2 + 3k + 1 = (Е + D)Ok + 1) = (k + DX + 1) — 1] 


which is the right side of P(k + 1). 


5. Base case: Р(1): 6 — 2 = 1[3(1) + 1], (true). Assume P(k):4 + 10+ 16 + --- + (6k — 2) = k(3k +1). 
Show P(k + 1: 4 4 10 16 t --- + [6(k +1) 2]- @ + DB + D + 1}.4 + 10 + 164 . 
+ [6(К +1) – 2] =4+10+16+::: + (6k – 2) + [6k + 1) - 2] = KGk + D) + 6k -1) – 2 


(using P(k)) = 32 + k + 6k + 4 — 3E + 7k + 4 = (Е + 1)(3& + 4) = (Е + TET + 1), + 1] which 
is the right side of P(A + 1). 

7. Base case: Р(1): 1? = 1(1 + 1)(2 + 1)/6, (true). Assume P(k): 12 + 22 +--+: + 2 = kk + 1)(2k + 1)/6. 
Shaw P 1): I +2? +--+ + (k 1 = (k + Ik + 2)(2(k + 1) + 1)/6. Left side of P(k + 1) = 1? 
teers + (#+1)”=1°+2?°+--+ +? +(К+ 1)? = kk + 1)(2& + 1)/6 + (k + 1)? (using P(A) 
= (k + DIXOE + 1)/6 + К+ 1] = (к + DIQE +k + 6k + 6/6] = (k + DQE + 7k  6)/6 
= (k + D(k + 2)(2k + 3)/6 = (k + D(k + 2)Q(k + 1) + 1)/6 which is the right side of P(k + 1) 


9. Base case: Р(1): 1? = 1(2 — 1)(2 + 1)/3, (true). Assume P(k): 12 + 3 +--+ + Qk — 1) = 
kQk — DOr + 1)/3. Show P(k + 1: 12-32 9 --- t [E - D 12 (+ ОЕ -D— 0D 
Q(k + 1) + 1/3. Leftsideof PE + 1) = 12 +32 +--+ + [E 1) - 12 2-32 + (Е 1) 


+ Dk + 1) — 1р = дк — DQk + D/3 + [2(k + 1) — 1 P(using Р(Ю) = КОК — 1) Qk + 0/3 4 
(2k + 1)? = (2k + D[kQk — D/3 + 2k + 1] 2 Qk + DRE — k + 6k + 3)/3) = (2k + 2K + 5k + 3/3 
= (2k + D(k + DQk + 3/3 = (k + DQ(k + 1) — 1) Q(k + 1) + D/3 which is the right side of 


P(k + 1). 

11. Base case: Р(1): 1:3 = 1(2)(9)/6, (true). Assume P(A): 1:3 + 2-4 +--> + КЕ + 2) = КЕ +1) 
(2k + 7)/6. Show P(k + 1: 1:39 2-4 +: + (К  D(k + 3) = (k+ Dk + 2)Q(k + 1) + 7)/6. 
Left side of P(A + 1) = 1:3 + 2:4 di єє oc (k 0 Dk + 3) = 1:39 2-4 t + kK * 2) H 
(К + D)(k + 3) = K(k + 1)(2К + 7)/6 4 | NY 3) (using P(k)) = (k + D)[k(2k + 7) + 6(k + 3)]/6 
= (k + 1)(2 + 13k + 18)/6 = (k 4 ha + 2)(2k + 9)/6 = (k + D(k + 2)(2(k + 1) + 7)/6 which 
is the right side of P(A + 1). 

13. Base case: P(1): 1/(1: 2) = 1/(1 + 1), (true). Assume P(k): 1/(1:2) + 1/0 :3) +++ + 1/k(k + 1) = 
k/(k + 1). Show P(k + 1): 1/0 i t1/2:3)9-:-- 1 /(k + D(k + 2) = (k + 1)/(К + 2). Left side of 
P(k + 1) = 1/0:2) + 1/2-3) +++: + AK  D(k + 2) —1/0-:2) + 1/02 :3) t + MK 1) 


+ 1/(k + (Е + 2) = k/(k 4 
(K + 2k + DK + 1k + 2) 
P(k + 1). 


ив 1)(# + 2) (using Р(Ю)) = [МК + 2) + 1]/(k + D(k + 2) = 


1) ч 
= (k + 12k + D(k + 2) = (k + D)/(K + 2) which is the right side of 
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15. Base case: P(1): I? = (— 1)2(1)(2)/2, (true). Assume P(k): 1? — 22 +- + (-1 02 = (1)! 
(ЮКЕ + 1)/2. Show PE + 1): 12 22+... + (ПЕ + 1 = (7 Dk + 1) + 2)/2. Left side 
of Pk +1) = 12 2+... + (“DRE DP 2 P-22 s + CD (C11? 


(—1)**!(Ю)(К + 1)/2 + (—1)**(k + 1 (using PÆ) = [C 1)** !()(k + 1) + 2(—1)**? (k + 1]/2 = 
(1) (К + 1) [& 71)! + 2(k + 1)]/2 = (- 17K + D[—k + 2k + 2]/2 = (—1)°* (Ж + 1)(k + 2)/2 
which is the right side of P(A + 1). 


17. Base case: P(1): 22 = (2)(1)(2)(2 + 1)/3 or 4 = (2)(6)/3, (true). Assume P(k): 22 + 4 + --- + (2k) 


= 2k(k + 1)(2k + 1)/3. Show P(k + 1):22 + 4 +- + [2k + DE = 2k + D(k + 2Y2(k + 1) 
+ 1]/3.Left side of Pk + 1)=2?+44+--- + (+ 1)р = 22 + 42 4 + QE? + Dk + DP = 
2k(k + Dk + 1)/3 + [2(k + DF (using PŒ) = (k+ D[KQK + 1)/3 + 2(k+1)]=2(k + D[KQK + 1) 


+ 6(k + 1)]/3 = 2(k + 1)[2K + 7k + 6]/3 = 2(k + 1)( + 2)(2k + 3)/3 = 2(k + Dk + 2)[2K + 1) 

+ 1]/3 which is the right side of P(k + 1). 

19. Base case: P(1): 1:2 = (1)(2)(3)/3, (true). Assume Р(А): 1:2 + 2-3 + 3:4 t ++: + Kk +1) = 
k(k + 1)(К + 2)/3. Show Р(К+1):1-2+2-3+3-4+--- +(k+1X(k+2)=(k+ aoe. 
Left side of PA + 1) = 1:2* 2-3 -3-4 +: + (A+ 1)K +2) =1°:24+2:34+3-44+-:: 
k(k + 1) + (k + D(k + 2) = МК + 1)(k + 2)/3 + (k + DY(k + 2)(using P(A) = (k + 1)( + 2)[k/3 1] 
= (k + 1)( + 2)(k + 3)/3 which is the right side of P(k + 1). 

. Base case: P(1): 1/0 - 4) = 1/(3- 1 + 1), (true). Assume P(A): 1/(1: 4) + 1/(4* 7) + 1/(7-10) +: + 
1/(3k — 2)(3k + 1) =k/(3k + 1). Show P(k + 1): 1/01: 4) + 1/(4- 7) + 147-10) +--+ + 1/(3(k +1) 
— 2)3(k + 1) + 1) = (k + D/[3(k + 1) + 1]. Left side of P(k + 1) = 1/(0: 4) + 144-7) + 1/(7 - 10) + 
<e + 1/(3(К + 1) – 2)8G(k +1) 4+ D) = 1/0:4) + 144-7) +147: 10) +--+ + 1/36 — 2)Gk + 1) 

1/3(k + 1) - 2)(3(k + 1) + 1) = (36+ 1) + 1/3(k + 1) — 2)(3(# + 1) + 1) (using P(A) = k/(3k + 1) 
1/Gk + Dk + 4) = [k(3k + 4) + 1]/Gk + Dk + 4) = (32 + 4k + D/Gk + D)Gk + 4) = 
(3k + D(k + D/Gk + Dk + 4) = (k + D/[3(k + 1) + 1] which is the right side of P(k + 1). 

. Вазе сазе: P(1): 1 - 4 = (4? — 1)/3 о = 15/3, (true). Assume Р(й): 1 -4- 4 +--+ +4 = (4+1 – D. 
Show P(k + 1): 1 - 4 42 +--+ + 401 = (4**? — 1/3. Left side of P(k + D- со. 
+41 =1+4+48 +. +44 n (4**! — 1)/3 + 4**! (using P(A) = (4! —143- ауд 

= (4:401 — 1/3 = (4**? — 1)/3 which is the right side of P(k + 1). 

. Base case: P(1): i (3 — 1)/2, (шо), Assume P(k): 1 - 4-7 +10 +: + 3k — 2) = k(3k — 1)/2. 
Show Р(К + 1): 1 ys a 10+ · (3(k + 1) — 2) = (К + DQ(K + 1) — 1)/2. Left side of P(k + 1) 
=1+4+7+10+::: + G6(44+1)-2)=14+44+74+104--- + (3k 2) + BA +1) – 2) = КЗК 

1)/2 + 3(k 4 1) - 2) (using РОВ) = [k(3k — 1) + 2G(k + 1) - 2)]/2 = Gh — k + 6k + 6 — 4/2 
= (32 + 5k + 2)/2 = (Е + DGk + 2)2 = (k + DOG(k + 1) — 1)/2 which is the right side of 
P(k + 1). 

27. Base case: P(1):a = (a — ar)/(1n — r) = a(1 — r)/(1 — r), (true). Assume P(k): a + ar +з + 
ar" ! = (a — a)/( — r). Show P(k + 1: a + ar +--+ + ar = (a — ar^ )/(0 — r). Left side of 
Pkkt+l)=atart-:: ta -ac ar + t ar + ar = (a — ar) — r) + ar (using P(K)) 
= [a — art + ak — r)]/ü — r) = (a — а) — r) which is the right side of P(k + 1). 

29. a. 4,882,812 
b. 64,592,673,600 
c. 225 
d. 884 


31. Base case: P(2): 2? > 2 + 1, (true). Assume P(k): kK > k + 1. Show P(k + 1): (k + 1 > k + 2. Left 


side of P(k + 1) = (k+ 1y = K 2k + 12» (К+ 1) + 2k + 1 (using P(A) = 3k - 2» k+ 2 which 
is the right side of P(k + 1). 


2 


c 


2 


[^] 


2 


л 


33. 


35. 


37. 


39. 


41. 


43. 


45. 


47. 


49. 


51. 


53. 
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Base case: P(7): 7 > 5:7 + 10 or 49 > 45, (true). Assume P(k): k > 5k + 10. Show P(k + 1): 
(k + 1 > 5(k + 1) + 10 = 5k + 15. Leftside of P(k + 1) = (k + 1? = K +2К + 17» (5k + 10) + 
2k + 1 (using P(K)) = 5k + 2k + 11 > 5k + 12 + 11 (because k > 6) = 5k + 23 > 5k + 15 which 
is the right side of P(A + 1). 

Base case: P(4): 4! > 4? or 1:2-3-4 = 24> 16 (true). Assume P(k): k! > K. Show P(k + 1): 
(К + 1)! > (k + 1). Left side of P(k + 1) = (k + 1)! = k!(k + 1) > K(k + 1) (using P(A) > (k + 1) 
(k + 1) (by Exercise 31 because k = 4) = (k + 1)? which is the right side of P(k + 1). 

Base case: P(4): 4! > 24 or 24 > 16, (true). Assume P(k): k! > 2". Show P(k + 1): (k + 1)! > 2**!. 
Left side of P(k + 1) = (k + 1)! = k!(k + 1) > 206 + 1) (using P(A) > 2*2) (because k = 4) = 24+! 
which is the right side of P(A + 1). 

Base case: P(1): 1! = 2°, (true). Assume P(k): k! = 2^^!. Show P(k + 1): (k + 1)! = 2*. Left side of 
P(k + 1) = (k + 1)! = Kk + 1) = 2! + 1) (using P(A) = 2*^!(2) (because k = 1 sok + 1 = 2) 
= 2" which is the right side of P(k + 1). 

Base case: P(2): (1 + x > 1 + х? orl + 2x + x? > 1 + x (true because x > 0 implies 2x > 0). 
Assume P(k): P(k): (1 + х) > 1 + х“. Show P(k + 1): (1 + х) > 1 + x**!. Left side of P(k + 1) = 
(1+ xy**! = (1+ 3x K1 + x) > (1 + х®)(1 + x) (using Р(Ю) = 1 + xt + x +x! > 1 + x**! which 
is the right side of P(A + 1). 


Base case: P(2): 1 + 2 < 2? or 3 < 4, (true). Assume P(k): 1 +2 +--+ + k< K. Show P(k + 1): 
2-4 +(k+ 1) < (k+ D. Leftside of P(k -1) 212-4 +(Кк+1)=1+2+ + 
k + (К + 1) < К? + k + 1 (using P(A) < K + 2k + 1 = (Е + 1) which is the right side of P(k + 1). 
a. Base case: P(1): 1 + (1/2) < 2, (true). Assume P(k): 1 + (1/2) + + (1/2) « 2. Show 
P(k + 1): 1 + (0/2) +--+ + (1/27!) < 2. Left side of P(k + 1) = 1 + (10) +-+- + (4/283) = 
1 t (1/2) +: + (1/25 + (1/2471) < 2  (1/2** ) (using P(A), but 2 + ap Барар. 
b. Base case: P(1): 1 + (1/2) = 2 — (1/2), (true). Assume P(k): 1 + (1/2) +++» + (1/25) = 2— (1/25. 
Show P(k + 1): 1 + (1/2) +--+ (1/2755 = 2 — (1/28). Left side of P(k + 1) = 1 + (1/2) + 
-+ (1/25!) =14 (0/2) ++ +025 + (1/25) = 2 — (04/25 + (1/2**!) (using Р(Ю)) = 

2 — (2/2**!) + (1/2**!) = 2 — (1/2**!) which is the right side of P(k + 1). 


Base case: Р(1): 22 — 1 = 8 — 1 = 7 and 7|7. Assume P(k): 7|22* — 1 so 2% — 1 = 7m or 2** = 7m + 1 
for some integer m. Show P(k + 1): 7/22@*) — 1, 234+) — 1 = 234+3 — 1 = 234.93 — 1 = (7m + 1)23 
— 1 = 7(Z2m) + 8 — 1 = 7(23m + 1) where 27m + 1 is an integer, so 7|2°4*) — 1. 

Base case: P(1): 7 — 2 = 5 and 5|5. Assume P(k): 5|7* — 2*, so 7* — 2* = 5m or 7* = 5m + 2* for some 
integer m. Show P(k + 1): 5|7**! — 2441 781 — 25+! = 7.74 — Okt! = "(Sm + 2%) — 20+! = 5(7т) 
+ 247 — 2) = 5(7т + 2" where 7m + 2* is an integer, so 5|7**! — 2**1. 

Base case: P(1): 2 + (-1? = 2 + 1 = 3 and 3[3. Assume P(x): 3|2* + (—1)**!, so 2 + (-1)*! = 
3m or 2* = 3m — (—1)**! for some integer m. Show P(k + 1): 3|2**! + (—1)**2.2**! + (—1)**? = 
2-2 + (-1)*? = (3m — (-1)*!) + (-1)**? = 32m) — 2(- 1*! + (—1)**? = 302m) + (-1)**! 
(—2 + (—1)) = 3Qm) + (-1)**! (-3) = 39m — (—1)**!) where 2m — (—1)**! is an integer, so 
apu + (—1)**?. 

Base case: Р(1): 3472 + 5°7! = 3° + 53 = 729 + 125 = 854 = 61 · 14 and 14|61 - 14. Assume P(k): 
14|34 +2 + 52k*1 so 34+2 + 5%+1 = [4т or 34+2 = 14m — 5?**! for some integer m. Show P(k + 1): 
14|34** 1)+2 + 5209 1)+1 „3401 1)+2 + 52+ 1)+1 — == 342 . 34 + 52k+3 = (14m Е gases + 52k +3 = 14(m3*) 
— S2k*1. 35 + 52071. 52 = 14(т3®) — 52+ 1034 — 52) = 14(m3*) — 5?** (81 — 25) = 14(m3*)— 5?** (56) 
= 14(m3* — 4: 52*+ 1) where m3* — 4-57**! is an integer, so 14|3#®* D+? + 52€* 041, 


55. 


57. 


59. 


61. 


63. 


67. 


69. 


71. 
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Base case: P(1): 10 + 3-42 + 5 = 10 + 192 + 5 = 207 = 9-23 апа 9|9 · 23. Assume P(x): 9|10* + 
3-4**? 45 50 10% + 3: 4? + 5 = 9m or 10‘ = 9m — 3 - 4**? — 5 for some integer m. Show P(k + 1): 
9|10*°*! + 3-4*? + 5. 10! + 3:483 + 5 = 10-105 + 3: 4*? + 5 = 109m — 3-4**? — 5) 
+ 3-4'*3 + 5 = 9(10m) — 30-4**? — 50 + 3-4**2-4 + 5 = 9(10m) — 45 — 3: 4*?(10 – 4) = 
9(10m — 5) — 18: 4**? = 9(10m — 5 — 2- 4**?) where 10m — 5 — 2- 4**? is an integer, so 9|10**! + 
3-479545 

Base case: P(1): 1° + 2(1) = 3 and 3|3. Assume P(k): 3| + 2k so K + 2k = 3m for some integer m. 
Show P(k + 1): 3|(К + 1)? + 2(k + 1). (k + 1P + (k -1) = 0+ 32 -3k-1 42k +2=h + 
2k + 3k + К + 1) = 3m + (2 + К + 1) = 3(m + K + k+ 1) where m + K +k + 1 isan integer, 
so 3|(k + 1 + 2(k + 1). This result also follows directly from Exercise 56: n? + 2n = п? — n + 3n = 
3m + 3n (by Exercise 56) = 3(m + n). 

Base case: P(1): cos Ө + isin Ө = cos Ө + isin Ө. Assume P(k): (cos Ө + isin Ө)* = cos КӨ + isin kð. 
Show P(k + 1): (cos0 + іѕіп 0)! = cos(k- 1)0 + isin(k + 1)0. (соѕ0 + isin0)'*! = 
(cos 0 + isin 6) (cos Ө + i sin Ө) = (cos КӨ + i sin k0)(cos Ө + i sin Ө) = cos КӨ cos Ө + isin КӨ cos 0 
+ icosk0sinO + Psink0 ѕіп Ө = cos k0 соѕ0 — sink@sin@ + i(sin kð cos@ + cos kð sin 0) = 
cos (k0 + 0) + isin (k0 + Ө) = cos (К + 1)0 + isin (k + 1)0. 

The statement to be proved is that n(n + 1)(n + 2) is divisible by 3 for п = 1. Base case: P(1): 
1(1 + 1) + 2) = 6 is divisible by 3, (true). Assume P(A): k(k + 1)(k + 2) = 3m for some integer m. 
Show P(k + 1): (k + 1)(K + 2)(k + 3) is divisible by 3. (k + 1)(k + 2)(k + 3) = (k + 1)(k + 2)k + 
(k + 1)(k + 2)3 = 3m + (k + 1)(k + 2)3 = 3[m + (k + Dk + 2)]. 


65. 


H) 


Proof is by induction on и. Р(1) is true because 1 line divides the plane into 2 regions, and 
(1? + 1 + 2)/2 = 2. Assume that P(x) is true: k lines divide the plane into (k? + k + 2)/2 regions. 
Show P(k + 1), that + 1 lines divide the plane into [(k + 1)? + (k + 1) + 2]/2 regions. A new line 
creates one more region than the number of lines it crosses. When line Ё + 1 is added, it will cross k 
lines (because no two lines are parallel and have no common intersection points). Therefore Ё + 1 new 
regions are created. The total number of regions is therefore Ё + 1 more than the number present with 
k lines, or (2 + + 2)/2 + (k + 1) = (K - k - 2 + 2(k + 1)/2 =( + 3k + 4/2 = (Е + 1} + 

(k + 1) + 2)/2. 


P(1) is 1 = I + 1 which is not true. 


a. Let P(n) be the property that any word composed of a juxtaposition of п subwords has an even number 
of o's. Then P(1) is true because the only words with 1 subword are the words moon, noon, and soon, 
all of which have 2 o's. Assume that Р(/) is true and consider P(k + 1). For any word composed of 
К + 1 subwords, break the word into two parts composed of k subwords and 1 subword. By the induc- 
tive hypothesis, the part with k subwords has an even number m of o's. The part with 1 subword has 2 
о”. The total number of o's is therefore m + 2, an even number. This verifies P(k + 1) and completes 
the proof. 


7З. 


75. 


77. 


79. 


81. 


83. 
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b. Let P(n) be the property that any word composed of a juxtaposition of n subwords has an even num- 
ber of o's. Then P(1) is true because the only words with 1 subword are the words moon, noon, and 
soon, all of which have 2 o's. Assume that P(r) is true for all у, 1 = r = k, and consider P(k + 1). 
For any word composed of k + 1 subwords, break the word into two parts composed of r; and ғ 
subwords, with 1 =r, S k, 1 S r, Sk, and r; + r, = k + 1. By the inductive hypothesis, rı con- 
tains т 0°, an even number, and r, contains m, o's, an even number. Then the original word contains 
ту + m 0’s, an even number. This verifies P(A + 1) and completes the proof. 


For the base case, a 1 -piece puzzle requires 0 steps to assemble. Assume that any block ofr pieces, 1 = r = k, 
requires 7 — 1 steps to assemble. Now consider a puzzle with k + 1 pieces. The last step in assembling the 
puzzle is to fit together two blocks of size r, andr, with 1 S r; S k, 1 S r, k,andr, + r, =k + 1. By 
the inductive hypothesis these blocks required r, — 1 and r, — 1 steps to assemble, so with the final step, 
the total number of steps required is(r, — 1) + (7%) = 1) -1-2(r *r)—1-k. 


For the base case, the simplest such wff is a single statement letter, which has 1 symbol; 1 is odd. Assume 
that for any such wff with r symbols, 1 = r = k, r is odd. Consider a wff with k + 1 symbols. It must 
have the form (P) ^ (О), (P) V (О), or (P) > (О) where P has r; symbols, 1 = r, < k, and Q has r, 
symbols, 1 = r) < k. By the inductive hypothesis, both r, and r, are odd. The number of symbols in the 
original wff is then r; + r, + 5 (four parentheses plus one connective), which is odd. 

P(2) and P(3) are true by the equations 2 = 2 and3 = 3. Now assume that P(r) is true forany r, 2 = r = k, 
and consider P(k + 1). We may assume that k + 1 = 4, so that (k + 1) — 2 = 2 and by the inductive 
hypothesis can be written as a sum of 2s and 3s. Adding an additional 2 gives k + 1 as a sum of 2s and 3s. 
P(14), P(15), and P(16) are true by the equations 14 = 2(3) + 8, 15 = 5(3), 16 = 2(8). Now assume 
that P(r) is true for any r, 14 = r = k, and consider P(k + 1). We may assume that k + 1 = 17, so 
(К + 1) — 3 > 14 and by the inductive hypothesis can be written as a sum of 3s and 8s. Adding an ad- 
ditional 3 gives k + 1 asa sum of 3s and 8s. 

P(64), P(65), P(66), P(67), and P(68) are true by the equations 64 = 6(5) + 2(17), 65 = 13(5), 
66 = 3(5) + 3(17), 67 = 10(5) + 17, 68 = 4(17). Now assume that P(r) is true for any r, 64 = r = k, 
and consider P(A + 1). We may assume that k + 1 = 69, so (k + 1) — 5 = 64 and by the inductive hy- 
pothesis can be written as a sum of 5s and 17s. Adding an additional 5 gives Ё + 1 asa sum of 5s and 17s. 


From Exercise 2, 


n Ix? |" 

Y 2m = n(n + 1) = п? + n. Also, Е = Ет =n 

т= 1 0 

апа i 
п+1 
2x? ntl 
ЕС = E —(n-1?-1-2m-2n.Itistrethat n? = n? +n = m + 2n. 

1 


1 


EXERCISES 2.3 


1. 
3. 


5. 


Assume х2 > x, + 1. Then ху = (x, + 1P = +2х+1>х 4+1>,4+ D041 =% +1. 
Q(0): jy = (ig — 1)! because j = 1,7 = 2 before loop is entered. Assume Q(k): ją = (i; — 1)! Then 
Oh t Withee Hh he = (ip - DI = GD! = Gay — 1)! At loop termination, 7 = (і — 1)! and 
i=xt+1,soj=x! 

Q(0): jp = x^ because j = x, i = 1 before loop is entered. Assume О(Ю): jj = х“. Then Q(k + 1): jen = 
| htl = хіі, At loop termination, j = x and i = y, so j = x". 


Je Xx = хх SX 


15. 


17. 


19. 


21. 


23. 


25. 
2T. 
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‚ gcd(308, 165) = 11 

- gcd(735, 90) = 15 

. £cd(1326, 252) = 6 

. You want to divide the 792 bars of soap evenly among x packages. Therefore x must be a divisor of 792. 


Similarly, you want to divide the 400 shampoo bottles evenly among the x packages. Therefore x must be 
a divisor of 400. The number of packages is the largest value of x that divides both 792 and 400, which is 
the definition of gcd(792, 400). 

Q:j = x * y. Q(0): jg = x * у" because j = x, i = 0 before loop is entered. Assume Q(k): Ik= = yx y, 
Then О(Е + 1): jpn) = jež y = x * ужу = ужу?! = x ж yk Atlooptermination, j = x * y'andi = n, 
so j = x* y". 

Q:j = (i + 1). Q(0): jg = (ig + 1} because j = 4,i = 1 before loop is entered. Assume Q(k): j, = 
(i + 1}. Then ОЕ + 1): =A + 2h+3= (ip + 1% + 2i, +3 = Ê+ 2,414 21, +3 = 
P + 4i, + 4 = (+ 2P = (i+ 1 + 1) = (i, + 1}. At loop termination, j = (i + 1 and i = x, 
so j = (х + 1)’. 

Ој = х*й. Q(0): jy = x * ip! because j = x, i = 1 before loop is entered. Assume O(k): j} = x * (ip! 
Then Q(k + 1): fen = j* (ip + 1) = x* GIG + 1) =x» (ip + 1)! = x * (11)! At loop termina- 
tion, j =x *i! andi =n,soj =x *n! 

Q:j = max of (a|1], ..., ali). O(0): jọ = max(a[1], ..., a[i9]) because ij = 1, so the right side be- 
comes тах(а[1]) and / a[l]. Assume О(У:ў  max(a[|l],..., a|i,]. ThenQ(k + 1): 4,4, = 
max(j,, ali, + 1]) = тах(тах(а[1], ..., ali), ali, + 1) = max(a[1], ..., ali, + 1) = max(a[1], ..., 
ali,,,]). At loop termination, j = max(a[1], ..., а[ї]) and = n, so j = max(a[1], ..., a[n]). 

Suppose there exists an integer d such that d|a/2, d|b/2, and d > c, from which 2d > 2c. Then a/2 = kid 
and b/2 = Ња where k,, К» are integers. Then a = k,(2d) and b = k,(2d), which means that 2d divides 
both a and b but is greater than 2c = gcd(a,b), which is a contradiction. 

If a and b are both odd, then a — b is even, in which case by fact 2, ged(a — b, Б) = ged((a — b)/2, Б) 
308 165 Fact 2 

154 165 Fact 2 


TI 165 Fact 3 


71 44 Fact 2 

77 22 Fact 2 

77 11 Fact 3 

33 11 Еасї 3 

11 11 Fact 3 
0 11 


gcd(308, 165) = 11 


EXERCISES 2.4 


9. 
11. 


a i 


7:308 — 13-165 


1 
3. 15 = 1-735 — 8-90 
5. 

7. 1729 = 7-13-19 


6 = 100-252 — 19.1326 


Because V 1171 = 34, we try the primes 2, 3, 5, 7, 11, 13, 17, 23, 29, 31. None of these divide n, so n is prime. 
8712 = 23-3?. 11° 


13. 
15. 
17. 
19. 


21. 
23. 
25. 


27. 


29. 
31. 
33. 
35. 


37. 
39. 
41. 
43. 


45. 


47. 
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308 = 22*7 * 11 and 165 = 3 * 5 * 11, so gcd(308, 165) = 11 
735 = 3 45% 7 and 90 = 2 » 3? * 5 so ged(735, 90) = 3 +5 = 15 
1326 = 2 * 3 * 13 + 17 and 252 = 2^«3^« 7, so ged (1326, 252) = 2+3=6 


The gcd(a, b) is the product of primes that appear in both a and b to the lowest power to which they appear. 
The Icm(a, b) is the product of primes that appear in either a or b to the highest power to which they appear. 


ged = 2:3- 11, lcm = 27-3 +112 +13, 

ged = 3:5-11, lem = 3^- 5 - 117-17 

Let ged(a, b) = c and gcd(a, a + b) = d. Because gcd(a, b) = с, cla and c|b, so a = mc and b = nc 
for some integers m and n. Then a + b = mc + nc = (m + n)c,so c| (a + b). Therefore c is a common 
divisor of a and a + b, and must be = the greatest common divisor of a and a + b, namely d. Be- 
cause gcd(a, a + b) = d, d|a and d|(a + Б), so a = id and a + b = jd for some integers i and j. Then 
b = jd — a = jd — id = (j — да, so d|b. Therefore d is a common divisor of a and b and must be = 
the greatest common divisor of a and b, namely c. Now с = d апі d = c, so c = d. 


For example, 8|24 and 24 = 12-2 but 8/12 and 8/2. This does not violate the theorem of division by 
prime numbers because 8 is not prime. 


3,5,7 

Ф(8) = 4 (the numbers 1, 3, 5, 7) 

Ф(10) = 4 (the numbers 1,3, 7, 9) 

Let g(n) = n — 1. Because n is never relatively prime to n, the numbers being counted in ф(л) are 
(1, 2, ..., n — 1). Thus every number between 1 and n — 1 is relatively prime to n, so only 1 and n divide 
n, making n prime. 

q(2*) = 22(2) = 8:1 = 8. The numbers are: 1, 3, 5, 7, 9, 11, 13, 15 

35640 

1248000 


To compute ф(ра), count the number of positive integers less than or equal to pq, which is pq, and 
throw out those that are not relatively prime to pq. A positive integer т less than or equal to pq and 
not relatively prime to pq must either contain at least one factor of p or at least one factor of q. We 
count how many integral multiples of p ( p, 2p, 3p, ..., pq) are less than or equal to pq. This number 
is pq/p = q. Similarly we count how many integral multiples of q (q, 29, 3q, ..., pq) are less than or 
equal to pq, of which there are pq/q — p. Multiples of p and multiples of q are distinct except for pq, 
which we have counted twice; to compensate, we will add 1 to the final count. The correct expression is 
ppg) -pq-q-ptl-(p-l(q- 1) = ep) eq). 

Using the fundamental theorem of arithmetic, let n = рү p? · · · p so that n" = p" p?" - - p” and by 
Equation 2, 


b(n) = pit! py ре TOODA): 62] 
Then, again by Equation 2, 


пут phm , 


num—]l „тт, nm- 1 Pi Pp "PE 


пип 


p(n”) = pr"! рэ ре" PPDPP) PP) = = PE): PP 
PıP2 Pk 
_ pen рї" "m 2Y e(n) ) _ pi" py" xs JE 
рро `` рь Np ppp рр? "ре 
п" = 
= (п) = р" (п) 


5, 7, 31, 127 


49.a.28=1+2+4+7+4 14 


51. 
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b. 28 = 22023 — 1) 


The original list is 


1 2 3 4 5 6 
11 12 13 14 15 16 
21 22 23 24 25 26 
31 32 33 34 35 36 
41 42 43 44 45 46 
51 52 53 54 55 56 
61 62 63 64 65 66 
71 72 73 74 75 76 
81 82 83 84 85 86 
91 92 93 94 95 96 
After pass one (crossing out multiples of 2): 

1 2 3 5 7 9 
11 13 15 17 19 
21 23 25 27 29 
31 33 35 37 39 
41 43 45 47 49 
51 53 55 57 59 
61 63 65 67 69 
71 73 TS 77 79 
81 83 85 87 89 
91 93 95 97 99 
After pass two (crossing out multiples of 3): 

1 2 3 5 7 
11 13 17 19 

23 25 29 
31 35 37 
41 43 47 49 
53 55 59 
61 65 67 
71 73 77 79 
83 85 89 
91 95 97 
After pass three (crossing out multiples of 5): 

1 2 3 5 7 

11 13 17 19 
23 29 

31 37 

4] 43 47 49 
53 59 

61 67 

71 73 77 79 
83 89 

91 97 
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After pass four (crossing out multiples of 7): 


1 2 3 5 7 
11 13 17 19 
23 29 

31 37 

41 43 47 
53 59 

61 67 
71 73 79 
83 89 

97 


Because 7 is the largest prime less than V/100, the process terminates. The remaining numbers (excluding 1) 
are the primes less than 100. 


53. 


со 


— м роо о |ую/[оо оо 
со уо|уго/|о|со у | үче 
olev oN =- [Фо |юо 
мо | л| AINI ojojoj- 


7 
3 
4 
5 
6 
8 
9 
1 
2 


оро ly nfo} со|о ооо 
оро о/о IN о || 
ро |2 јол|о|о| уюу оз 


лош AINI I- ою оф 


СНАРТЕВЗ 


EXERCISES 3.1 
1. 10, 20, 30, 40, 50 


3. 2, 1/2, 2, 1/2, 2 
1 1 1 1 1 1 1 1 1 1 


7. 1, 5, 47, 755, 18879 

9. 2, 2, 6, 14, 34 

11. 2, 3, 6, 18, 108 

13. F(n + 1) + F(n — 2) = F(n — 1) + F(n) + F(n — 2) = [F(n — 2) + F(n — 1)] + F(n) = F(n) + 
F(n) = 2F(n). 

15. F(n) = F(n — 2) + F(n — 1) = [F(n — 4) + F(n — 3] + [F(n — 3) + F(n - 2] = F(n — 4) + 
2F(n — 3) + F(n — 2) = Еп — 4) + 2F(n — 3) + [F(n — 4) + Еп - 3] = ЗЕ(п – 3) + 


2F(n — 4). 

17. Е(п + 3) = F(n + 2) + Еп + 1) = F(n + 1) + F(n) + F(n + 1) = 2F(n + 1) + F(n). 

19. n = 1: F(1) = F(3) - l or 1 = 2 — 1, true. Assume true for n = k: F(1) + +++ + F(A) = F(A + 2) — 1. 
Then F(1) + +++ + F(A 1) = ЕП) +--+ t F(A) + F(k + 1) = F(R + 2) - 14+ F(k +1) = 
F(k + 3) — 1. 


21. 


2З. 


25. 


27. 


29. 


31. 
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n = 1: ЕІ) = Е(2)ог1 = 1,true.Assumetrueforn = k: F(1) + FG) + +++ + F(2k — 1) = F(2k).Then 
F()- FG) +--+ + FOK + 1) — 1) = F(1) + FQ) +--+ Fk- D + FOK +1) – 1) = 
FOK) + FQ(k + 1) — 1) = FOK) + FQk + 1) = FQk + 2) = F(2(k + 1)). 


п = 1: ЕА) = 2F(2) + F(1) or3 = 2(1) + 1, true. n = 2: F(5) = 2F(3) + F(2) or 5 = 2(2) + 1, true. 
Assume forall r, 1 < r < k, F(r + 3) = 2F(r + 1) + F(r). Then F(k + 4) = F(k + 2) + F(k + 3) = 
2F(k) + F(k — 1) + 2F(k + 1) + ЕЮ = 21500) + F(k + 1)] + [F(k — 1) + ЕДЮ] = 2F(k + 2) + 
F(k + 1). 


п = 1: F(1) € 2ог1 < 2,true. n = 2: F(2) < 2’ orl < 4, true. Assume for allr, 1 = r = k, F(r) < 2". 


Then F(k + 1) = F(k— 1) + F(k) < 247! + 2 = 2001 + 2) = 30^ < 4QF = 22471) = 
ok*l 


F(positive integer п) 
//function that recursively computes the value of 
//the nth Fibonacci number 
if n — 1 then 
return 1 
else 
if n — 2 then 
return 1 
else 
return F(n — 2) + F(n — 1) 
end if 
end if 
end function F 


UIN PEIVESS SHOVE Ads do мз 
= = Е = = 


T. =14 
id 2 2 4 2 2 2 P 
The proof that 1 + д = q? is similar. 
b, a = 1: (1) = = 1, true. 
p-q 
2-0 (р- фр + 1+\з 1-5 2 
МӨТ ү — ых = | =~ = 1, tre. 
p-dq p 3 2 
N dm d 
Assume for all r, 1 =r = k, F(r) = — 7 . Then 
k-1 k-1 k k k=l k—1 
= = 1 + р) – 1+ 
Fk +1) =Fk-1) + Fl =" езе риса uw 
pu p-dq p-dq 
pp m qu B p! u git! 
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c. From part (b), 


( + ү ( = ү _ 
peg 2 2 | 20 + Vsy (1 – М5) 


F(n) = = L — = = 
9) 2—4 (: + v5) (: Е v5) 22/5) 2"QN/s) 


2 2 


_ у (TS уузу е; 


Мз\ 2 5 2 5 2 


33. Yes. 

35. No. 

37. a. F(1) = F(2) = 1 because the rabbits do not breed until they are 2 months old. F(m) = the number 
of pairs at the end of n months = (the number of pairs at the end of n — 1 months) + (the num- 
ber of offspring pairs produced during month л, born to the pairs existing at month n — 2) = 
F(n — 1) * F(n — 2) 

Б. 27= 1+5 + 21,62 =2+5455 

39. Base cases: S(1), S(2), S(3) are even. Assume for all r, 1 S r S k, S(r) is even. Then S(k + 1) = 

3S(k — 2) = 3(even) = even 

41. a. п = 0: S(O) = Lis odd, л = 1: 5(1) = 1 is odd. Assume for all r, 0 Sr = k, S(r) is odd. Then 
S(k + 1) = 2S(Kk) + S(k — 1) = 2(odd) + odd = even + odd = odd 

b. n = 4: S(4) < 6S(2) or 17 < 6(3) = 18, true. n = 5: 5(5) < 6S(3) or 41 < 6(7) = 42, true. Assume 
for all r, 4S r<k, S(r) < 6S(r — 2). Then S(k + 1) = 2S(k) + S(k — 1) < 2[6s(k — 2)] + 
6S(k — 3) = e[2S(k — 2) + S(k — 3)] = 6S(k — 1) 
43. S(1) = a, S(n) = rS(n — 1) forn = 2 
45. a. A(1) = 50,000, A(n) = 34(n — 1) forn = 2 
b. 4. 

47. b andc 

49. a, b, ande 

51. The basis elements are 0 and 3, each of which is a multiple of 3. Assume that x and y are integers that are 

multiples of 3, so x = (n)3 and у = (k)3. Then x + y = (n)3 + (А)3 = (n + K)3, which is a multiple of 3. 

53. 1. Any unary predicate in x is a wff. 2. If P and О are unary predicate wffs in x, so are (P Л О), (P V О), 

(P > Q), (P^), (Р < О), (Vx)P, and (3x)P. 

55. (1) The integer 1 belongs to the set. (2) If x is an odd integer, so are x + 2 and x — 2. 

57. (1) The string 0 belongs to the set. (2) If x is a binary string with an odd number of 05, so is 1x, x1, and 

0x0. 

59. (1) The string 0 belongs to the set. (2) If x is a binary string ending with 0, so is 1x, Ox, x0. 

61. «positive digit ::— 1|2|3|4/5]6|7]8|9, <digit> ::— O0|« positive digit, «positive integer ::= <posi- 

tive digit>|<positive integer «digit 

63. (1) А = А (2) Ifx is a string with a single character, x^ = x. (3) If x = yz, then x? = z^y* 

65. 1! = 1, n! = n(n — 1)! юги = 2 

ay if ay = ay 


67. а. max(a;, a5) = Л тах (41, ..., а,) = тах (тах (а), ..., а, 1), а,) forn > 2 
d, fa, < а, 


. ay if a, = а» . : 2 
b. min(a;, а) = . min (41, ...,a,) = min(min(q, ..,a, )),a,) for n > 2 
d, if a, > а, 


69. 


71. 


73. 


75. 


77. 
79. 


81. 


83. 
85. 
87. 
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A V (В, A B5) + (A V В,) ^ (A V В.) by equivalence За. Assume that A V (В, A+: AB) = 
(AV B) A-++A (AV Bj). Then AV (ВЛ ЛВ)  AVI[Q ^: ABO ^ Ву |] by 
Exercise 68 <= (А V (В, Л: ABDA (4V Ву) = [AV В) л Л (АУ BR] A (AV By) > 
(A V Bj) ^: A (A V B, р) by Exercise 68. The proof of the other statement is similar. 
if n — 1 then 

return 1 
else 

return 3 * S(n — 1) 
end if 
if n — 1 then 

return 1 
else 

return S(n — 1) + (n — 1) 
end if 
if n — 1 then 

return а 
else 

if n — 2 then 

return b 
else 
return S(n — 2) + S(n — 1) 

end if 

end if 


Mystery(n) = n 
Ifthe list has 1 element or 0 elements, then we are done; else exchange the first and last element in the list 
and invoke the algorithm on the list minus its first and last elements. 


Divide a by b. If the remainder r is 0, then gcd(a, b) = b; else invoke the algorithm on b and r instead of 
a and b. 


4,10, —6, 2, 5; 4, 5, —6, 2, 10; 4, 2, —6, 5, 10; —6, 2, 4, 5, 10 
New Orleans, Charlotte, Indianapolis 


О: CurrentValue = 2! !. Q(0):CurrentValuey = 2*7! true because CurrentValue, = 2 and iy = 2 
and 2 = 2?~'. Assume Q(k): CurrentValue, = 2^^! Then CurrentValue,,, = 2 * 

CurrentValue, = 2(2* ') = 2* = 2*«!, At termination, CurrentValue = 2! ! and = n + 1, so 
CurrentValue = 2^*! ! = 2". 


EXERCISES 3.2 
1. S(n) = n(5) 


. F(n) = n2" 


5. А(п) = пт + 1)/2 


. T(n) = 


n(n + 1)(2п + 1) 
6 


. The solution formula does not apply. Using expand, guess, and verify, F(n) = n! 
11. 
13. 


The solution formula does not apply. Using expand, guess, and verify, A(n) = 2" '(n — 1)! 
а. The recurrence relation is T(n) = 0.957(n — 1) with a base case 7(1) = X. 


b. (п) = (0.95) (x). 
c. 7(21) = 0.358(X), which is slightly more than one-third the original amount X. 


15. 


17. 


19. 


21. 


2З. 
25. 


27. 
29. 
31. 
33. 
35. 
37. 


39. 


41. 


43. 


45. 
47. 
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. The recurrence relation is S(n) = 10S(n — 1) with a base case of S(1) = 1000. 
(п) = 10? 
. At the end of 20 seconds, (the beginning of the 21st second), S(21) = 10? e-mail messages are sent. 


. The recurrence relation is A(n) = (1.01)4(n — 1) — 80 with a base case of A(1) = 5000. 

‚ A(n) = (1.01) 15000) — 80[1 — (1.01) ']/[1 — 1.01] 

. A(19) — $4411.56 

. The recurrence relation is S(7) = 0.98S(n — 1) — 10,000 with a base case of S(1) = 1,000,000. 
. S(n) = (0.98)" (1,000,000) — 10,000[1 — (0.98)" '!]/1 — 0.98) 

. 5(10) = 750622 


. The recurrence relation for the total number of infected machimes each day is T(n) = 67(n — 1) — 6"? 
with a base case of 7(1) = 3. 
b. Ди) = 6" ?[6 * 3 — (n — 1)] 
c. The virus disappears after 19 days. 
The recurrence relation is P(n) = P(n — 1) + n, with P(1) = 1. The solution is P(n) = n(n + 1)/2. 
The recurrence relation is P(n) = P(n — 1) + 3n — 2 with P(1) = 1. The solution is P(n) = 
(n/2)(3n — 1). 
T(n) = 4(2)""! 4 Gy 
S(n) = 2 + 2(-2)""! 
F(n) = 6 + 2(5)""! 
B(n) = 3(2)""! + 4(n — DQy'! 
A(n) = 41 + 07! + 4(1 — iy! 
The characteristic equation is 2 — / — 1 = 0 with roots 
1+V5 1- V5 


2 2 


Same oc?» oct 


rı Fy 


The solution is 


F(n) = 


у - vs у - Уу 

5 2 5 2 

which agrees with the expression given in Exercise 31(c) of Section 3.1. 

а. The recurrence relation is M(n) — M(n — 1) = (1/2)|M(n — 1) — M(n — 2)] for n= 3, with 
M(1) = 200,000, M(2) = 250,000. The solution is M(n) = 300,000 + (— 100,000)(1 /2)"!. 

b. М(7) = 298437, which is within $2000 of $300,000. 

Let S(n) be the number of binary strings of length n with no two consecutive 05. Such strings can be 

generated in two ways: (1) Puta 1 on the end of a string of length n — 1 that has no two consecutive Os. 

There are S(n — 1) of these strings. (ii) Put a 10 on the end of a string of length n — 2 that has no two 

consecutive 05. There are S(n — 2) of these strings. Therefore S(n) = S(n — 1) + S(n — 2), which is the 

recurrence relation for the Fibonacci numbers. Also S(1) — 2 (both 0 and 1 are 1-length binary strings 

with no two consecutive 0s), S(2) = 3 (01, 10, 11 are 2-length binary strings with no two consecutive 05). 

So S(1) = 2 = F(3), S(2) = 3 = F(A), S) = S(2) + S(1) = 5 = F(5), etc. 

Here c; = 0 so the characteristic equation is 2 — су = 0, which has roots r; = 0, r, = сү. The solu- 

tion is S(n) = p(0)! + q(c)" ! = 4(с1)" ! where p + q = S(1), (с) = S2), so q = S(2)/c,. By the 

recurrence relation S(n) = c,S(n — 1), S(2)/c; = S(1), so q = S(1). The solution is S(n) = S(1)(cj)" !, 

which is the solution given by Equation 8 because g(n) = 0. 

P(n) = 4n — 3 

S(n) = (1 + log n)n 
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EXERCISES 3.3 


1. fori = 1 ton do 


3. 


low = roster[i].quiz[1] 
high = roster[i].quiz[1] 
sum roster[i].quiz[1] 


forj — 2tom do 
sum = sum + roster[i].quiz|j] ГА 
if roster[i].quiz[j] < low then 
low = roster{i].quiz[j] 
end if 
if roster[i].quiz[j] > high then 
high = roster{[i].quiz[j] 


end if 
end for 
sum = sum — low /IS 
sum = sum + high IIA 
write(“Total for student", i, “is”, sum) 
end for 


A total of n? additions is done. 


5. The overall number of output statements is n(log п) 


11. 


13. 


15. 


. 8. factorial(integer n) 


integer i 

factorial = 1 

if n — 1 then 
return 1 

else 
fori — 1 ton — 1 do 

factorial = factorial *(i + 1) 

end for 
return factorial 

end if 

b. п — 1 multiplications are done. 


. а. с has the value 4. When i = 1, product = 1*4 = 4, sum = —14 + 5 *4 = 6. When i = 2, 


product = 4*4 = 16, sum = 6+ (-7)*16 = —106. When i-—3, product = 16 * 4 = 64, 
sum = —106 + 2 * 64 = 22, so 22 is the final value, which is correct. 
b. The total work is 3л. 
The best case occurs when the first quiz grade 15 the lowest quiz grade for each student. Then the condition 
of the if statement is never true and the assignment statement within the if statement executes 0 times. The 
worst case occurs when all the quiz grades go downhill from beginning to end for each student. Then each 
new quiz grade is lower than the previous one, so the assignment statement within the if statement executes 
each time, or n(m — 1) times. Total assignments and comparisons in the best case = 3n + 2n(m — 1) 
and in the worst case = 3n + 3n(m — 1). 
a. After pass 1 the list is 5, 3, 4, 6, 2, 8. After pass 2 the list is 3, 4, 5, 2, 6, 8. After pass 3 the list is 3, 4, 
2, 5, 6, 8. After pass 4 the list is 3, 2, 4, 5, 6, 8. After pass 5 the list is 2, 3, 4, 5, 6, 8. 
b. B(1) = 0, B(n) = (n — 1) + B(n — 1) forn = 2 
c. B(n) = (n — 1)n/2 
n — | compares are always needed - every element after the first must be considered a potential new 
maximum. 
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17. S(n) = (n = 1)n/2 
19. a. The merged list is 1, 4, 5, 6, 8, 9; 3 comparisons 
b. The merged list is 1, 2, 3, 4, 5, 8; 4 comparisons 
c. The merged list is 0, 1, 2, 3, 4, 7, 8, 9, 10; 8 comparisons. 
21. M(1) = 0, M(n) = 2M(n/2) + (n — 1) forn = 2", п = 2 
23. selectionsort mergesort 
n=4 9 5 
n=8 35 17 
n= 16 135 49 
n = 32 527 129 
25. Original list: 9, 8, 3, 13. After 1st pass: 8, 3, [9], 13. After 2nd pass: 3, [8], [9], 13 —sorted 
27. 6 
29. Q(1) = 0, O(n) = (n — 1) + 20(п/2) for n = 2 
31. ОП) = 0, Q(n) = (n — 1) + Q(n — 1) юги z2 
33. If the original list is sorted in increasing order, then the first element of each sublist is the smallest ele- 
ment, so for the next pass the list of elements smaller than the pivot element will be empty and the list of 
elements greater than the pivot element will be only one element shorter than the sublist. 
35. Position at which x occurs Number of comparisons 
1 1 
2 2 
3 3 
n n 
37. For m = 1, F(m + 2) = F(3) = 2 and F(m + 1) = F(2) = 1. We need to show that if 1 division is 
required to find gcd(a, b), then a = 2 and b > 1. Because the Euclidean algorithm applies to posi- 
tive integers, b > 1. Because a > b, a > 2. Assume that if К divisions are required, a = F(k + 2), 
b= F(k + 1). Show that if k+ 1 divisions are required, then a = F(k + 3), b = F(k + 2). The 
first step of the algorithm in computing gcd(a, b) is to divide а by b, so a = qb + р, 0 = г «b. 
This is 1 division. The algorithm finishes the computation by finding gcd(b, r), which will there- 
fore require k divisions. By the inductive hypothesis, b = F(k + 2) and r=F(k + 1). Then 
а = 4Ь + т > р + т (д = 1 because a > b) = F(k + 2) + F(k + 1) = F(k + 3). 
m+1 
39. From Exercise 38, Ө < а, so, taking the logarithm to the 3/2 of both sides, m + 1 < log; 5a or 
m = log;sa m l. 
CHAPTER 4 


EXERCISES 4.1 


1. 
3. 


a. T b. F c. F d. F 


Four: {2,3,4} = {xlx e Nand2 = x = 4} = (3, 4, 2}, (a, b, с} = (xlx is the first letter of cat, bat, or 
apple}, Ø = {х|х is the first letter of cat, bat, and apple}, (2, a, 3, b, 4, c] 


19. 


21. 
2З. 
25. 


27. 
29. 


31. 
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id 10, l, 2, 3, 4} 

. {4, 6, 8, 10} 

. (Washington, Adams, Jefferson} 

@ 

. {Maine, Vermont, New Hampshire, Massachusetts, Connecticut, Rhode Island} 
» {-3, —2, = 1, 0 > l; 2, 3} 

. xx eNand1 xx 5} 

. {хіх є N and x is odd} 

. {xx is one of the Three Wise Men} 

. {x|x is a nonnegative integer written in binary form} 


E 


њо BO 


an с р 


. If A = {xx = 2" for n a positive integer}, then 16 є А. Butif A = {x|x = 2 + n(n — 1) for n a positive 


integer}, then 16 ¢ A. In other words, there is not enough information to answer the question. 


. а. Е b. T c. F d. T е. Т ББТ 
. b, e, and g are true; a is false because {1} є S but {1} ¢ R; cis false because {1} є S, but 1 ¢ S; dis false 


because | is not a set (the correct statement is {1} C U); fis false because 1 ¢ S 


. a, b, d, e, g, and i are true; c is false because neither member of C is a member of А; fis false because this 


2-element set is not an element of A; Л is false because a ¢ C 


. Let (x, y) є A. Then (x, у) lies within 3 units of the point (1, 4), so by the distance formula, 


М(х — 1) + (y – 4) = 3, ог (x — 1) + (y — 4) = 9, which means (x — 1} + (y — 4)? = 25, во 

(x, у) € В. The point (6, 4) satisfies the inequality (x — 1)? + (y — 4)? = 25, so (6, 4) € B, but (6, 4) is 

not within 3 units of (1, 4), so (6, 4) does not belong to A. 

a. For a = 1, b = —2, с = —24, the quadratic equation is x? — 2x — 24 = 0 or (x + 4)(x — 6) = 0, 
with solutions 6 and —4. Each of these is an even integer between — 100 and 100, so each belongs to Е. 

b. Here О = {6,—4}, but E = (—4, —2, 0, 2,4}, and OZ E. 

a, d, and e 

Let x € A. Then, because А C B, x e B. Because В С C, x e C. Thus A С С. 


The proof uses mathematical induction. n = 2: A set with 2 elements has exactly 1 subset with 2 elements, 
namely the set itself. Putting л = 2 into the formula n(n — 1)/2 gives the value 1. This proves the base 
case. Assume that any set with k elements has k(k — 1)/2 subsets with exactly 2 elements. Show that 
any set with k + 1 elements has (k + 1)k/2 subsets with exactly 2 elements. Let x be a member of a set 
with k + 1 elements. Temporarily removing x from the set gives a set of k elements that, by the inductive 
hypothesis, has k(k — 1)/2 subsets with exactly 2 elements. These are all of the 2-element subsets of the 
original set that do not include x. All 2-element subsets of the original set that do include x can be found 
by pairing x in turn with each of the remaining k elements, giving k subsets. The total number of 2-element 
ЩЕ 1), " (К + 1)k 
2 


subsets is therefore 


eS) = {©, (a 

For this set of four elements, the power set should have 2^ = 16 elements. 

p(S) = {@, {1}, {2}, {3}, {4}, {1,2}, {1,3}, {1,4}, {2,3}, 12; 4}, {3,4}, {1, 2, 3}, {1, 2, 4}, 11.3, 4}, 
{2, 3, 4}, {1, 2, 3, 4}} 

ё(5) = 10101.06), UO. {OH}, 02. (O3). (2. {@, 233, HID}, {@, 2333. {@, {д}. 
(2.603 } 


pA = s y} 
. Let x € A. Then {x} € (А), so {x} € p(B) and x € B. Thus А C B. A similar argument shows that 


В CAsothat A = B. 
.ax-—-ly-5 b. х= 8, у = 7 с. х= 1,у = 4 


39. 


41. 


43. 
45. 
47. 
49. 


51. 


53. 
55. 


57. 
59. 
63. 
67. 
69. 
71. 


73. 


75. 


77. 


79. 


81. 
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a. binary operation 

b. по; 0 ° 0 € №, so closure fails 

c. binary operation 

d. no; In x is undefined for x = 0 

a. no; operation undefined for x = 0 
b. binary operation 

c. unary operation 

d. binary operation 


n" 

a. AB + CD— * b. AB ** CD * — c. AC * BCDB * +/+ 
a. {t} b. {р, 9,7, $, 1, и} c. {q, r, У, w) а. 2 
а. {1, 2, 4, 5, 6, 8, 9} e. {2, 6, 8} 

b. {4, 5} f. 10, 1,3, 7, 9} 

c. {2,4} g Ø 

d. {1, 2,3,4, 5,9} 

a. {a} e. {a, {a}} 

b. {@, {a}, {a, {а}}} f. ©, (a, {a}}} 

с. {@, a, {a}, {{a}}, {a, {а = S g. (2) 

d. © 

c, e, and f 

a. B' d. ' ПС 

b. BNC е. В'ПС' о (ВО С) or B' С 

c. ANB 

a. С' b. BAD c. AMB d. A(1D' 

DNR’ 61. (NUP)NA 

a, b, d, and f 65. bandc 


a.B CA b. ACB c. 4=@ d. BCA е. 4 = В f. A=B 

Let x e A N B. Then x € A and x € B, so x є A. 

Let C € (4) N p(B). Then C € p(4) and C є p(B), from which C C A and C C В, so CCANB or 
C € g(A N B). Therefore (А) N p(B) C ф( АП B). The same argument works in reverse. 

Suppose В # Ø. Let x € B. Then x € AU B but x € A — B, which contradicts the equality of A U B 
and A — B. 

Let x є C. Then x € B — А = B N A’. Therefore x € A’, and no element that is in C can also be in A, so 
ANC=Ø. 

i. Let 4 C B and let x є A. Then x € В, so x € B' and no element that is in A can also be in B’. Therefore 
АГ\В' = Ø. ii. Let AM B' = Ø and let x є A. Then because 4B’ = Ø, x € B', so x є B. Therefore 
any element of А is an element of B, and A C B. 


a. р. {2, 4, 6, 7, 9} 
[0] 
c. x E€ (AUB) — (4N B) ex € (AU B) and x e (А ПВ) 4> (x e A or x є В) and 
x€A(YB«»(xeAandx АПВ) ог (хє Bandx € АПВ) €» (x є Aand x є B) or 
(хє Bandx є A) sx e (A – В) О (В — A) 
(uaàxeAUB«»xeAorxeBexeBorxeA«exeBUA(Ib)xeACXnBexeAandxe B 


э хє Вапйіхє4А < хє ВПА (2а) хє (АОВ) О Сюохє (АОВ) отхє С 6 (хє Аогх є B) 
orxeC = хє4АохєВо хєС < хєА ог (хє Вохє С) e»«xeAorxe(BUC) e 


83. 


85. 
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xeAU(BUC) 2b) хє (АПВПС) e хє (АПВ) and хє С e (xeAandxe B) and x e C 
< xeA and xe B and xe C © xeA and (xe Bandx e С) o xeA and хє(ВСС) e 
xE€AN(BNC) (35) xeAQn(BUC) 4+ xe and xe(BU C) ө хє А ап (xe Borxe С) e 
(хє Aandx є B) or (xeAandx EC) © хє (ANB) or хє (АСС) © xe(ACQB)U (An c) 
(4b)x EA NS > x € Aandx є 5 > x є A.Alsox є A > x є Aandx є Sbecause4 C $—5 x e ACAS 
(832x€AUA'—xeAorxeA' —5^xeSorxeSbecasseAC S,4 CS —^xeS.Alsoxe 5 > 
(x € Sandx € A) or (xe Sandx є A) > x EA orxeA' —^xeAU A' (5b) For any x such that 
x € ANA’, it follows that x є A and x € A’, or x belongs to A and x does not belong to A. This is a con- 
tradiction; no x belongs to A N A', and AMA’ = Ø. 


a.((4 U B)'Y(4 U В) = AU (BY B") (3a) 
=AUØ (5b) 
=Á (4a) 


The dual is (4 N B) U (AN В") = A. 
b. Капс 4 B)U (40 0) B')]U (4n Oy 


= [40 On (BU B)JU (an Oy (3b) 
= [4no)ns]U uan oy (5b) 
= (An C)U (4 Oy (4b) 
= 5 (5Ь) 


The dual is [((4 UC) UB)N(AUQUB)INAUO! =Ø. 
с. (AUQN[(ANB)U(C'NB)] 


=(AUQN[BNAU(BNC’) (1b) 
= (aUOon[snau c^] (3b) 
=(AUQN[AUC)NB] (1b) 
= [4UO n a4Uc»]nB8 (2b) 
= [4U (Cn C] B (3a) 
= (AUGQ)nB (5b) 
— 2 (4b) 

The dual is (4 '1 C) U [(4 U B) N (C' U B)] = AU B. 

a. AN(BU4A’) = (ANB) U(ANA’) (3b) 
=(ANB)US (5b) 
= АПВ (4а) 
= BNA (1b) 

b. (4UB) — C= (AUB)NC’ (defn. set diff.) 
= C’'N(AUB) (15) 
=(C'NA)U(C'NB) (3b) 
=(ANC)U(BNC) (1b) 
= (4А— С)О(В— С) (defn. set diff.) 

с. (4-B)-C=(4-B)NC (defn. set diff.) 
= (An B»)n cC (defn. set diff.) 
SC nans’ (15) 

- (C' 4) B' (2b) 
= (АГ\С)ГП\В' (15) 
= (4— С)ПВ' (defn. set diff.) 


=(4-C)-B (defn. set diff.) 


93. 


95. 


97. 
99. 
101. 


103. 


> op op 


a. 


b. 
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.A,U A, О: UA, = {хіх belongs to some А; for 1 = i < п} 
‚ A, UA, = {х|х € Aor x € A} forn = 2,4 ЈА, U + U A, = (4, Us: ОА, )QUA,forn > 2 


А ПАП: ПА, = {xx belongs to every A; for 1 <i € п} 


«AW Ay = {х|х € Aand x € A} form 2,4; CD 45 D NAA, = (4N... NA, A, forn > 2 
. Proof is by induction on л. For n = 2,BU(4,M4A,) = (BUA,)N(BUA,) by identity 3a. 


Assume that BU(A,0- ПА) = (ВОЈА): N(BUA). Then BU(A,O- NAD) = 
ВО(А, П: ПА) ПА) by Exercise 89b = (BU (А, П: 14) (8 ОА) by identity 
3a = (ВОЈА) П: -N(BUA))N(B ОА, 1) by inductive hyp. = (В ОА) П: (BU 44,4) by 
Exercise 89р. 


. Proof is by induction on n. For n = 2, B N (4, U 45) = (B N А,) U (BM 4) by identity 3b. Assume 


that  B(Y(4,U---UA) = (BÜO04)U---U(BO4). Теп ВП(А О: ЈА, р) = BN 
((A4,U::-U4A)UA,4,) by Exercise 87b = (BM (4,U-:-UA,)) U(BNA;,.,) by identity 
3b = (BN A,)U---U(BNA,)) U (В Г\ А„+ |) by inductive hyp. = (BC14) U---U(BEYA,,)) 
by Exercise 87b. 


Ua, = (xix e (—1,1)}; ()4i = {0} 


iel iel 
LJA; = кє [-L1 (4, = {0} 
iel iel 


P(1) is true—every member of T' is greater than 1; otherwise 1 would be the smallest member of Т. As- 
sume that P(K) is true, i.e., every member of T is greater than k. Consider P(k + 1), that every member of 
Т is greater than k + 1. If P(k + 1) is not true, then there is some member of T = k + 1. By the induc- 
tive hypothesis, every member of T is greater than k; therefore some member of Т equals k + 1, and this 
is the smallest member of 7. This is a contradiction, because we assumed that 7 has no smallest member. 
Therefore P(A + 1) is true. By the first principle of induction, P(n) is true for all n, and T must be empty. 
This contradicts the fact that T is a non-empty set. 


An enumeration of the set is 1, 3, 5, 7, 9, 11, ···, 
An enumeration of the set is a, aa, ааа, aaaa, · · · 


An enumeration of the set is shown by the arrow through the array 


(OA2) (0,3) (0,-3) 
(1.2) (1,3) (1.3) 


(1,2) (71,2) (-1,3) (-1,-3) 


(2, -1) (2,2) Q2 (2,3) (@,-3) 


Assume that the set has an enumeration 


211, Z12» Z13» 2145... 
221, 222, 2235 2245... 


231, 232, 233, 2345... 


Now construct an infinite sequence Z of positive integers with Z = zi, Z2, 23, ... such that z; = 2; for all i. 
Then Z differs from every sequence in the enumeration, yet it is a member of the set. This is a contradic- 
tion, so the set is uncountable. 
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105. Let А and B be denunmerable sets with enumerations А = а), а, аз, сс and B = b, b b3, · · · . Then 
use the list aj, bj, а, b2, аз, Рз, ‘`` and eliminate any duplicates. This will be an enumeration of A U B, 
which is therefore denumerable. 


107. B = {S|Sisaset and S ¢ S}. Then either B € B or B € B. If B € B, then B has the property of all mem- 
bers of B, namely B ¢ B. Hence both B € B and B ¢ B are true. If B ¢ B, then B has the property char- 
acterizing members of B, hence B € B. Therefore both B ¢ B and B € B are true. 


EXERCISES 4.2 


1. 30 19. 1680 37. 32 55. 60,466,176 
3. 92 21. 25,974,000 39. 160 57. 3,515,625 
5. 41. 8 59. 2704 
7. 43. 36 61. 144 

45. 6 63. 208 

47. 25 65. 96 

49. 648 67. 1104 

51. 72 

53. 36 


71. 


RB СЕВ С RBG RB С RB СКВ С 


33 ways; the outcome that is highlighted is BRGR. 


73. Form = 2, the result follows from the multiplication principle. Assume that for m = k, there aren,- © + n, 
possible outcomes for the sequence of events 1 to k. Let m = k + 1. Then the sequence of events 1 to 
k + 1 consists of the sequence of events 1 to k followed by event k + 1. The sequence of events 1 to k has 
nı: "п, possible outcomes by the inductive hypothesis. The sequence 1 to k followed by event k + 1 
then has ((n,* * + njn,, , outcomes by the multiplication principle, which equals n; © * : 1,44. 
75 a. Р(1) = 1 (trivial case), P(2) = 1 (only one way to multiply 2 terms). For n > 2, let the last 
multiplication occur at position А, 1 = k = n — 1. The product is then split into two prod- 
ucts of k and (n — k) factors, respectively, which can be parenthesized in P(k) and P(n — k) 
ways, respectively. By the multiplication principle, there are Р(А)Р(п — К) ways to parenthe- 
size for a fixed k. Each value for К gives a different set of parentheses, so by the addition principle, 
P(n) = Р(1)Р(п — 1) + PQ)P(n – 2) + +++ + P(n – 1)Р(1) = > POP — E). 
k=1 


b. The proof will use the Second Principle of Induction. P(1) = 1 = C(O), P(2) = 1 = C(1). Assume 
that P(r) = C(r — 1) for 1 Sr = m. Then P(m + 1) = > P()P(m + 1 – k) = 
k=1 


m 


> Ck — 1)C(m — k) = С(т). 
k=1 
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EXERCISES 4.3 


27. 


. 5 parts had all three types of defect. 
. a. 2 
. a. |A| + |B| + |С| + |р| 


ES 19. No 
. There are 3 pairs—1 and 6, 2 and 5, 3 and 4—that add up to 7. Each element in the set belongs to one of 


19 9. a. 39 b. 14 
60 11. 8 

13. a. 60 b. 40 
b. 6 


l4nB| — |anc| - IANO D| - IBN C| - IBA D| - |CND| + 
lananac|-lnsn»p|-«janennp|-«|snennp|- |ANBNCND| 

b.2"—] 

21. 51 23. 367 

these pairs. Apply the pigeonhole principle, where the pairs are the bins, and the numbers are the items. 


This follows from the pigeonhole principle, where the n possible remainders (the numbers 0 through 


n — 1) аге the bins. 


EXERCISES 4.4 


. 48 


(п = п (п = (п = (п = н)! 


‚а. 42 b. 6720 47. 902,720 
. 362,880 49. 495 
. 3,628,800 51. 40 
‚ 40,320; 15,120 53. 1770 
. 2730 55. 341,149,446 

19! 57. 220 
. (2!)(11!)(8!) = 2(39,916,800)(40,320) 59. 115 
. 18! 61. 14,307,150 
. а. 120 b. 36 c. 28 63. 4,412,826 
. C(300, 25) 65. 8,586,820 
. C(17, 5)C(23, 7) 67. 3003 
. 11,760 69. 2508 
. 427,518 71. 19,481 

73. 36 

. 22308 75. 20 

792 77. a. 3360 b. 420 

4 79. 27,720 

624 81. 21 
. 5,108 83. C(81, 48) — very big number 
. $4,912 85. a. 28 b. 7 c. 10 
. 1,098,240 87. a. 8008 b. 84 
. 482,320,623,240 89. 286 
. 11,662,691,040 

n! n! n! n! 
. P(n, n) = TUER s n! and P(n, n — 1) = TE Та n! 
n! n! 

. C(n,r) = C(n,n — r) 


Whenever r objects are chosen from n, n — r objects are not chosen. Therefore the number of ways to 
choose r objects out of л is the same as the number of ways to choose n — r objects out of n. 


95. 


97. 


99. 


101. 


103. 


105. 
107. 
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Consider selecting r elements from a set ofn and putting those in bucket A, then selecting k of those r to put in 
bucket B. The left side multiplies the number of outcomes from those two sequential tasks. Alternatively, we 
can select k elements from n and place them in bucket В, then r — k elements from the remaining n — k and 
place them in bucket A. The right side multiplies the number of outcomes from these two sequential tasks. 


Gud as 
2) = 3042) = 3 oro = 3-2 — 
1 1 6 6:5-4 
ТЕТРИ 
1 8  8:7:6:5 


14 


1 
4) = — 4 = = => 
СУТТИ Уре 


These results agree with the recurrence relation results. 
163452, 163542, 345621, 356421, 634521, 643125 


7431652; reading right to left, the first non-increasing value is 1. Again reading right to left, the first value 
greater than 1 is 2, so swap 1 and 2, giving 7432651. From the right of the 2 value, the numbers decrease; 
swap 6 and 1, giving 7432156. 

3675421; reading right to left, the first non-increasing value is 6. Again reading right to left, the first value 
greater than 6 is 7, so swap 6 and 7, giving 3765421. From the right of the 7 value, the numbers decrease; 
swap 6 and 1, swap 5 and 2, giving 3712456. 


24589, 24678, 24679, 24689, 24789 


Make the initial permutation the largest permutation, n...321. Then just reverse all inequalities in the body 
of algorithm permutation generator. 


EXERCISES 4.5 


a. à? + 5а + 10a°b* + 10a’b? + 5ab* + D? 
b. x° + 6x°y + 15x4y? + 20x77 + 15x?y* + бху? + у 
c. а? + 10a* + 408? + 80a + 80a + 32 
d. a^ — 16a? + 96a? — 256a + 256 
. 12047 p 
. —489,888x4 
. 65615? 
. 2560x? 
. Thinkof(a + b + cyas((a + b) + cf. Then ((a + b) + c = C, 0)(а + b + CGB, 1)(а + bye + 
CQ, 2)(а + b)? + C(3, 3)с = а? + 3à?b + Зар? + D + За?с + babe + 3b’c + Зас? + ЗЬс? + с? 
‚ 11,200 
. C(n + 2, r) = Cn + l,r — 1) + C(n + 1, r) (Pascal's formula) = C(n,r — 2) + C(n,r — 1) + 
C(n, г — 1) + C(n, г) (Pascal’s formula again) = C(n, r) + 2C(n,r — 1) + C(n,r — 2) 
. From the binomial theorem with a = 1, b = (—1): C(n, 0) — С(и,1) + С(и,2) — +++ + 


(= )"С(лп,и) = (1 + (-1))" = 0" = 0 


. From the binomial theorem with a = 1,b = 27!: 


С(п, 0) + C(n, 1)27! + C22? + +-+ + C(n, n2" = (1 + 271)" 
so, multiplying by 2”, 


C(n, 0)2" + C(n, 1)2"7! + C(n, 222? + -- + С(п, n7" = 291 + 271)" 
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or 


C(n, 02" + C(n, 1)2"7! + С(п, 2327? + +++ + Cn, п) = 


n "5l (2+1\ al 2 n 
(1+3) - 2324] -2(F) -3 


21. a. (1 + x)n = C(n, 0) + C(n, 1)х + C(n, 2? + C(n, 3? + +++ + С(п, п)х" 
b. Differentiating both sides of the equation in (a) gives n(1 + x)" = C(n, 1) + 2C(n, 2)х + 
3C(n, 3j? + +++  nC(n, nx"! c. follows from (b) with x = 1 d. follows from (b) with x = —1 
23. a. Out of all the intersections of m sets, 1 = m = k, we want the ones that pick all m sets from the k sets 
in В. There are C(k, m) ways to do this. 
b. For any value ofm, 1 = m = k, an intersection of m sets has C(x, m) that include only sets in В. Count- 
ing the intersections in the right side of (1) that contain x (intersections of single sets, intersections of 
2 sets, and so forth) we get C(k, 1) — C(k, 2) + C(k, 3) —:-- + (1) СА, k) 
c. From Exercise 17, C(k, 0) — C(k, 1) + C(k, 2) — +++ + (-DIC(k, k) = 0 or Clk, 1) — €(&2) + 
-++(—1)'*!C(k, k) = C(k, 0), but C(k, 0) = 1. Therefore x is counted only once in the right side of (1). 


EXERCISES 4.6 


Some decimal answers in this section are approximations. 


1. 8 13. 1/12 25. 1326 

3. 1/8 15. 8 27. = 0.5588 
5. 1/4 17. 1/8 29. = 0.3824 
7. 36 19. 1/4 31. & 0.0498 
9. 1/6 21. 1/52 зз. = 0.0023 
11. 1/6 23. 1/2 


35. The answer to Exercise 30 should be the sum of the answers to Exercises 28 and 29 because “at least one 
spade” is either exactly one spade (Exercise 29) or two spades (Exercise 28). Using the probabilities ob- 
tained, this arithmetic is correct. 


37. a.1000 b. 0.001 c. 0.006 d. 0.003 

39. a.69,090,840 b. = 0.0000017 c. = 0.000295 d. = 0.0097 
41. = 0.0000015 47. = 0.021 

4з. = 0.0002 49. = 0.423 

45. = 0.002 51. 365" 


53. B = E',so from Exercise 52, P(B) = 1 — P(E) = 1 — P(365, n)/365". 
55. B has a higher probability because В consists of exactly two + exactly three + ---+ exactly n persons 
sharing the same birthday, whereas C consists of exactly two. 


57. 38 67. a. 0.55 b. 0.68 c. 0.32 
59. = 0.026 69. a. 0.6 b. 0.25 c. 0.65 
61. = 0.105 d. 0.15 e. 0.95 

63. = 0.00000751 71. =0.93 


65. 6.29908E-12 73. а. =0.24 b. 0.43 c. 0.57 
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75. 0.25 83. 0.375 
77. 0.3125 85. 0.875 
79. 0.5 87. 0.5 
81. 0.5 aD 
89. а. P(E; |F) = — —- (1) 
Р(Р) 
Р(ЕПЕ) 
P(F | E) = PE) 9 P(E N E) = P(F | E) (E) (2) 


Because P(F N E) = Р(Е,С\Е), substitute from Equation (2) into Equation (1), giving 


P(F | E) P(E) 
Р(Р) 
b. The events Е,1 = i = n, are all disjoint; therefore the events FM Е,1 <i = п are all disjoint. 


F-— F(S- ЕП(Е ОЕ, О... ОЕ) = (ЕПЕ) U(FNE,)U...U(FME,). Because the prob- 
ability of the union of disjoint events is the sum of the probabilities of each event, 


Р(Е, |Р) = 


PE) = > PF Eg. 
k=1 
c. From Equation (2) of part (a), Р(ЕП E) = P(F | E) P(E). Substituting into the result of part (b) gives 
PP) = > P| E) PED 
k=1 


d. Substituting the result of part (c) into the result of part (a) gives 


P(F | E) (E) 
Р(Е,| F) = —, 
> PE | E) PED 
k=1 
91. = 0.40 
93. a. 3.5 
b. = 3.29 


c. Less than (which turns out to be true, 3.29 < 3.5). The reason is that the die is now weighted toward 
a smaller value than the previous expected value, so this smaller value is more likely to occur and will 
drag down the weighted average. 
95. 4.75 
97. a. = 0.904 b. = 0.999 c. = 0.096 
99. z 0.547 
n? + Зп 


101. — — — 
ud TES 
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CHAPTER 5 


EXERCISES 5.1 


1. a. (1, 3),(3, 3) 3. a. (1, —1),(—3, 3) 
b. (4, 2),(5, 3) b. (19, 7),(41, 16) 
с. (5, 0),(2, 2) с. (—3, —5),(—4, 1/2),(1/2, 1/3) 
d. (1, 1),(3, 9) d. ((1, 2),(3, 2)) 

5. a. УА b. 


| } | } | Га 
-3 -2 -I Qu x 

—1 == 

—2 ==, 

—3 == 
c УА а 

(0, 5) 
(—5, 0) (5, 0) 
> 
X 
(0, —5) 
7. a. many-to-many 9. a. (2, 6,3, 17),(0, 0) 
b. many-to-one b. (2, 12) 
c. one-to-one c. none 
d. one-to-many d. (1, 1),(4, 8) 
11. a. reflexive 13. a. reflexive, transitive 
b. reflexive, symmetric, antisymmetric, transitive b. reflexive, symmetric, transitive 
c. none c. symmetric 
d. antisymmetric, transitive d. transitive 
e. reflexive, symmetric, transitive 


15. (b); the equivalence classes are [0] = {..., —9, —6, —3, 0, 3, 6,9, ...},[1] = {..., -8, 5, -2, 1,4, 
7, 10, ...}, and [2] = {..., –7, —4, –1, 2, 5, 8, 11, ...). 
(е); the equivalence classes are sets consisting of squares with equal length sides. 


17. 


21. 


23. 


25. 
27. 


29. 
31. 


33. 


35. 


37. 


39. 
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. reflexive, transitive 

. antisymmetric, transitive 

. reflexive, symmetric, transitive 
. antisymmetric 


a. symmetric 19. 
b. antisymmetric, transitive 

c. reflexive, antisymmetric, transitive 

d. reflexive, symmetric, transitive 


по = p 


For example: 

a. 5 = set of all lines in the plane, x py <> x coincides with y or x is perpendicular to y. 

b. S = set of integers,xpy «< Psy’ 

с. 5 = set of nonnegative integers, xpy <» x «y 

d. 5 = set of integers, xpy €» x = |у. 

a. reflexive closure = p itself, symmetric closure—add (2,1),(3, 2), transitive closure—add (2, 1),(3, 2) 

b. reflexive closure = symmetric closure = transitive closure = p itself 

c. reflexive closure—add (2, 2),(3, 3), symmetric closure—add (2, 1),(3, 2), transitive closure—add 
(1, 1), (2, 1), (2, 2), (3, 3) 

а. reflexive closure—add (2, 2), (3, 3), symmetric closure—add (2, 1), (3, 2), (3, 1), transitive closure 
=p itself 

x p* у <> one can fly from x to y (perhaps by multiple hops) on Take-Your-Chance Airlines 

No—f the relation is irreflexive, it is its own irreflexive closure. If the relation is not irreflexive, there 

must be some x € 5 with (x, x) in the relation; extending the relation will not remove this pair, so no ex- 

tension can be irreflexive. 


"d 


ate b. b c С: (a,c) (а, Б) 
b MV ed 


(c) (b) 
a 
Reflexivity: If x є A, then x € S, so (x < x) because < is a reflexive relation on S. Symmetry: if x, y є A 


and x < y, then x, y € S and x < y, so y < x because < is symmetric on S. Transitivity: if x, y, z є A 
and x < y and y < z, then x,y,z € S, x < y, and y < z, so x < z because ~ is transitive on S. 


dx cx 


The two graphs are identical in structure. 

Reflexive: (51, tj) u (s;, tj) because both s; ps, and f, ø t, due to reflexivity of p and с. Antisymmetric: 
(Si, tj) (55, to) and (55, b) u (Si, ti) — S1 P S2 and sy ps), tj o t; and 50 ti > s, = s, and f, = і due to 
antisymmetry of p and с — (5), tj) = (S2, b). Transitive: (s1, £j) u (5, ty) and (s5, t5) ш (53, 3) > S1 p S2 
and s» р 5з, росі, and t,0 t, — s, ps, and tiot, due to transitivity of p and с — (5s), fj) и (ss, 5). 


Assume that p is reflexive and transitive on S. Then for all x € S, (x, x) € p, which means (x, x) € p !, 


so (x) рр! and рр! is reflexive. Let (x, y) e pMp |. Then (x,y) ep and (х,у) ep |, 
which means (x, y) € p and (y, x) € p. This implies (y, x) € р! and (у, х) є p, so (y, x) e p N p | and 
pp ! is symmetric. Let (x, y) € p N p ' and (у, z) € p N p |. Then (x, y) € p and (x, y) € p | and 
(y, z) e p and (у, z) € pl, so (x, y) € p and (y, x) є p and (y, z) € p and (z, y) є p. Because p is transi- 
tive, (x, 2) € p and (z, х) € p or (x, z) e p and (x, z) ep ',so(x,z) e рГ\р 'and p N p™! is transitive. 


41. 


43. 


45. 


47. 
49. 


51. 


53. 


55. 


57. 


59. 
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Reflexive: X < X because x; = x, 1 S i = k. Antisymmetric: Let X < Y and YX X. If X z Y, let 
m + 1 be the first index where х, {у A Уш+1. Then Xm+1 X Ym+1 and у Ж X444 9 Xm+1 = уь, а 
contradiction. Transitive: Let X < Y and Y < Z. Then x, < y, for some p =k and y, < 2, for some 
а =k. Let m = min(p, q). Then x,, < Zm and X < Z. Therefore < is a partial ordering. It is a total 


ordering by "otherwise". 


a. when; no; all but the last 
b. Old 


Cole 
merry soul 


a 


Maximal elements: a, merry, soul 

a. [a] = {a, c} = [с] 

b. [3] = 1, 2, 3}, [4] = 14, 5} 

[1] = {...‚—5, —3,—1,1,3,5,...} 

Ifx = у(шойл)Шепх — у = k,n forsome integer kj, orx = Күп + y.Ifz = w(modn)thenz — w = kn 

for some integer k, or z = kn + w. 

a. x + z = (дп + у) + (ои + w) = y + w + (k + kn, so х+7— (y + w) = (k, + k)n where 
kı + k is an integer, and x + z = y + w (mod n). 

Ь.х— 2 = (kn + y) — (kan + у) = y — w + (k Б)и, so x -z (у — w) = (АД, — kn where 
kı — kj is an integer, and x — z = y — w (mod n). 

с. xz = (kn + у)(&п + w) = kk? + уп + wkn + yw = (п + yk, + wk,)n + yw so 
xz — yw = (Коп + yk, + wk,)n where kikan + yk, + wk, is an integer, and xz = yw (mod n). 


dx —y = (ип + у* — у = YO DEM] — y 
k=0 


y=} 


$—1 А 
= | Xia exu | + у у = п сб, OE IE = nk 
ү) к=0 


where k, is an integer and x? = y? (mod n). 


a. {(1, 1), (2, 2), (3,3), (4, 4), (1, 2), (2, 1), (3, 4), (4, 3] 

b. {(a, a), (b, D), (c, c), (d, d), (e, е), (а, b), (b, a), (а, с), (c, a), (b, c), (c, b), (d, е), (e, а)} 

Reflexive: x? — x? = 0, which is even. Symmetric: If x? — y? = 2n then y? — x? = —2n, which is even. 

Transitive:ifx’ — y? = 2nandy? — 22 = 2m,thenx* — 22 = ж – y? + y? — 22 = 2n + 2m = 2(n + т), 

which is even. The equivalence classes are the set of even integers and the set of odd integers. 

Reflexive: (x, y) p (x, y) because y = y.Symmetric:If(x, у) р (z, w)then y = wsow = yand(z, w) p (x, y). 

Transitive: If (x, y) p (z, w) апа (z, w) p (s, t) then y = w and w = t, so y = t and (x, y) p (s, t). The 

equivalence classes are sets of ordered pairs with the same second components. 

a.Reflexive: x p x because x and x start and end with the same bit values. Symmetric: If x p y then y 
starts and ends with the same bits as x, so x starts and ends with the same bits as y and therefore y p x. 
Transitive: If x p y and y p z, then y starts and ends with the same bits as x and z starts and ends with the 
same bits as y, so z starts and ends with the same bits as x and therefore x p z. 

b. 256 

c. 4; there are 4 start bit-end bit combinations, 0...0, 1...0, 0...1, and 1...1. 

d. 2° 

Clearly P «» P is a tautology. If P< О is a tautology, then P and О have the same truth values every- 

where, so О < P 15 a tautology. If P «o О and О  R are tautologies, then Р, О, and А have the same 
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truth values everywhere, and P <> R is a tautology. The equivalence classes are sets consisting of м 
with the same truth values everywhere. 


61. а. 1 Ь. 2 с. 5 а. 15 
63. Answers agree with Exercise 61. 
65. а. 3 b. 7 
67. Answers agree with Exercise 65. 
69. The number of blocks in a partition can range from 1 (the whole set) to л (a single element in each block). 
The result follows by the definition of S(n, k) and the addition principle. 
71. 6 
73. a. 25, 49 b. (3, 4, 5), (0, 5, 5), (8, 6, 10) c. (—4, 4, 2, 0), (—6, 6, 0, —2) 
75. 6 
EXERCISES 5.2 


1. Yes; for example: 1, 2, 3, 8, 4, 5, 6, 7,9 


3. 


C(2) 
B(5) 


AQ) D(6) 


eH(1) 
E(2) 
00 


G(4) 


5. Minimum time-to-completion is 17 time units. Critical path: E, 4, D, B, H 


7. Minimum time to completion is 13 time units. Critical path: E, 4, C, B, H 
9. For example: С, Н, F, D Е, C, A, B 

11. For example: Е, A, C, D, GF, B, Н 

13. For example, 6, 9, 1, 7, 8, 11, 2, 3, 5, 10, 4 

15. For example, 5, 6, 4, 1, 2, 3, 7, 10, 8, 11,9 


EXERCISES 5.3 


1. 


a. Give the name, type, and breed of all pets that are cats. 
b. 2 
c. SELECT PetName, PetType, Breed FROM Pet WHERE PetType = “Cat”; 
WhatAml 
PetName Breed 

Spot Hound 

Twinkles Siamese 

Lad Collie 

Lassie Collie 

Mohawk Moorish idol 

Tweetie Canary 

Tiger Shorthair 


11. 


15. 
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897 


Author 


The writes relation is many-to-many; that is, one author can write many books, and one book can have 
more than one author. 


: Results7 9. Results9 
Name Country ISBN Title Publisher | Subject 
East, Jane U.S. 0-364-87547-X | Early Tang Paintings | Bellman | Art 
Kovalsco, Bert | U.S. 0-56-000142-8 | Springtime Gardening | Swift-Key | Nature 
Results11 13. Results13 
Name Publisher | Subject 
Chan, Jimmy Bellman Art 
East, Jane Harding Art 
King, Dorothy Harding Nature 
Kovalsco, Bert Loraine Nature 
Lau, Won Swift-Key Nature 
Nkoma, Jon In some database systems you have to specify, via something 
Quercos, Tom like a “DISTINCT” clause or a “Unique Values” property, that 
duplicate tuples should be eliminated. Otherwise in this exam- 
ple, (Harding, Nature) would be listed twice. 
Results15 
ISBN Name Title Publisher | Subject 
0-115-01214-1 Nkoma, Jon Birds of Africa Loraine Nature 
0-364-87547-X | Lau, Won Early Tang Paintings Bellman Art 
0-364-87547-X | Chan, Jimmy Early Tang Paintings Bellman Art 
0-56-000142-8 | East, Jane Springtime Gardening | Swift-Key | Nature 
0-816-35421-9 King, Dorothy Springtime Gardening | Harding Nature 
0-816-53705-4 | Kovalsco, Bert | Baskets for Today Harding Art 
0-816-88506-0 | King, Dorothy Autumn Annuals Harding Nature 


19. 


21. 


2З. 


25. 


27. 


ae ут} 


е 


е 


а. 


е 


а. 
а. 
b. 


898 Answers to Odd-Numbered Exercises 


. project(restrict Book where Subject = "Art") over Title giving Results 17 
. SELECT Title FROM Book WHERE Subject = “Art”; 
. Range of x is Book, {x.Title|x.Subject = “Art”} 
Results17 
Title 
Baskets for Today 


Early Tang Paintings 


. project(join(restrict Book where Publisher = *Harding") and Writes over ISBN) over Name giving 


Results19 
SELECT Name FROM Writes, Book WHERE Book.ISBN = Writes.ISBN AND Publisher = 
“Harding”; 


. Range of x is Writes, Range of y is Book, {x.Namelexists y(y.Publisher = “Harding” and 


y.ISBN = x.ISBN)} 


Results19 
Name 


King, Dorothy 


Kovalsco, Bert 


. project (join(join(restrict Author where Country = “U.S.”) and Writes over Name) and Book over 


ISBN) over Title giving Results21 


. SELECT Title FROM Author, Book,Writes WHERE Author. Name = Writes.Name AND Writes. 


ISBN = Book.ISBN AND Country = “U.S.”; 


. Range of x is Book, Range of y is Author, Range of z is Writes, {x.Title | exists у, z(v.Country = 


“U.S.” and y.Name = z.Name and z.ISBN = x.ISBN} 
Results21 
Title 
Baskets for Today 


Springtime Gardening 
project(join(join(restrict Author where Country = “England”) and Writes over Name) and 
(restrict Book where Subject = "*Art") over ISBN) over Name, Title giving Results23 


. SELECT Author.Name, Title FROM Book, Author, Writes WHERE Author.Name — Writes.Name 


AND Writes. ISBN = Book.ISBN AND Country = “England” AND Subject = "Art"; 


. Range of x is Author, Range of у is Book, Range of z is Writes, { х. Name and y. 


Title|x.Country = “England” and y.Subject = “Art” and exists z|z.Name = x.Name and 

y.ISBN = z.ISBN} 

The empty set; there are no results that match this query. 

р*9 

If the common attribute is sorted in each table, Шеп the join can be performed by doing something 
similar to a merge sort (see Exercise 19 in Section 3.3) on the common attribute, which means at most 
(р + д) rows would need to be examined. 

14 


‚42 
. SELECT Author.Name, Title, Book.ISBN, RoyaltyPercent FROM Author, Book, Writes WHERE 


Author.Name = Writes. Name AND Writes. ISBN = Book.ISBN AND RoyaltyPercent <100; 
ROYALTY LESS THAN 100 
Name Title ISBN RoyaltyPercent 
Chan, Jimmy Early Tang Paintings 0-364-87547-X 20 
Lau, Won Early Tang Paintings 0-364-87547-X 80 
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29. a. Yes—every attribute described for Employee, Contribution, and Payment is listed and no attributes not 
mentioned have been included. 

b. The primary key of Employees is EmployeeID, presumably a unique identifer for each employee. 
Likewise, the primary key of Contribution is ContributionID. The primary key of Payment is the com- 
posite key ContributionID / PaymentDate. None of the three attributes alone uniquely identifies a pay- 
ment, nor does ContributionID / PaymentAmount [a given contribution may result in the same payment 
amount on several different dates] or PaymentDate / PaymentAmount [multiple contributions can pay 
the same amount on the same payment date]. But a given contribution does not have different payment 
amounts on the same date. 


31. a. The three relation tables are 


EMPLOYEE 
EmployeelD | FirstName LastName Department 
1 Mary Black Accounting 
2 June Brown Payroll 
3 Kevin White Accounting 
4 Kelly Chen Payroll 
6 Conner Smith Sales 
CONTRIBUTION 
ContributionID EmployeelD | ContributionDate | TotalAmount NumberofPayments 
101 1 1/1/2013 $300.00 3 
102 3 1/1/2013 $500.00 2 
103 6 1/1/2013 $150.00 2 
104 4 4/15/2013 $100.00 1 
105 1 6/1/2013 $210.00 3 
107 2 6/1/2013 $300.00 2 
108 2 1/1/2014 $600.00 12 
109 3 1/1/2014 $500.00 2 
PAYMENT 
ContributionID PaymentDate PaymentAmount 
101 1/15/2013 $100.00 
101 1/31/2013 $100.00 
101 2/15/2013 $100.00 
102 1/15/2013 $250.00 
102 1/31/2013 $250.00 
103 1/15/2013 $75.00 
103 1/31/2013 $75.00 
104 4/30/2013 $100.00 
105 6/15/2013 $70.00 
105 6/30/2013 $70.00 
105 7/15/2013 $70.00 
107 6/15/2013 $150.00 
107 6/30/2013 $150.00 
108 1/15/2014 $50.00 
109 1/15/2014 $250.00 


33. 


35. 
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b. EmployeeID in the Contribution table is a foreign key into the Employee table. ContributionID in the 
Payment table is a foreign key into the Contribution table. 

c. Because the EmployeelID is just a sequence of sequential integer values that are likely to have no mean- 
ing outside the database, it is probably a blind key generated automatically by the database system. 

SELECT Employee.EmployeeID, PaymentDate, PaymentAmount FROM Employee, Contribution, 

Payment WHERE Employee.EmployeeID = Contribution. EmployeeID AND Contribution. 

ContributionID = Payment.ContributionID AND PaymentAmount > 100; The result is 


EmployeelD | PaymentDate PaymentAmount 
2 6/15/2013 $150.00 
2 6/30/2013 $150.00 
3 1/15/2013 $250.00 
3 1/31/2013 $250.00 
3 1/15/2014 $250.00 


SELECT FirstName, LastName, PaymentAmount, PaymentDate FROM Employee, Contribu- 
tion, Payment WHERE Employee.EmployeeID = Contribution.EmployeeID AND Contribution. 
ContributionID = Payment.ContributionID AND PaymentDate = “1/15/2013”; (The last equality is 
likely to need a system-dependent additional function to convert the 1/15/2013 to a true date type so the 
equality test will work.) The result is 


FirstName LastName PaymentAmount | PaymentDate 
Mary Black $100.00 1/15/2013 
Kevin White $250.00 1/15/2013 
Conner Smith $75.00 1/15/2013 


EXERCISES 5.4 


1. 


D 


13. 
15. 


ofa с> Poem р р 


a. Domain = {4, 5, 6, 7, 8} codomain = {8, 9, 10, 11} range = {8, 9, 10} 
b. 8, 10 

с. 6, 7 

d. no, no 


‚ а.{(0, —1), (1, 1), (2, 3)} 


b. {(1, 1), (2, 3), (4, 7), (5, 9)} 
с. {М 7, 2\/ 7—1), (1.5, 2)} 


. f(A) = {3, 9, 15} b. f(A) = all integral multiples of 6 

‚ f(S) = {3,7} b. f(S) = {1,3} c. f( = {2,1,0} 
F b. F c. T d. F 

. not a function b. function c. function; one-to-one and onto 

. not a function e. not a function 

. not a function b. function, onto, not one-to-one c. function, one-to-one, not onto 
function 

. not a function 

. function; onto 

. bijection; f~': (p, q, r} > (1,2, 3} where f~' = {(q, 1), (у, 2), (р, 3)} 

. function; one-to-one 


f. bijection; h~': R? > R? where A~ (х,у) = (y — Lx — 1) 


17. 


19. 


21. 


2З. 


25. 


27. 
29. 
31. 


33. 


35. 


37. 
39. 


41. 


43. 


45. 


47. 


49. 


51. 
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Any odd value of л will produce a bijection because the function will look similar to Figure 5.12. For even 
values of л the function is not one-to-one because positive and negative values map to the same result. 
fis neither one-to-one nor onto. For example, f(xxy) = (ууу) = 3, so fis not one-to-one. For any string 
5, f(s) = 0; there are no strings in A* that map to negative values, so fis not onto. 
fis both one-to-one and onto. If f(s,;) = f(s;) then s, = s; (just reverse the strings again and you get back 
where you started), so fis one-to-one. Given any string s in A*, let y be its reverse. Then y is in A* and 
Ду) = s, so fis onto. 
fis neither one-to-one nor onto. For example, f({a, b}) = f({b, с}) = 2, so fis not one-to-one. The range 
of f is the set of sizes of all subsets of (a, b, с}, which is (0, 1, 2, 3}, so this is clearly not Z and fis not 
onto. 
For example, f(x) = 1/x. For x = 1, the value 1/x is greater than 0 but less than or equal to 1, so 
f. S Т.І fixi) = Дх»), then 1/x, = 1/x, and x, = xz, so fis one-to-one. Given any value y in T, that 
is, 0 < y = 1, the value l/y is in Sand f(1/y) = 1/(1/y) = y, so fis onto. 
a. 3 b. 0 c. 0 

The greatest integer = x is the same as the smallest integer = x; therefore x is an integer. 
Let k € x < К + 1 where & is an integer. Then / is the greatest integer that is less than or equal to x, so 
[x] = К. Also, multiplying the entire inequality by —1, which reverses the direction of the inequality 
signs, gives —k = —x > —k — 1 which means that —K is the smallest integer greater than or equal to —x, 
so | —x] = —k. Multiplying both sides of this equation by -1 gives —[—x] = k. 
a. False. Let x = 3.6. Then [|x|] = [3] = 3 # x. 
b. False. Let x = 4.8. Then |2x| = |9.6| = 9 but 2|x| = 2(4) = 8. 
If 2° < n < 2** then log(2") < logn < log(2‘*!)ork < log n < k + 1and|log n| = k,[logn] = k + 1. 
a. 9 b. 0 с. 4 d. 2 
False. For example, let x = 7 and y = 9. Then x mod 10 + y mod 10 = 7 + 9 = 16 but (x + y) mod 
10 = 16 mod 10 = 6. 
a. (1, D), (2, 0), (3, 1), (4, 0), (5, 1) 
b.cj (x) = lox E A and x E B »c4(x) = 1 and сх) = 1 © сх) свх) = 1 

с. If c,(x) = 1, then x e A’ and x ¢ A,soc,(x) = 0 = 1 — с, (х). Ес, (х) = 0. then x ¢ A’ and x E€ A 

so сх) = 1 = 1 — c4(x) 
. No. Let S = {1, 2,3}, A = {1, 2}, В = {2,3}. Then c,y,(2) = 1 but c,(2) + ¢,(2) = 14+ 1. 
. S(0, n) > A(0, n) for n > 1 because n" > n + 1. 
- SC, n) = S(O, 500, п)) = S(0, n") = n" 
c. The googolplex is 10?" = 10'% = S(1,10) 


c?» 2 


gof = {(1, 6), (2, 7), (3, 9), (4, 9] 
a. 18 b. 16 с. Зх + 3 
d. 3x + 1 ex+2 f. 9x 


a. If Asi) = f(s5) then g(f(si)) = g(f(s2)) so (g ° f )(s1) = (е ° f)(S2). Because g ° f is one-to-one, 5 = s; 
and therefore fis one-to-one. 

b. For u € О, there exists s Є S such that (g ° f (s) = и, because g ° f is onto. Thus g( f(s)) = u and f(s) 

is a member of T that is a preimage of и under g, and g is onto. 

Let S = {1,2,3}, T= {1,2,3,4}, U = {1,2,3}, f= {0, 1), (2, 2), (3,3)}, g = {(1, 1), (2, 2), 

(3, 3), (4, 3)}. Then f S > Т,е: T О, р is not one-to-one but g° f= ((1, 1), (2, 2), (3, 3)} is 

one-to-one. 

d. same example as for (c) 


a. f (x) = x/2 b. Ух) = Wx с. flx) = 3x -— 4 


Ha 


53. 
55. 


57. 


59. 
61. 
63. 
65. 


67. 


69. 
71. 


7З. 
75. 


77. 
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a. (1, 3, 5, 2) b. (1, 4, 3, 2, 5) 

Both ho(gef) and (л° о) °/ have domain and codomain A. For x € A, (л ° (6° /))(х) = 
h((g of )(x)) = A(g(f(x))) = (Л ° D) = (л » а) » fE). 

a. (1,2, 5, 3, 4) 

b. (1,7, 8) (2,4,6) 

с. (1, 5, 2, 4) ° (3, 6) 


d. (2, 3) ° (4, 8) ° (5, 7) 

a. (a, d, e, b) b. (d, e) c. (a, d) ° (c, e) 

f! = (2,4,3,8) 

a. 34 b. 36 

a. For 15| = 2, 2!/0! = 2 and 2? — C(2, 1)(1) = 4 — 2 = 2. For |S| = 3, 3!/0! = 6 and 


3? — С(3,1)02)° + CGB, 2)(1)? = 27 — 3-8 + 3 = 6. For |S| = 4, 41/0! = 24 and 
4* — C(4, 1)(3)* + C(4, 2)(2)* — C(4, 3)(1)* = 256 — 4-81 + 6:16 — 4 = 24. 

b. Assume fis onto. If two distinct elements of S map to one element of S, then n — 2 elements are left to 
map onto n — 1 elements, which cannot be done. Therefore fis one-to-one. Now assume f is one-to- 
one. Then the л elements of S map to n distinct elements of 5; thus every element of S is in the range of 
/, and fis onto. 


c. For example, 5 = N, f № > № given by f(x) = 2x. 

d. For example, 5 = №, £ № — N given by f(0) = 0,f(@) = x — 1 forx = 1. 

a. п" 

b. n! 

c. п! 

d. п! 

1 e | lv 1 

e. n! 20 3 F( m | | 

f. The number of derangements (answer d) is < n! А = qnl 2 « n! and л! « n". The total number 
of functions, with no restrictions, is the maximum. Only some of these functions are one-to-one and 
onto, but this is the definition of a permutation as well. Not all permutations are derangements, so the 
number of derangements is smaller still. 

1854 


a. Forx € S, f(x) = f(x),so x p x and p isreflexive. For x, y € S,ifx p y then f(x) = f(y) and f(y) = f(x), 
so y p x and p is symmetric. For x, y,z € S, if x p y and y pz, then f(x) = f(y) and f(y) = f(z), so 
f(x) — f(z) and x p z, so p is transitive. 

b. If f is a one-to-one function, then no two elements of S map to the same value, so the equivalence 
classes each consist of a single element. 

c. [4] = (4, —4} because f(4) = f(—4). 

a. Um, n, о, р} b. {n, о, р}; {о} 

Reflexive: S p S by the identify function. Symmetric: If S pT and f is a bijection from S to T, then 

f hT- Sand f ~' isa bijection, so T pS. Transitive: If S pT and T p U, f. S > T, g: T > U, fand g 

bijections, then g ° f: 5 > U and g ° f isa bijection, so S pU. 

a. (define (square x) b. 18 

(* xx)) 


EXERCISES 5.5 


1. 
3. 
5. 


For example, по = 1, су = 1/34, c; = 1. For x = 1, (1/34)(17х + 1) E x € 1(17х + 1) 
For example, my = 2, c, = 1,0 = 2. For x = 2, l(15x? + x) = 29 — 4х — 15 = 2(15x? + x) 


For example, л, = 1, с, = 1, с = 2. For x = 1, 103) =x + logx = 2x. 
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. Yes. For example, in Exercise 1, we could use the constants пу = 1, с = 1/34, c; = 1/10. Then the 


envelope would be entirely below g(x), but it still follows the general “shape” of g(x). 


2 11. lim —— е 
"ITE Do x5*]7 17 
1 
Ж . 1 . log . d 
. lim — = lim — = 0 15. lim —— = lim = 0 
x0 y x0 Dx xoo Xx хоо 1 


. [200 log х] = [41 Inx2] < [Wx] < [420 x] < [17 x logx] < [102 — 3x + 5] < [2:32] 
. Knowing that the algorithm is O(n°) tells you only that the growth rate is less than or equal to n°; it could 


be n°, п, n log n, n, etc. Knowing that it is o(n*) tell you that the growth rate is less than n°, but again it 
could be n’, п log n, etc. The most useful information is that the growth rate is O(n’), essentially growing 
at a constant times a parabolic 


. S(n) = O(n’) 23. S(n) = O(n log n) 
. S(n) = O(n!) = O(n) 
. a.C'(n) = O(n log n). 


b. The exact solution for С(и) is C(n) = n(log n) — n + 1, which is also O(n log n). 


EXERCISES 5.6 


"00 2 


. 25 mod 6 = 1, 11 mod 6 = 5, 14 mod 6 = 2, and (5 + 2) mod 6 = 1 
. 262 mod 13 = 2, 74 mod 13 = 9,188 mod 13 = 6, and (9 + 6) mod 13 = 2 
. 486 mod 5 = 1, 18 mod 5 = 3, 27 mod 5 = 2, and (3: 2) mod 5 = 1 


. Letx = qn +r, 0 Sr < папа y = фп + r4, 0 = r, € п, so x mod n =r, and y mod n = ғ. Then 


xy = (qiqnn + gr,  qarj)n + (түс rj) m rr, < n’. Let rr, = kn +r with 0 = r < n. Then 
ху = (quqon + Ф + qira + Юп + r with 0 Sr < n, so (x: y) mod n = r. Also x mod n : y mod 
n=r; rn = kn + rwith0 = r < n,so (x mod zn · y mod 7) mod n =. 


. Answer c is correct. 
11. 


0 33 13. a. The values are stored in locations b. 58 hashes to location 7, 
1 1 6, 14, 1, 7, 8, 2, 16, 9, 0. which contains another ele- 
2 13 0 50 ment (40), so, following the 
3 12 1 52 collision resolution scheme 
under which 58 would have 
4 24 : B been stored, search the next 
5 38 3 table position, 8, which 
6 27 4 contains 24, then search the 
7 22 5 next table position, 9, which 
8 6 23 contains 58. 41 also hashes 
9 7 40 to location 7 in the table; pro- 
10 8 24 ceeding as before, locations 
9 58 8 and 9 are also checked, and 
10 do not contain 41. The next 
11 location to check is 10, which 
is empty. Therefore 41 is not 
Me in the table. 
13 
14 14 
15 


15. 
17. 


19. 
21. 


2З. 
25. 


27. 
29. 
31. 
33. 
35. 


37. 


39. 


41. 
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a. 1/t b. 2/t c. 3/t 

a. ALLS WELL THAT ENDS WELL 

b. TWAS BRILLIG AND THE SLITHY TOVES DID GYRE AND GIMBLE IN THE WABE 

c. COL MUSTARD WITH THE KNIFE IN THE LIBRARY 

k = 9, SLEEP NOW WE MARCH ON ROME TOMORROW 

а. x = 10011 (5-bit string), р = х mod 2*= 00011, q= p:2 = 00110, s = x p = 10000, 
t = s: 274 = 00001,y = q + = 00111 

b. х = 0011 (4-bit string), p=x mod 2? = 0011,9 =р:2 = 0110,5 = х@р = 0000, 
t = 527° = 0000, у = q + t = 0110 


1010001111101010 
а. а= 3 
b. 8° mod 15 = 8°: 8 mod 15 = 64: 8 mod 15 = 4-8 mod 15 = 32 mod 15 = 2 
с. 23 той 15 = 8 
а. d = 23 b. 12? mod 55 = 23 с. 2377 mod 55 = 12 
a. n(n — 1)/2 b. n 
a. X b. 7 
check digit = 8 
a. temp — number b. temp — 7426 
ones — temp mod 10 ones = 6 
temp — (temp — one)/10 temp — 742 
tens — temp mod 10 tens — 2 
temp — (temp — tens)/10 temp — 74 
hundreds — temp mod 10 hundreds = 4 
temp — (temp — hundreds)/10 temp — 7 
thousands = temp thousands = T 
If x = y (mod n) then x — у = kn where kis an integer. Therefore xc — ус = ken where kc is an integer, 


so xc = yc (mod n). 

a. f(ka) = (Ка) mod p maps each element of $ to a unique value in T. To show that fis one-to-one, sup- 
pose f(k,a) = f(k a) for some k and ky, 0 = k, ky = p — 1. Then ^а mod p = Ба mod p. By Prac- 
tice 43b, да = Ба (mod p). Because p is a prime number and a is not divisible by р, gcd (a, p) = 1, 
so by Exercise 38 (cancellation under congruence modulo л), k, = k, (mod p), or k, — k, = mp for 
some integer m. But —p < kı — k, < p so m = 0 and к = ky, so kya = ka and fis one-to-one. 

р. |S| = |T| = p and fis a one-to-one function. Therefore each of the p distinct elements of S map to dis- 
tinct elements of T, making each of the p elements of T the image of an element of S. 

с. [a:2a::: (p — 1)а]тойр = [(a mod p): (2a mod p) ··:((р = 1)а)тойр| mod p by Equation 
(1) of this section. Because f is an onto function, the set 7 is the set of all residues modulo p of 
the elements of S. Because 0 mod р = 0, the set {1,2,---,(p — 1)} is the set of all residues 
modulo p of the elements 1(a,2a,:::,(p — 1)а}. Therefore [(a mod p) : (2a mod p): :- 
((p = 1)а) mod p] mod p = [1-2---(p — D)] mod p = (p — 1)! mod p. 

d. From part (c), [aP (p — 1)!] mod p = (p — 1)! mod p or, by Practice 43b, а (p — 1)! = 
(p — 1)! (mod p). Because р is a prime, gcd((p — 1)!, p) = 1, so by Exercise 38 (cancellation under 
congruence modulo п), а?! = 1 (mod p). 

e. (1,2,3,4,5,6) 

. 4f mod 7 = (4? · 4°)mod 7 = 64 · 64 mod 7 = 1: 1 mod 7 = 1 

a.d : e = | mod g(n) means d : e = 1 mod (р — 1)(9 — 1), so that d: e = 1 + Кр — 1)(9 — 1) for 
some integer k. Then Т = T! * «e-D-D = ТТК 06-0) = Т? 1470 or (Тч OTD, 


==, 
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b. If T is not divisible by p, then 77 ! = 1 (mod p) by Fermat’s little theorem (Exercise 39). 
T" mod p = 7(7°- **-? mod p = [T(T?- (T?-).- «(T^^ )] mod p = [(T mod p): 1: 1: : 1] mod p 
by Equation (1) and Practice 435 = T mod p so that Т“ = T (mod p) by Practice 43b. Similarly if T 
is not divisible Бу q, then 77^! = 1(mod q). Г“ той д = T(T17 "P mod q and T” = T(mod q). 

c. If p|Tthen T = kp for some integer k and T“ — T = (kp) — kp = р(р°- К — k) where 
p^ k^ — k is an integer, so T^ = T (mod p). If p|T and q|T, then T is a multiple of p and T is a 
multiple of g. Because p and q are primes, Т = cpq for some integer c, which is a contradiction because 
T<pq =n. 

d. The proof is very similar to part c. 

e. T^ = T (mod р),Т° = T (mod q) and p and q are relatively prime. This matches the pattern of the Chi- 
nese remainder theorem where a, = a; = T and x = T”. Butalso, T = T (mod p) and T = T(mod q). 
By the Chinese remainder theorem, T” = Т (той рд), or Т = T (mod). By Practice 43b, 
T" mod п = T mod n = T because T < n. 


EXERCISES 5.7 


1. 2, -4 
3.x =ly=3,z=-2,w=4 
6 | Р | —4 E 
5. а.|0 3 —12 2 
5 3 14 —17 
—2 7 е. | 2 9 
b.|-2 —3 9 1 
1 5 
12 3 6 
c. |18 —3 15 
3 9 6 
10 7 21 -—23 
7. a.| -2 —4 с. |33 —44 
30 8 11 1 
. 28 4 
b. not possible d | 6 5| 
i SAGIT | n | 
18 —3 4 -7 
b. A(B: C) = (А-В)С = | ns |i 
102 —84 


САВА € 7 A-B AC 7| 


42 -—35 
d(A+BC=A-C+B-C=| | 


32 —28 


11. 


13. 


15. 


17. 


19. 


21. 


23. 


25. 
27. 
29. 
31. 
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Both A · (B: С) and (A : B): С can be computed and will result in an n. X m matrix. The i, j element of 
А · (B: C) looks like 


p p 
> а„(В i C); = = 2, a(S b ueu) = albii + Бус) кекен bic) tees e 
5= 1] 


5 = 


aj (bici; F bc» + se oe by Ly = (aj bj, F apb + sp а; pop Ci „жаен 


(anbi, + anba + + абс, z 2. (San) p (A: B)4c;; 


k=1\s=1 
which is the i, j element of (А. B) · C. 
a. Assume that row i of A is all Os. Then for any j, the element in row i, column j of A · B is given by 


> aby. This sum is 0 because а, = 0 for all values of k. 
k=l 
b. Assume that column j of B is all Os. Then for any i, the element in row i, column j of А. B is given by 


n 


ko аР. This sum is 0 because by = 0 for all values of k. 


1 6 
а.А'=|3 -2 
4 1 


b. If A is symmetric then a; = aj; and Ai, j) = AQ, i) = A(i, j). Therefore A’ = A. If AT = A, then 
A(i, j) = A'(i, j) = AQ, i) and A is symmetric. 

c. (AP)? = A follows from the definition - two interchanges of row and column gets back to the original. 

d. Let А + B = C. Then C'(i, j) = C(j, i) = AG, i) + BG, i) = A'(i,j) + B'(i,j) and C’ = А? + В”. 

e. Let A beann X тан a X p matrix; then A’ ism X nand B' is p X m.LetA-B = C. 


Then ci, j) = C(j, i) = X a Ры = X A'(k, j) B'(i, k) — P B/(i, ЮАТ(К, j) = = (Bl. ATi, j) and 
C = В”. AT, 7 7 


А | | 1 || 1 1 l | 
r exam = 
ИИИ эу. фу eed 0 0 


This is not always true (for example, use the A and B of Practice 52). It is true if = B = I, for example. 


n n 
The i, j entry of A? is У aja, The j,i entry of A? is У apay. But these are the same because a, = ay and 
k=1 k=l 


ay = aj, (A is symmetric). 


B р 1] [FQ) FQ) ‚_ [FE +1) F(k) 
Forn = І, А = | 1 = ES mo, Assume that A^ = | БЮ Fik- D |; Then 
ns |“ +1) | { _ | +) + Е) FR + | _ 

Е Е F(A) F(k-— 1) 0] [FŒ +F- 1) F(A] 


F(k +2) F(k + 1) 

| + 1) eI 

(rAY1/r)A ! = r(1/r(A- A ) = П = Тапа (1/A (rA) = (1/n(r(A A) = WE 
х= 6,у = —1 

х= 1,у = 6,2 = –2 

9/14, у = 2/7, 2 = 31/14 


E 
Il 


33. 
35. 


37. 


39. 


41. 


43. 
45. 


47. 


49. 


51. 
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x= 5,y = —3,z=4,w =2. 
For example, 
x+2y=3 
4x+y=19 
3x —y = 16 
1 2 3 
The augmented matrix is | 4 1 19 |. The following elementary row operations 
3 —1 16 
a ( 1 2 3 
3 4 1 19 
3 —1 16 
result in 
1 2 3 
0 cw T 
0 —7 7 
Because row 2 of the augmented matrix is the sum of rows 1 and 3, the new rows 2 and 3 represent the 
same equation, so this actually is a system of 2 equations in 2 unknowns. Solving —7y = 7 gives y = —1, 


and solving x + 2y = 3 or x + 2(—1) = 3 gives x = 5. 


Let g = the amount of gold in cubic centimeters, and c = the amount of copper in cubic centimeters. 
Then c + g = 52 and 9c + 19.3g = 859.4. The solution is g = 38 сс, c = 14 cc. 


The percentage of copper by volume is 14/52 = 26.9%. 


МЕЗЕ @ 2 
1/2 -1/A 
pin y Ав МВВ 
24/10 —1/10 50 y 
sox = 20, y = 50. 
x=6,y=-1 
x=l1ly=6,z=-2 
1 0 0] 1 0 1] 1 0 1] 1 1 1] 
AAB=|0 1 O/AVB=/i 1 1 AXB-^|1 1 1/BXA=]1 1 1 
0 1 1] 11 1] 11 1j 1 1 1j 
0 1 0] 0 1 1 0 0 1] 1 0 1] 
АЛВ= |0 0 1/AVB=|1 0 1 АхВв= |1 1 1/BXA=]/0 0 1 
0 0 0 10 1 1 0 0 0 1 0 


In order for A V В = A ^ B, it must be the case that а; V b; = а; ^ by for all i, j. This is true if 
dj = by = lora; = Ьу = 0, therefore when A = B. 


п(п +1) 
2 


. a. The rows in the augmented matrix as it is being transformed are of length n + 1, n, n — 1,...3. (The 


augmented matrix is size n X (n + 1) and the next to last row is the last for which such a multiplication 
is required in order to zero out position n,n — 1 in the last row.) Each non-zero element in each row 
must be multiplied by the scalar value, requiring a total of (n + 1) + n + (n — 1) +: + 3 which is 


TV _,. (a + Dn +2) 
(n+ 1)+n+4+(n—1)4+--++3 +2 +41 (2 + 1) multiplications, which is 2 3. 
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b. To zero out the first column, a multiple of the first row (length п + 1) must be added to each row below 
it, requiring (л + 1) multiplications and (л + 1) additions for n — 1 rows. To zero out the second col- 
umn, a multiple of the second row (length л) must be added to each row below it, requiring n multiplica- 
tions and n additions for n — 2 rows. The last transformation requires a multiple of the next to last row 
(length 3) to be added to the 1 row below it, requiring 3 multiplications and 3 additions for 1 row. The 
totals are (n + 1)(n — 1) + (n)(n — 2) +--+ + (3)(1) for both multiplications and additions, and this 

(n — DoQ(n = 1) + 7) _ 2n? + Зи? — 5n 

6 6 


expression (from Exercise 11 in Section 2.2) equals 


c. To solve the equations from bottom to top requires 


Row n: C,,X, = d, 1 multiplication 

Row n — 1: С, рун) * Сони = d,-, 1 multiplication, 1 addition, 1 multiplication 

Row n — 2: €-2(n-23-2 F C(n—2yn—1%n—-1 * C(n—2)%n = @„-; 2 multiplications, 2 additions, 
1 multiplication 


Row 1: Cnt + соо + +++ + eux, = dy (n — 1) multiplications, (n — 1) additions, 1 multiplication 
nin + 1 

for a total of 1+2 +3 +... +и = ( ШОКК КУТ е 1) = 

(n — Dn - 


— — —— additions. 
2 additions 


d. The transformation requires O(n”) + 20(т?) operations, and the resulting equation solving requires 
20(п?) operations, so the overall order of magnitude is Ө(т?). 


CHAPTER 6 


EXERCISES 6.1 


1. g(a) = (1,2), g(6) = (1,3), g(c) = (2,3), (0) = (2,2) 
3. a. b. For example, с. 


7. a. For example, 


b. Does not exist; the node of degree 4 would have to have arcs going to 4 distinct other nodes because no 
loops or parallel arcs are allowed, but there are not 4 distinct other nodes. 


11. 


13. 
15. 
17. 
19. 
21. 


a. 


b. 
с. 
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. Does not exist; in such a graph, the sum of all the degrees would be 11, but the sum of all the degrees 


is the total number of arc ends, which must be twice the number of arcs, 1.e., an even number. 


Because everyone in a department presumably knows someone in the same department, this would 
mean that no one in the IT department knows anyone in the marketing department (and vice versa). 
Carl and Fletcher are not acquainted. SiuYin is acquainted only with Carl. 

2 


For example: a. star — idol — statue — sculpture 


b. 
с. 


burden — load — weight — influence 
piano — upright — moral — significance 


(b), because there is no node of degree 0. 


fii 1> a2 5,3 0,4 d, fi d, — er d; — e7, Oy > €; A4 — £j, 05 — ез, Ag — E4, A7 — е» 


f 


1-2 2,2 > 4,3 > Б, 4-е, 5c 


Not isomorphic; graph in (b) has a node of degree 5, graph in (a) does not. 


a. 
b. 


е 


e 


d 


g. 


There cannot be a bijection between the two node sets if they are not the same size. 

For isomorphic graphs there is a bijection from one arc set to the other, either explicitly or, in the case 
of simple graphs, implicitly by means of the endpoints; this cannot happen if the arc sets are not the 
same size. 


. If the graphs are isomorphic and arcs a, and a; in one graph both have endpoints x—y, then their image 


arcs in the second graph must have the same endpoints, which cannot happen if the second graph has 
no parallel arcs. 


. If the graphs are isomorphic and an arc in one graph has endpoints x—x, then its image arc in the second 


graph must have endpoints (х) х), which is not possible if the second graph has no loops. 


. If the graphs are isomorphic and a node of degree k in one graph serves as an endpoint to k arcs, its 


image in the second graph must serve as an endpoint to the images of those К arcs, which implies it will 
have degree k also. 

If the graphs are isomorphic and if there is a path лп, a), 715, a5, ... ‚п, between two nodes in one graph, 
then fn), Kai), Am), а), ... Ary) is a path in the second graph. Two nodes in the second graph are 
the images of nodes in the first graph; if the first graph is connected, there is a path between these nodes 
and hence there is a path between the two nodes in the second graph. 

By the answer to part f, in isomorphic graphs paths map to paths, so cycles map to cycles. 


n(n — l) 


23. 4 graphs: е 7 A i 25. HT ae = C(n,2) 


27. If we can draw the graph with arcs that intersect only at nodes, then it is a planar graph. 


К, 3: 


29. 
31. 


33. 


35. 


37. 


43. 


49. 
51. 
53. 


5 
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The proof for Euler’s formula does not depend on the graph being simple, so the result still holds for non- 
simple graphs, but this is not true for inequalities (2) and (3). 


Planar 


Nonplanar—subgraph here can be obtained from К; ; by elementary subdivisions 


моо н н 


Se =e = н н 
аа) 


оон н о 


ооо н м 


oo оон нс 


м мо но о н 


=- = мо о о н 


4 


он ооо к= c 
— ооо но о 


м © о н н но 


41. 


0 
0 
0 
0 


O =- =. O =. KF Cc 
оо о н 
= OF о 
он н о 


The graph consists of n disconnected nodes with a loop at each node. 


The n X n matrix with 0s down the main diagonal and 15 elsewhere. 


1 


M—- 1 => 2 — 5 e-—— 5 | ө 

ШЕШ Е ——- 2 _-—— 3 _—= 4 5 |e 
-——= 2 -> 4 | e 

+ 2 +—>| 3 | e 

-——- 1 -—— 1 ——- 2 | e 
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—M— 7 


M— 7 


M— 7 


-—— 6 


6 


6 


—— 4 


=--> 5 


55. 


57. 


59. 


36 


с. 


Ь. 16 


65. 


63. 


Pointer 


10 


Node 


61. 


10 


11 


67. 


69. 


YI: 


73. 


75. 


77. 


79. 


81. 


83. 
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By the definition of isomorphic graphs, nodes x-y are adjacent in G, if and only if their images are adja- 
cent in О». Therefore nodes are not adjacent in С, (and therefore are adjacent in С!) if and only if their 
images are not adjacent in С, (and therefore are adjacent in С»). Thus the same function f makes the 
complement graphs isomorphic. 


If G is not connected then G consists of two or more connected subgraphs that have no paths between 
them. Let x and y be distinct nodes. If x and y are in different subgraphs, there is no x-y arc in С; hence 
there is an x—y arc in G', and a path exists from x to y in G'. If x and y are in the same subgraph, then pick 
a node z in a different subgraph. There is an arc x-z in G” and an arc z-y in G’, hence there is a path from 
xtoyin G'. 

The matrix for G' will have 1s where А had 0s and Os where A had 1s except for diagonal elements, which 
remain Os. 


The maximum number of arcs occurs in a complete graph; the maximum is C(n,2) = n(n — 1)/2, there- 


fore a = n(n — 1)/2 or 2a = п? — n. 


Let С be a simple graph with n nodes, n = 2, and m arcs, m > C(n — 1,2) = (n — 1)(n — 2)/2, 
and suppose that G is not connected. By Exercise 69, G' is connected. By Exercise 74, the number 
of arcs in G' is at least n — 1. Therefore the number т of arcs in G is (the number of arcs in a com- 
plete graph) — (the number of arcs in С”) = n(n — 1)/2 — (the number of arcs in G') = 
n(n = 1/2 — (n = 1) = (n — 1)(п/2 — 1) = (n — 1)(n — 2)/2 which is a contradiction. 

At least three colors are required because of the overlapping boundaries. Once the following assignment 
has been made, the country marked А must be a third color: 


Three colors are sufficient: 


The four-color conjecture is equivalent to the statement that the chromatic number for any simple, con- 
nected, planar graph is at most 4. 


The proof is by mathematical induction on the number of nodes in the graph. For the basis step of the 
induction process, it is clear that five colors are sufficient if the number of nodes is less than or equal to 
5. Now assume that any simple, connected, planar graph with = k nodes can be colored with five colors, 
and consider such a graph with k + 1 nodes. We can assume that k + 1 is at least 6 because 5 or fewer 
nodes are taken care of. By Exercise 82, at least one node л of the graph has degree less than or equal 
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to 5; temporarily removing п (and its adjoining arcs) from the graph will leave a collection of one or more 
simple, connected, planar subgraphs, each with по more than k nodes (Figures a and b). By the inductive 
hypothesis, each subgraph has a coloring with no more than five colors (use the same palette of five col- 
ors for each subgraph). Now look at the original graph again. If n has degree less than 5 or if the 5 nodes 
adjacent to n do not use five different colors, there is a fifth color left to use for n. Thus, we assume that n 
is adjacent to 5 nodes, n, п), n3, пд, and n5, arranged clockwise around n and colored, respectively, colors 
1, 2, 3, 4, and 5 (Figure c). 


One subgraph Two subgraphs 
remaining when remaining when 
n is removed nis removed 


(a) (b) (с) 


Now pick out all the nodes in the graph colored 1 or 3. Suppose there is no path, using just these nodes, 
between n, and n;. Then, as far as nodes colored 1 and 3 are concerned, there are two separate sections of 
graph, one section containing n, and one containing 755. In the section containing n, interchange colors 1 
and 3 on all the nodes. Doing this does not violate the (proper) coloring of the subgraphs, it colors n, with 
3, and it leaves color 1 for n. Now suppose there is a path between л, and n, using only nodes colored 1 
or 3. In this case we pick out all nodes in the original graph colored 2 or 4. Is there a path, using just these 
nodes, between n, and n4? No, there is not. Because of the arrangement of nodes 71, m, n3, n4, and ns, 
such a path would have to cross the path connecting n, and n4. Because the graph is planar, these two paths 
would have to meet at a node, which would then be colored 1 or 3 from the n;—n, path and 2 or 4 from the 
пупа path, an impossibility. Thus, there is no path using only nodes colored 2 or 4 between n, and n4, and 
we can rearrange colors as in the previous case. This completes the proof. 


85. four; three 


EXERCISES 6.2 


1. a. Yes, it is a tree. Put the root at the top. b. Not a tree because there is a cycle. 


r 


c. Yes, it is a tree. Put the root at the top d. Yes, it is a tree. "Shake down" the lower branches. 
and drop down all the branches. 


AN 


r 


13. 


17. 


3. a. Лх, b. 
Left child | Right child 
1 2 3 
2 0 4 
3 5 6 
4 7 0 
5 0 0 
6 0 0 
7 0 0 
Name | Left child | Right child 
1 All 0 2 
2 Gaul 3 4 
3 | divided 0 0 
4 is 5 6 
5 into 0 0 
6 three 7 0 
7 рап< 0 0 
Left child | Right sibling 
1 2 0 
2 5 3 
3 8 4 
4 9 0 
5 0 6 
6 0 7 
7 0 0 
8 0 0 
9 0 10 
10 0 11 
11 0 0 
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11. 1 
2 3 
4 
5 
6 7 
15. 1 
2 3 
4 5 6 
b. 


19. 
21. 
2З. 
25. 
27. 
29. 
31. 
33. 


37. 


39. 


41. 


43. 


45. 


47. 


49. 
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preorder: a b d e h f c g, inorder: d b h e fa g c, postorder: dhefbgca 
preorder: a be cfjgd hi, inorder: ebajfcghdi, postorder:ebjfgchida 
preorder: a b c e fd g h, inorder: e c fb g d h a, postorder:efcghdba 

prefix: +/3 4 — 2 y, postfix: 3 4 /2у – + 

infix: ((2 + 3) * (6 * x)) — 7, postfix: 23 + 6x «* 7 — 

prefix: + *4 — 7 х2, infix: (4 * (7 — x) + 2 


10 
а 35. фа 
c 
b b 
d e ё 
а 


If the root has no left child and no right child, return 0 as the height, else invoke the algorithm on the left 
child if it exists, invoke the algorithm on the right child if it exists, return the maximum of those two val- 
ues plus 1. 


Consider a simple graph that is a nonrooted tree. A tree is an acyclic and connected graph, so for any two 
nodes x and y, a path from x to y exists. If the path is not unique, then the two paths diverge at some node 
n, and converge at some node л», and there is a cycle from n, through n, and back to л, which is a contra- 
diction. Now consider a simple graph that has a unique path between any two nodes. The graph is clearly 
connected. Also, there are no cycles because the presence of a cycle produces a nonunique path between 
two nodes on the cycle. The graph is thus acyclic and connected and is a nonrooted tree. 

If С is a nonrooted tree, then С is connected. Suppose we remove an arc a between n, and n, and С re- 
mains connected. Then there is a path from n, to n,. Adding a to this path results in a cycle from n; to л, 
which contradicts the definition of a tree. On the other hand, suppose G is connected and removing any 
single arc makes G unconnected. If G is not a tree, then it contains a cycle. If a single arc is removed from 
the cycle, the graph is still connected because any path that made use of the removed arc can use the rest 
of the cycle instead. This is a contradiction, so С 15 a nonrooted tree. 


Proof is by induction on d. For d = 0, the only node is the root, and 2° = 1. Assume that there are at most 


27 nodes at depth d, and consider depth d + 1. There are at most two children for each node at depth d, so 
the maximum number of nodes at depth d + 1 is 2-24 = 2%*!, 


a.7 nodes b. 15 nodes с. 2+1 1 


а. Ina full binary tree, all internal nodes have two children, so the total number of “children nodes” is 2x; 
the only “non-child” node is the root, so there is a total of 2x + 1 nodes. 

b. From part a, there are 2x + 1 total nodes, x of which are internal, leaving 2x + 1 — x = x + 1 leaves. 

c. Consider a full binary tree with n nodes; let x be the number of internal nodes. From parta, n = 2x + 1. 
Therefore, х = (n — 1)/2. From part b, the number of leaves = x + 1 = (n — 1)/2 + 1 = (n + 1)/2. 

By Exercise 45, a full binary tree of height h — 1 has 2" — 1 nodes. When n = 2", this is the begin- 

ning of level h. The height Л remains the same until n = 2^*!, when it increases by 1. Therefore for 

2^ < n < 2^*!, the height of the tree remains the same and is given by h = |log n]. 
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51. 2 

53. a. There is only one binary tree with one node, so B(1) = 1. For a binary tree with n nodes, n > 1, 
the “shape” of the tree is determined by the “shape” of the left and right subtrees; the two subtrees 
have a total of n — 1 nodes. Let the left subtree have k nodes; the right subtree then has n — 1 — k 
nodes; k can range from 0 to n — 1. For each value of k, there are B(k) ways to form the left sub- 
tree, then B(n — 1 — k) ways to form the right subtree, so by the multiplication principle, there are 
B(k)B(n — 1 — k) different trees. 

n-l n 


b. B(0) = 1, (1) = 1, B(n) = Y В(ЮВ(п – 1 – k), = Y B(k — DB(n — k) 
к=0 k=] 


which is the same as the Catalan sequence, so by Exercise 97 of Section 4.4, 


1 
В(п) = ne C(2n, n) 


c. B(3) = 5. The 5 distinct binary trees are 


САРА 


а. В(6) = 132 
"AA, 
57. a 
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EXERCISES 6.3 


3. x: L[4] 


x«L[1] L[1] L[2] L[3] LA L[S] L[6] x> 7] 
eC Oy SxS SSxXexS 
L(2] L[3] ША] L[5] М6] LIT] 


5. x: L[6] 7. a. 9 b.average & 2.83 
5 

x: L[1] 

х L[6] 2 4 
x: L[5] 
x«L[] < : Ы 

x: Ц2] 

LIS] < x < 116] depth = 3 = 1 + |log 7| 


x: ЦА] 
Ц] « x « L[2] 


x: L[3] 
L[4] = x < 1/5] 


Ц2]1<х< L[3] L[3]}<x<Lf[4] 


depth = 6; algorithm is not optimal 
because 6 > 1 + log|6| = 3 


918 Answers to Odd-Numbered Exercises 


9. a. 3 b. For example: g, d, а, k, i, s 11. To 


not that 


is question е 
be is not or question that the To 
13. a. 5 b. 16 c. 45 
15. a. 5 b.2 с. lLor2L 124.5  4Lor5L 
lle d 


17. a. 4 b.2 €. 1L 2G 12 2L, 1G 
or 1G, 2H_ Nl 2G, 1H 


1:3 
12 X 3 43 2 1 X 


This problem (because we do not have to decide whether the counterfeit coin is heavy or light) can also 
be done with a binary tree of depth 2: 


21. 2*(1 + |logn)). 


X 4H 3L X 3H 4L 
23. a. log n! = log[(n)(n — 1)(n — 2) ··· (2)(1)] = log n + log(n — 1) + log(n — 2) +--+ + log2 
log 1 = logn + logn + logn +---+ logn forn = 1 = п1орп 
b. log л! = log[(n)(n — 1)(n — 2) ::- (2)(1)] = logn + log(n — 1) + log(n — 2) +--+ + log 2 
log 1 = logn + log(n — 1) + *:: + log[n/2] = log[n/2]  log[n/2] + +++  log[n/2] = [n/2] 


logi n/2] = (=)ioa( 2) = ӨТ — log2) = ӨТ —1)= eT = (3) = 
ӨТ | ӨТ (2) = Ө | MUT = 2) 2 ӨТ because 


log n = 2 forn = 4 
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EXERCISES 6.4 


1. No, because the code for т, 01, is a prefix of the code for d, 011. 
3. a. ooue b. iaou с. eee 

5. a. (pw)a b. paw c. ((a)) 

7. a-0101, 5-011, c-10, d-11 


b. c-11, d-01, g-1001, m-00, r-1000, z-101 


or b. a-01 a-01 
z-100 z-100 
t-101 or 1-00 
е-11 е-11 
с-00 с-101 
12-2 154 
13. а. 1754 b. 82-01 
664-0001 
955 327-001 

349-11 

416 446 509 423-10 

(82) (423) (349) 389-0000 


212 

(327) 
74 97 

(389) (664) 


15. a. 85,000 bytes b. 34000 bytes 
17. One of several possibilities: s-000, 4-001, a-01, 1-100, c-101, e-11 
19. a. One possible Huffman code is 


B-110100 0-01 5-1000 
C-11101 1-101 6-010 
G-1101100 2-1001 7-1100 
R-110101 3-011 8-110111 
S-1101101 4-1111 9-11100 


b. The new file takes about 44 percent of the space of the original file. 
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CHAPTER 7 


EXERCISES 7.1 


1 0 0 
1А= |10 1 
ото 
р = {0, 1), (2, 1), @ 3), (3, 2)} 
3. 1 2 3 
4 5 
p = {(1, 4), (2, 5), (4, 2), (5, 3), (5, 4)} 
00011 1 
000000 
РЕ i a- 000000 " . 
000000 0 2 6 
0000 0 0 2 
01 101 0 
4 5 6 


9. For every pair of nodes a and b, if there is an arc from a to b, then there is also an arc from b to a. 


11. The graph can be written as a "star" with node 1 at the center; i.e., 1 is adjacent to every node and every 
node is adjacent to 1, but no other nodes are adjacent. For example, with n — 5: 


17 | | 
2 рос 3 2 poo 3 


21. R will have all 1 entries. 


13. No node has a loop. 


1 0 ! 1 0 0 
15.pUoc:|1 1 1 pMa:|1 0 0 
0 0 1 0 0 1 


=i 
(o 
х, 
ll 

[« = нм 

O N н 

Re W N 
® 
Il 
— 
= 
— 
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m S a 0 1 1 1]! 1 
PON 10 0 D 1 1 i 4 
BERI. ул 25.R-|1 1 1 27R=|0 0 0 0 0 
DA 1 1 ] 0111 1 
D f 1.1 1 
04 4 t i 1 
TIE от 
?.R-|l0 0 9 0 0 0 31 В= |1 1 | 
i Ё 1 
00000 0 
0 1 1 0 1 0 
3 x x3 0 1 1 1 1 1 
TERE 00000 0 
3.R-|0 0 0 0 0 dre oe 
E ae "UT Jo 0 0 0 0 d) 
"D 000 0 о 0 
0 1 101 0 


37. Transitive closure = {(1, 2), (1, 3), (2, 2), (2, 3), (3, 2), (3, 3)} 
39. a. Add (2, 1) and (3, 2) to p to get the transitive closure. 
b. p is its own transitive closure. 
c. Add (1, 1), (2, 1), (2, 2), (3, 3) to p to get the transitive closure. 
d. p is its own transitive closure. 


41. 


n 


43. A?[ij] = У apay. If a term such as ара» in this sum is 0, then either aj. = 0 or a5; = 0 (or both) and 
k=1 


there is either по path of length 1 from n, to n, or no path of length 1 from n, to л; (or both). Thus there 
are no paths of length 2 from n; to л; passing through n. If aja), ~ 0, then ap = p and ay; = q, where p 
and q are positive integers. Then there are p paths of length 1 from n; to n, and q paths of length 1 from 
n, to n;. By the multiplication principle, there are pq possible paths of length 2 from л; to n; through n3. 
By the addition principle, the sum of all such terms gives all possible paths of length 2 from n; to n;. 


45.3; A? — 


ооо c 
ооо c 
оо о WwW 
ооо о 
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EXERCISES 7.2 


1. No, four nodes of degree 3 

3. No odd nodes, so yes; such a path can start at any node and will end there. For example, 1-2-6-3-1-4-6-5-1 

5. No, four nodes of odd degree 

7. Two odd nodes, 1 and 3, so yes; such a path must begin at one odd node and end at the other. For example, 
1-4-5-1-2-5-6-2-3-6-7-3 

9. No, six nodes of odd degree 

11. No, four nodes of odd degree 


0 1 0 1 1 0 0 
AMNES D 49 1 19 
mas pu oo or di 
ia 99 9. 9 ED aur oW e de d our а 
b db ou 4 110101 0 
Lea) m d 0 1 1.0 1 0 1 
EX RII 0010010 
total after row 2 is 0 i=8 
17. Aconnected directed graph will have an Euler path if and only if (a) All nodes have out-degree equal to in- 


degree or (b) One node, n,, has out-degree one greater than in-degree and another node, л 
one greater than out-degree. 


has in-degree 

19. Every node has equal in-degree and out-degree, so a path exists from an arbitrary node back to that node. 
For example, 2-4-2-3-1-2. 

21. No 

23. yes; for example, 1-4-2-6-3-5-1 

25. yes; for example, 1-2-3-7-6-5-4-1 

27. yes; for example, 6-5-8-7-3-4- 1-2-6 

29. Any two nodes must be part of the Hamiltonian circuit; therefore there is a path between them, namely, 
that part of the circuit that is between them. 


31. a. (n — 1)" bh.(u-—1(n-22"-! с. (n — I)! d. 14! seconds or about 24.22 hours. 
33.a. n = 20rn = any odd number. b. п> 2 
35. Such a graph is a chain, so just pick a node and then walk around the chain back to the starting node. 


37. a. Consider each node of G in turn. At each node add as many new arcs as possible without creating a 
circuit. This process terminates because the complete graph with n nodes would contain a circuit. 
b. His not a complete graph or it would contain a Hamiltonian circuit. Therefore there must be two nodes 
р and д that are not adjacent in Н, but adding arc p-q would complete a Hamiltonian circuit. Therefore 
there is a Hamiltonian path beginning at node p and ending at node q. 
. If both these arcs exist in H, then H has a Hamiltonian circuit, as follows: x; p, X2, X3, ..., X;—15 Qs 
Xy 1) Xg-2» X 
d. Nodes p and q are not adjacent (otherwise there would be a Hamiltonian circuit), so the only nodes that 
can be adjacent to p or q (with at most one arc) are the n — 2 nodes x, 2 = i = n — 1. From part (с), 
for each such node х;, if p is adjacent to x; then q is not adjacent to х;— |, so the total of the two degrees 
cannot exceed n — 2. 


е 
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е. H was constructed from G by adding additional arcs, so for any node its degree in G is = its degree in 
H. Using this fact together with the result of part (d), degree(p) + degree(q) < n in graph С. 

f. Nodes р and q are not adjacent, yet degree(p) + degree(q) « n. This contradicts condition (2), so the 
assumption that G does not have a Hamiltonian circuit is wrong. 


EXERCISES 7.3 


4 IN = {2} p = 8,1N = {2,7,3,8} 
1 6| 7 |8 1 з|4|5|6 |7 
а|з|Оо|2|®|®|® 1] а|з|о|2|з|з|в6в |1 |2 
$ 21-12 2 2 2 2 2 $ 2 = [д 3 8 7 2 
р =7,1М= {2,7} p = 5, IN = {2, 7, 3, 8, 5} 
1 4 5 7 8 1 3 4 5 6 7 
d 3 0 2 оо оо 6 1 2 а 3 0 2 3 3 6 1 2 
si2|-:.2121217:21|7 5121-1213 181712 


path: 2, 7, 8, 5 distance = 3 
p = 3,1N = {2,7,3} 


1 2 3 4 5 6 7 8 
d 3 0 2 3 oo 6 
S 2 — 2 3 2 7 2 
3. IN = {1} p = 3,IN = {1, 6,2,7,3} 
1 5 6 7 1 3 4 6 
d 0 3 5 оо 8 оо 0 d 0 5 6 1 5 
$ — 1 1 1 1 1 1 S = 1 1 3 1 2 
р = 6,IN = {1,6} р = 8, IN = {1, 6, 2, 7, 3, 8} 
1 4 5 6 7 8 1 3 4 5 6 7 
d 0 3 5 7 1 6 d 0 3 5 6 6 1 5 
$ — 1 1 1 6 1 6 1 Е 1 1 3 8 1 2 
р = 2, IN = {1, 6, 2} р = 5, IN = {1, 6, 2, 7, 3, 8, 5} 
1 4 5 6 7 8 1 2 3 4 5 6 7 
d 0 3 5 7 1 d 0 3 5 6 6 1 5 
$ — 1 1 1 6 1 2 1 S = 1 1 3 8 1 2 
p =7,IN = {1,6,2,7} path: 1, 2, 7,8,5 distance = 6 
1 2 4 5 6 8 
d 0 3 5 7 1 
$ Е 1 1 1 6 1 2 
5. ІМ = {а} р = b, IN = (a, b) 
а b с а е f [a |b |c |d je |f 
а|0|1|3|®|ж}| о а|о|1|2|®||2 
з == а а аса s LJ r4 la lal Ь 
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p = c, IN = {a,b,c} р = e, IN = {a, b, c, f, e} 
a b с а е а b с а е f 
d 0 1 2 4 6 d 0 1 2 4 3 2 
$ = а b C C S | = | а | b | С | f | b 
ath: a, b, f, e distance = 3 
p=fIN= {a,b,c,f} р 5 
а b с а е 
а 0 1 2 4 3 
$ = а b С f 
IN = {1} p = 5,IN = {1,2, 6,3,5} 
1 2 3 4 5 1 3 4 6 
d 0 2 3 d 0 2 3 4 3 2 
$ — 1 1 1 S — 1 2 3 1 
p=2,IN= {1, 2} p=4,IN= {1, 2,6,3, 5,4} 
1 2 3 5 1 2 3 6 
d 0 2 3 3 d 0 2 3 3 2 
$ — 1 2 1 1 S — 1 2 3 1 
р = 6,IN = {1,2,6} р = 7,1№ = {1,2,6,3,5,4,7} 
1 3 5 1 2 3 4 5 6 
d 0 2 3 3 d 0 2 3 4 3 2 
$ — 1 2 1 1 $ — 1 2 3 1 


p=3,IN= Сув) 


1 213 |4 [5 
d 02 з | 4 3 
S == 11 2 3 1 


path: 1,6,7 distance = 5 


9. a. Change the condition on the while loop to continue until all nodes are in IN. Also, rather than writing 
out a particular shortest path, make d and s output parameters that carry the information about shortest 
paths and their distances. 


11. 


b. No 
2 3 4 5 
d 3 0 2 о 
$ 2 — 2 2 2 
(1) 
2 3 4 5 
d 3 0 2 3 11 
S 2 — 2 3 1 


1 3 6 |7]8 

d | 3 Oo | 2 3 3 | 4 1 2 

S 2; js 2 3 8 1 2 7 
(3) 


No further changes in d ог s. Agrees with 
Exercise | for 2 to 5 
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13. 1 2 3 4 5 6 1 2 3 4 5 6 7 
d 0 2 3 2 оо а 0 2 3 4 3 2 5 
s |- ties ЕЕ = аг |за ч 16 
(1) (3) 
No further changes in d ог 5. 
L З.Ф. | eee Agrees with Exercise 7 for | to 7 
d 0 2 3 3 2 5 
s|- 2|1|1|\1 | еэ 
(2) 
15. Initial A and after k = x: after k = 1 and k = 2: after К = 3 andk = y: 
X 1 2 3 y X 1 2 3 y X 1 2 3 y 
x |0 1 o 4 oo x |0 1 4 2 6 x |0 1 4 2 5 
1 |1 0 3 1 5 1 |1 0 3 1 5 1 |1 0 3 1 4 
2 |% 3 0 2 2 2 |4 3 0 2 2 2 |4 3 0 2 2 
3 |4 1 2 0 3 3 |2 1 2 0 3 3 |2 1 2 0 3 
y|e 5 2 3 0 у|6 5 2 з 0 yl5 4 2 3 0 


17. IN = {1, 6, 2, 7, 8, 5, 3, 4} 


19. IN = {1, 4, 7, 3, 6, 5, 2, 8} 
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25. 2 3 3 


5 1 4 
The shortest path from 1 to 5 is 1—5 with dis- 
tance 5. If the algorithm added the node closest 
to IN at each step, it would choose the path 1-2- 
3-4-5 with distance 10. 


29. Dijkstra’s algorithm is Ө(и?) in the worst case, which is when all nodes are brought into IN. This is the 
situation to find the distance from the start node to any other node. Repeating this process with all n 
nodes, in turn, as the start node would result in an algorithm of order пӨ(и?) = @(n°). Floyd’s algorithm 
is clearly @(n*) because of the nested for loops. Therefore the algorithms are the same order of magnitude. 
Although Floyd's algorithm has the advantage of simplicity to code, that is more than balanced out by the 
fact that Floyd's algorithm does not give the actual shortest paths. 


The solution is to find a minimal 
spanning tree for the graph, as 
shown here. 


EXERCISES 7.4 


1. abcefdhgji 
3.dabcefhgji 
5.ebacfdhgji 
7. abcfjgdehki 
9. fcabdehkigj 
lLabcdegfhji 
13.dafbcehgij 
15.ebcfgadhji 
17 арсаеѓећһіј к 
19.fcjabgdehik 
21abcegdfh 


23. fb 

25.abcdeghf 

27 fb 

29. B 31. Begin a dfs atnodea:acfgebd 

b(2) d(1) 
A C 
а(7) 
G D 
c(6) e(3) 
F 
AS) 804) 


33. Because of the recursion, a stack would Бе a good data structure. 


35. Suppose the depth-first search has visited node x and has moved on to visit node y. The algorithm looks for 
nodes adjacent to y that are unvisited. If a visited node (other than y's “parent” x) is on y’s adjacency list, 
then the graph contains a cycle. For example, in Figure 7.13, when the recursive algorithm is invoked on 
node g, node d is a previously visited node that is adjacent to g but not the parent node of g (which is f"). 


This situation detects the cycle d-g-/-4. 
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EXERCISES 7.5 


1. b 


e 


back arcs: a—c, a—e, b-d, с-е 
7 a(1,1) b(Q,3) 


c(2, 2) 


4 
44, 4) e(5, 8) 


4 4 
fa, 7) &(6, ff) 


articulation points: c, d 


9. 41,1) b(4, 4) 


2 
cQ, 2) d(3,B) 


2 
e(5, B) 
articulation points: c, d 


articulation points: none 


e f g h 
back arcs: a-d, a-f 


e h 


back arcs: е-й 


biconnected components 


S d 


biconnected components 


a b 
| а 
ктр 
е 


biconnected components 


b 
a С 
һ а 
g е 
7 
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CHAPTER 8 


EXERCISES 8.1 


1 +/0 1 a a 0 1 a a 
0 0 1 a a’ 0 0 0 0 0 
1 1 1 1 1 1 0 1 a a 
a a 1 a 1 а 0 а а 0 
a'|a' 1 1 a' a'|0 а 0 а 
3. a. max(x, y) = max(y, x), min(x, y) = min(y, x), max(max(x, y), Z) = max(x, max(y, z)) = max(x, y, Z), 
min(min(x, y), z) = min(x, min(y, Z)) = min(x, у, 2), max(x, min(y, z)) = min(max(x, y), max(x, z)), 
min(x, max(y, z)) = max(min(x, y), min(x, z)). The last two can be shown by taking cases: x < y < z, 
y «€ x < z, etc. 
b. Let m be the 0 element. Then we must have max(x, m) = x for all x € Z. But for x = m — 1, 
max(m — l,m) — m. 
5. a. 16 
b. (fi + f2)(0, 0) = 1 Cfi 0,0) = 1 £1 (0,0) = 0 


(fi + 90,1) = 1 Cfi f2)(0, 1) = 0 40,1) = 1 
(Л + 1,0) = 1 (fi JA, 0) = 0 f'A, 0) = 0 
(Л + AC, 1) = 0 (fi fol, 1) = 0 Лорел 


. + апа аге binary operations on В, ' is a unary operation on В. Max and min аге commutative and 


associative operations; the distributive laws follow by considering different cases for the values of 

ЛО, у), f(x, y), and f(x, у) for a fixed (x, y) є S?. For any fin В and (x, y) in $, (f + 0)(x, y) = max 
(f(x, у), 00, y)) = max( f(x, у), 0) = f(x, у), and (f 1)(х, у) = min( f(x, у), 16, y)) = 

min( f(x, y), 1) = f(x, y). Also (f + f"), y) = max( f(x, y), f œ y) = Land (ff), y) = 
min( f (x, y), f '(x, y)) = 0 because one value of the pair ( f (x, у), f ' (x, y)) is 1 and the other is 0. 


7. Show that x acts like the complement of x', that is, that it satisfies properties 5a and 5b with respect to x'. 


х'+х=х+х' (1a) 
= 1 (5а) 

апа 

xx = хх! (1b) 
ev (5b) 

Therefore x = (x’) by the theorem on the uniqueness of complements. 

9. a. x + (x: y) 
=x-ltx-y (4b) 
= x(1 + y) (3b) 
= x(y + 1) (la) 
= х1 (universal bound) 
=x (4b) 
x: (x + y) = x by duality 

b. x: [y + œ 2)] 
=x-ytx-(x-z) (3b) 
—xcycí(x:x)z (2b) 
=x yptxr-z (dual of idempotent) 


x + [ус (+ 2] = (x + y): (x + 2) by duality 


11. 


е 


= 


e 


(xtyrG ty 
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= (у + х)-(у+х') 
=y+(x:x') 
=y+0 
= J 
(х= у) + (x' у) = y by duality 
(х + (у-2))' 
=x yz) 
=x (у +2’) 
= ху +x’ z 
(x: (y  z)! = (х + y): (x' + 2’) by duality 
„ (x +y)-@t 1) 
(х+у):х + (х + у) 1 
= х:(х+у) + (х + у) 1 
= (х.х) + (ху) + (х + у) 1 
= х + (ху) + (х + у) 1 
=x + y) + (к + у) 
= (х=у)+х+(х+у) 
= (ху) + (Рә) 
= (ху +х+у 
= х + (ху) + у 


(xy) + (x:0) = х: (x + y): y by duality 


xt (хур + х ту)! 
== х! 


х ух) 
= х + (у(х + х))' 
-a (Ga) 
= х + (у: 1)! 
=х+у , 
(6) 2 C072 
-G0:2) 672) 
= (09) +yz 
= (0-2) 2) + 02:1 
= (y:z): (x + 1) 
= (y:z):1 
= yz 
ху tyas 
= y xF yir 
-» Ex) 
=p 
=y-@+1) 
=y:xty-l 
= ух + у 
=хсу+у 
(х + у) zt ny 
—— зуи qeu very 
Еу та ау 
—4 ur +y) 
— tyre) 
= х2] 
—Xx'"z 


(la) 
(За) 
(5b) 
(4a) 


(De Morgan’s laws) 
(De Morgan’s laws) 
(3b) 


(3b) 

(1b) 

(3b) 

(dual of idempotent) 
(4b) 

(la) 

(2a) 

(idempotent) 

(la) 


(1b) 
Gb) 
(la) 
(5a) 
(4b) 


(2b) 
(1b) 
(4b) 
(3b) 
(universal bound) 
(4b) 


(1b) 
(3b) 
(5a) 
(universal bound) 
(3b) 
(4b) 
(1b) 


(De Morgan's laws) 
(1b) 
Gb) 
(la) 
(5a) 
(4b) 
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е. (х-У)-+—(у+')+ (2) = (ху): 1 + (уз )+1-+ (х 2) 1 (4b) 
= (ху) Etz) +z) æ + х') + (х'-2)-(у + у”) (5а) 
SHAY oS PR таре ee узы ч" атату чи ау (3b) 
= хуа д zy tysz5xtx yz +x zy tyes! (1a) 
= уак ty zx tx z yta y cx oper dex oper! (1b) 
= z æ Px yt (х-2')-(у+ у) FO oy) e Fz) (3b) 
= (у 2) :1 + (х2) 1+ y) (5а) 
= (0) + rz!) + (У-у) (4b) 
= (x'*y) + Oz) + œz’) (1a) 


13. This is an “if and only if" problem, so there are two things to prove. 
a. Let x — 0. Then 


Py Pe tyes 0 tx (x = 0) 
EM RON (1b) 
=O+x'-y (dual of universal 
bound) 
=x'-y+0 (1a) 
= х’. у (4а) 
= 1:у (Practice 4) 
ey] (1b) 
= у (4b) 
b. Let x-y’ + x': y = y. Then 
xx x xx (letting y in the 
hypothesis have the 
value x) 
xox’ +x-x' =x (1b) 
0+0=x (Sb) 
0=x (4a) 
15. а. x © у 
= ху yx (definition of @) 
=yx' +x-y’ (1a) 
=y@x (definition of ®) 
b.x x 
= yx + xx (definition of ©) 
=0+0 (Sb) 
= (4a) 
с. 00x 
= Pig a Sil (definition of ®) 
Ен (1b) 
=0+x-0' (dual of universal 
bound) 
= fae ee] (Practice 4) 
=O+x (4b) 
=x+0 (la) 
= (4а) 
d.1@x 
= dy Жа? (definition of @) 
ee (1b) 
=x +x-l’ (4b) 
=x'+x-0 (Practice 4) 
=x +0 (dual of universal 
bound) 


=y (4a) 


| Answers to Odd-Numbered Exercises 931 


17. Suppose x + 0, = x for all x В. Then 0 + 0, = 0 and 0, + 0 = 0, so 0, = 0+0=0 + 0, = 0 
and 0, = 0. Then 1 = 0’, so 1 is unique by the theorem on uniqueness of complements. 


19. a. | b. c. 16 


LAY 


21. a. (i) bijection (п) for x, y E S, f(x: y) = f(x) +f) 
b. Let f(0) = 5,/(1) = 7. Then /(0-0)=/(1)=7=5+5 = (0) + f(0,/(0:1) = (0) = 5 = 
StF =f) tf) Fed) =f) езет у), елет ет ET 
JU) FTD 
23. a. fi R > К". fis onto: given y є R*,letx = log y; then x e Rand f(x) = 2* = 2'°8” = y. fis one-to- 
one: if f(x) = f'(w) then 2* = 2" and (taking the log of both sides) x = w. 
b. for x,y € К, а(х + y) = gx): 20) 
c. fis a bijection from R to R* and for x, y є R, f(x + y) = 2**” = 2-2? = f) fy) 
d. / '()-logy 
e. f lisa bijection from R* to R and for any 
xyeR',f (ху) = logs: y) = logx + logy = f£ (x) +/ Цу) 
25. fi > {1,3}, h > {1,2} h} > (,2,3,5 > (1.2.4.4 9 {1, 3, 4}, 5, > {2,3,4}, f (2,3), 
fo > {2, 4}, fio m {1, 4}, fi > {3, 4} 
27. a. For any y € b, y = f(x) for some х є B. Then y & f (0) = f(x) & / (0) = f(x + 0) = f(x) = y, and 
/ (0) = Фф because the zero element in any Boolean algebra is unique (see Exercise 17). 
b. f) = 707) = [F0)]" = p" = + 
29. a. i.lfx < у, then x < уапіх < x,soxisa lower bound ofx and y. If w* < x and w* < у, then w* < x,so 
xis a greatest lower boundandx = x-y.Ifx = х · y, then x isa greatest lower bound ofx and y, so x < у. 
ii. Similar to 1. 
b. i. Let x + y = z. Thenz is a least upper bound of x and y, which is a least upper bound of y and x, so 
z = y + x. ii. Similar to i. iii. Let (x + y) + z = p and x + (y + z) = q. Then y Xx + y < p and 
z < psopisan upper bound for y and z; because y + z is the least upper bound for y and z, y + z < p. 
Also x < x + y < p. Therefore p is an upper bound for x and y + z, and q < p because д is the least 
upper bound for x and y + z. Similarly p < q,so р = q. iv. Similiar to ш. 
с.х + 0 = х e 0 < x, which is true because 0 is a least element. x: 1 = x «€» x < 1, which is true 
because 1 is a greatest element. 
d. (a) no—no least element, (b) yes, (c) yes, (d) no—not distributive: 2 + (3:4) = 2 + 1 = 2 and 
(2 + 3):(2 + 4) = 5:5 = 5. Also, both 3 and 4 are complements of 2, so complements are not 
unique. 


EXERCISES 8.2 
1. 


* 
X 
№ 
X 
© 


Xi + х)х xi 
(x; 2) Хз " | xi tx | 
X sep id 2 ES 2 + ху)хз 


»—————` 


о = O = 0-02 
O = O = OOO = 


11. 
13. 
15. 
17. 


19. 
21. 
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х{ | X2 | хо + Oy Xp) x —] >] Xp) 
1 1 0 X Oe (и) + (a, x)’ 
1 0 1 x Z i » e x [ 
0 1 1 i EL 
0 0 1 
хро + x) 7. (хро) (0 + x3) 
Xv | % | F(x,, х2) Хү х2 X3 f(X1; X2, Хз) 
1 1 1 1 1 1 0 
1 0 1 1 1 0 0 
0 1 0 1 0 1 0 
0 0 1 1 0 0 1 
0 1 1 1 
0 1 0 1 
0 0 1 0 
0 0 0 1 
а b. 
X — ху! 
Ee xy’ + ух! 


0 0 0 


c. The truth function for the network is the same as part (a). The network illustrates “х ОК у” and “МОТ 
both x AND y” 


Xy 


хүхохз + XXX; 


XXX "ОХХХ + XXX} 


XyXa3X3X4 + xixa a + yA, + axax aa + X Whang, + XIX2X3XÀ 


XQX3X3X4 + XQX3XqX4 + рох EXIXA AXA + xxx, + ХХА 
а. хүхэху + ХүХ5ХА 
b. x, —————— 


ye 
Аз 


А Ў , 
Xq X3 X3 + X4 X5X3 


x OY 
X2 


к Рес 


€. XjX3X5 + XxxX = XXX + xqx$x5 = xqx$(x; + x5) = ху l = хх; 


x x1 x3 
%3 
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23.a. х; X2 X3 f(x4, Xa Хз) b. хухәхз + хурхо; + AXXa F XXX 
1 1 1 1 
1 1 (0) (0) 
1 0 1 1 
1 0 0 0 
0 1 1 1 
0 1 0 1 
0 0 1 0 
0 0 0 0 
с. хх; + Хх) = (хуз + Х\)(Хүх; + ху) = (х + хуз) + xs) = 
(х xi + хз)(х› + ху)(х› + x3) = 69 + х) + x) + x)6o + x) = 
(х x3)(x, + x;)G6 + хз) = Qa + х))(х\| + x3)660 + x3) 
25. a. (xi + x)(x + х)( + х5) 
b. (xi xj, + ху) 
c. (x X + xg) + ху + xy) + x, + ox)6 + XQ + х;)(ху + x) + xy + XQ + x) 
d. (x х + xs)Gxi + ху + x) t x» + әз) + x) + x3) 
е. (x х + x5) + x5 + x) + ху + x36 + ху + x96 + x) + ху) 
27. а 1100 b. 1001 e 001 
0100 0111 11 
(1)0000 (1)0000 (1)000 
29. a. ү 
X3 — *3 Ху ХХ] + xxi 
b. хх + x + x3 = хрз + xx + x3 = x3 + x») + Хз = Qo) + x3 = (((хух))'х)')хз)” 
Хр ——————————ә i ((x3 X) xp : (((x3 х)! x)! хз)! 
à seul” 
Хз 
or, alternatively, x4x, + x4x, + x3 = хх + x3 + xix = x, + xS: 1 + xy, = xy, + 1) + 
X3 = XZ°1 + х = xj + хх = Ga) 
x <D (any)! D Gs dg) 
ар" | 
23 
31. а. 
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b. xj = he (x + Xp)! = x1 x5 ne (105) = x, + 
25 


м 
ot 
(x1 + 2x5)! = хро 
"EU 


33. X | X; | f(x, X2) XjX5 x X X) 
1 1 0 свален 
1 0 1 
0 1 0 
0 0 0 


35. x, = neutral, x, = park,x; = seatbelt (x, + хо); x — 
X X; Xs f(x, X», Хз) » Qa a X)xX3 
Хз 121. ) | 


O2 0202 0ad42 
O O O = O н 


37. a. Let the two input lines be ху and x,, and the selector line be s. The truth function is 


Xo X s Ңхо, X4, S) 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 0 0 1 
0 1 1 1 
0 1 0 0 
0 0 1 0 
0 0 0 0 


TO 


X|———— 


39. x, = pressure(1 when pressure > 50 psi, otherwise 0) 
x, = salinity (1 when salinity > 45 g/L, otherwise 0) 
ху = temperature (1 when temperature > 53°C, otherwise 0) 
x4 = acidity (1 when acidity < 7.0 pH, otherwise 0) 


Ww 
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The output for each valve should be 1 when the valve is to open, 0 otherwise. The canonical sum-of- 
products forms are 


A= XqX2X3X4 71 
В = xy5xx, 4 


F X4X3X3X4 Я 
Ш X X3X3X4 q 


F хүхохуха + AXA 
F XIXX3X4 


Using these forms, the circuit for А would require 2 inverters (one for x, and one for x4, assuming we 
split the output from an inverter into more than one gate), 4 AND gates, and 1 OR gate; B would require 
2 inverters, 3 AND gates, and 1 OR gate. It is possible to write simpler equivalent expressions. 


EXERCISES 8.3 


15. 


17. 


19. 


21. 
23. 
25. 


27. 


29. 


Х\Хз + xix} + xix or xix + уху + 023 


-X3 X 


XQX3x4  xqxax 


, А , 
Xyx4 F XXX T 
. XX2 + XX3 


.XQX4 F Xia X3 


4 0х4 + 


ахча 


XoX4X4 + Хоз + X|X4 


хх) 


F x5x3x4 OF xqx5x4 + xixax4 + ЭХХ» 


XjX3 c х5 + Хр OF XyXz + Хх + xix, 


x, 
Xa 


xj 
x 
x 
X2 


or 


хох + xqjx4x, Here the don’t care at хүх»хух has been treated as a 1, as has the don’t care at x}xix4x4; 
the don't care at x,x»x4x4 has been ignored. 


X3 + х 


XQX4X4 + xQX4Xx3 + XiX% ОХХ 


H KAN, + xQxqxa d ууа + Хуа 


or 


XQXAxax4 t ухаа + ХХХ + xix 


хү + xjx4 + 
or 
хү + ХА + 
ог 


Xj? + xjx4 + 


хүхз c xjxax3 + ХХ 


XQX3X4 + хрх 


хоху T ХХХ 


PG 


Pau 
Х2Х3Х4 T XX3 
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CHAPTER 9 


EXERCISES 9.1 


1. 


11. 


13. 
15. 


17. 
19. 


a. not commutative, not associative 
b. The completed table is 
рда r s 
plpluw r s 
ара r S p commutative 
rr S p q 
sl s p q r 
a. associative, not commutative b. commutative, not associative c. neither 
d. both e. commutative, not associative 
a. semigroup _ Ь. попе 2 с. попе 
а. monoid; i = 1 + 0V2 e. group; i = 1 + 0v2 f. group; i = 1 
a. group; 7 = zero polynomial а. group; і = 1 
b. none 0 e. group; i = 0 
c. group; i = | 0 i f. monoid; 7 = function mapping every x to 0 
Ë Ri R R Fy Р Fs 
Ri | R В; R Ез F Р 
R: | R Ry R Fo Fg Fi 
Rs | R Ro R Fy F Fe 
Fy | Fo Fg Fy, R, R, В; 
Fa | Ез Fy Fa Po Аз Ry 
Ез | Fy Fp Ез Ry Re Re 


Identity element is R}; inverse for F, is Е; inverse for R, is R}. 


a. No—not the same operation 


b. No—zero polynomial (identity) does not belong to P; also, closure does not hold. 


c. No—not every element of Z* has an inverse in Z* 


d. 


Yes 


Hoy, +12], [Ziz +l, [{0, 2, 4, 6, 8, 10}, +12], [{0, 4, 8}, Fjal [{0, 3; 6, 9}, +12], [{0, 6}, +0] 
= і, a = (1,2) ° (3, 4), a; = (1, 3) ° (2, 4), оц = (1, 4) ° (2, 3), о; = (1, 3) ° (1, 2), 
= (1, 2) ° (1, 3), o = (1, 3) Š (1, 4), Os = (1, 4) ° (1, 2), Qo = (1, 4) 9 (1, 3), | = (1, 2) 5 (1, 4), 


а] 


as 


ay, = (2, 4) ° (2,3), а = (2,3) ° (2,4) 


a. 


12 
c. Yes; f: 52 > 12Z, f(x) = z7 


No 


b. No 
a. Yes; f: Z — 12Z, f(x) = 12x 
b. No; Z; is finite, 5Z is infinite 


c. Yes, but not an isomorphism 


d. No; both sets have 6 elements, but [,53, ° | is noncommutative, [Z,, +6] is commutative. 


21. 


23. 
25. 


27. 


29. 


3 


— 


33. 


35. 
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1 z l w l wtz 0 : NES ; ah 
a. Closure: о illo 1l" |o | e №М(2). Matrix multiplication is associative. 


1 


"Hl вао о MI 
0 11 |0 | p whic elongs to M5(Z). 


1 0 
Г | є М®(2). The inverse of | 


b. fis a bijection and 


s TE nien Det) 
A l)e) t | 


_1 _ 1 2 4 1 3 1 2 . ] 3 Е I 5 1 5 = 
ш @ =|) амл OF | 1 [ | lo ва i= 


a.iisisoi=T' hor rk "=, = Gr 


а. хрх because i-x-i = х: і! : 


=x-i=x. If xpy then for some g€G, g:.x:g =у or 

gx-y:gox-g !-y-g=(g ):y:(g ) !soypx.Ifx p yand y pz then for some g;,g; € G, 
£x'g; | =yandgy-y-g | —zsogogwx:gi ^g; = zor (g: gi): x (g g) | = zandx pz. 
b. Suppose Gis commutative and y є [х]. Thenforsomeg € С, у = g:.x:g |=x-g:g =x i= x. 
Thus [x] = {x}. Conversely, suppose [х] = (x) for each x € G. Let x, y є С, and denote the element 


y:x:y !byz.Thenxpz,soz = xandy:x:y | = xor y:x — ху. 


a. ij = ij ig = ip SO i, = ір and this element is an identity in [S, · ]. 


b. For example, .| 4 b 
aja b 
bla b 
с. For example, .| a b 
а|а а 
bib b 


d. For example, [R*, +) 

Let x € 5 with left inverse y. Then y € S, so let z be the left inverse of y. Then x-y= 
ip(x:y)— (zy) (ху) = 2: (ух) у= ip у = z:y = іу, so y is also a right inverse of x. Also, 
хў = х: (ух) = (wy) x = i, x = x, soi; is also a right identity in S and therefore an identity. 


. For some fixed a є S, let x, be the solution to x- a = a. Let b be any element of S. Then a: x = b for 


somexeSandx,:b = х: (а: x) = (xy a): x = a: x = b. Therefore x, is a left identity in S. Also, for 
any b € S, there is an x such that x: b = x,; hence every element of S has a left inverse. Result follows 
from Exercise 29. 

IfGiscommutative,then (x: yp = (x-y):(x:y) = х: (ух) у= х (ху) у = (хх): (уту) = x: y. 
For the converse, letx, y e G;thenx:y:x:y = x: x: y: y,andby left and right cancellation, y : x = х · у, 
so G is commutative. 

Closure: let x, y € B}. Then (x-y) = x*-y (because of commutativity) = i-i =i, so x: y € B, 
Identity: i* = i, so i є B,. Inverses: for x € B, (х Df = (х)! 2i! =i, sox! € B}. 


37. a. SOT C G. Closure: for x, y e SMT, x: y € S because of closure in S, xy € T because of closure 


in T, so x:y € SMT. Identity: i € S and i € T soi € SMT. Inverses: for x e SOT, x ! € S and 
x 'eTsox 'eSnrT. 

b. No. For example, [{0, 4, 8}, +12] and [{0, 6}, + |] are subgroups of [Z;;, +12] but [{0, 4, 6, 8}, +15] 
is not a subgroup of [2,›, +12] (not closed). 


39. 


41. 


43. 


45. 


47. 
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a. Closure: let f, g є H,. Then (f° g)(a) = f(g(a)) = f(a) = a,so f° g € H,. Identity: the identity map- 
ping on A maps a to a. Inverses: let f € H,. Then Да) = aso / (а) = a, and f ! € H,. 

b. (n — 1)! 

а. Letx = d^, у = a? є А. Then x: y`! = а: (а) = а: (а ly? = a^^? є A. By Exercise 40, A is 
a subgroup. 


h o2 20,952, 9 23244254,2—52'—1,2'—343,2' 5 

с. 5 = 0,5! = 5,52 = 5 +5 = 3, 5 = 1,5 = 6,5? = 4,59 =2 

d. 3° = 0, 3! = 3,32 = 3 +,3 = 2, 3 = 

a. [Aut(S),°] is closed because composition of isomorphisms is an isomorphism (Practice 31). Associa- 


tivity always holds for function composition. The identity function is; is an automorphism on S. Finally, 
if fis an automorphism on 5, so is f. 


b. 00 00 ЭК: 
lol 13 i i f 
1: f: | 
2—2 2—2 flr i 
33 3-—] 
Let ig and i; denote the identity elements of С and H, respectively. Let f be an isomorphism, f: С > Н. 


Then f(ig) = ig and because fis one-to-one, iç is the only such element. Now let Бе a homomorphism 
from С onto H; then f (iç) = ig. Suppose ig is the only such element, and let f(g,) = f (e2) for gj, & E С. 


Then f(gi:g7 ) = f(g)-f(g ) = f(g): (/(&3) = f(g): C(g) | = ін. Therefore g: g3' = ig 
and g, = ig: g; = g2. Thus fis one-to-one; fis already an onto homomorphism, so it is an isomorphism. 


a. The + operation is a binary operation on Е (well-defined and closed). Associativity holds because 
(x] + Dp lg] — [x +y] ll = lee») +z] = le нае Ix] Dy 2] — [x] 4+ 
(Lv] + [z]). [0] is the identity because [x] + [0] = [x + 0] = [x] and [0] + [x] = [0 + x] = [x]. 
Each element has an inverse: [x] + [~x] = [x + (—x)] = [0] = [7x] + [x]. Commutativity holds 
because [x] + [y] = [x + y] = [y + x] = [y] + [x] 

b. The function f: Z; > E; given by f(0) = [0], fC) = [1], £202 [2], ®© = [3], A = [4] 
is a bijection. It is also a homomorphism. For x, y elements of 25, /(х+у)=[х+у]= 
Ix] + [у] 2 £69 + fo») 

c. The inverse of [10] is [—10] = [4]. The preimage of [21] is 7. 


EXERCISES 9.2 


1. 


3. 
5. 


7. 
9: 


f(G) is closed: Let f(x) and f(y) be elements іп (С). Then because f is a homomorphism, 

f(x) + fv) = f(x- y). Because x and y belong to С, х · y belongs to С and f(x: y) is an element of 

f (G). ig belongs to f(G): Let f(x) be an element in f(G). Then f(x) + f(ig) = f(x: ig) = f(x) and 

f (ig) + f(x) = f (ig: x) = f(x), so f (ig) is an identity for the subset f (G) and therefore f (ig) = їн. Ele- 

ments in f (G) have inverses in f (G): Let f(x) be an element in f (G). Then x belongs to С and х! exists 

in С. Then f(x !) + f(x) = f(x !:x) = / (ig) = ig. Similarly f(x) + f(x7') = ig, so f(x !) = —f(x) 

and f(x) has an inverse in f (G). 

К = 42 

а. /((х,у) + (s) = f(x + ‚у +5) = (к ++) + (+5) = (у) + (и + 5) = /(х, у) +f@s). 

b. К = {(х,—х)|х e Z}. 

7 +S = {7, 11, 3} 

a. H has no row ofall 0s and no two rows alike, so the minimum distance = 3 and the code is single-error 
correcting. 


11. 


13. 


15. 
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b. Н can encode all of Z3. 000 — 000000, 001 — 001101, 010 — 010011, 011 — 011110, 
100 — 100111, 101 — 101010, 110 — 110100, 111 — 111001 


For example, 


Let X = (x,,... 


H = 


ocoorooor Fe nme 
ooroorraooe 
онооно © =_ © 
— © © © н но н © © 


,X,) and Y = (у, ...,у,) be elements of Z5. The ith component of (X +, У) ·Н is 


given by(x, +2 у), Hi; +2 (0 +2 у) Hoj tz +, (х, +2,)*H,;. Because the distributive law holds 


and addition 


modulo 2 is commutative, this expression equals (xiH;; +2 x;H5; t5: · +2 x, Hj) +» 


Cy; Hi; +2 В, +2° + + +2 у,Н,,), which is the ith component of X: Н +Y- H. 


Coset leaders 
0000000 
0000001 
0000010 
0010000 
0000100 
0100000 
1000000 
0001000 


Syndromes 
000 
001 
010 
011 
100 
101 
110 
111 


17. The decoded word is 011000010101001. 


EXERCISES 9.3 
1. a. 0001111110 b. aaacaaaa c. 00100110 


3. 


Present state Next state Output 
Present input 
0 1 
5 5 5 0 А 
Ш 1 i Output is 010010 
51 Sy 51 1 
82 $5 50 0 
Present state Next state Output 
Present input 
0 1 
s E s a А 
? ; = Output is abbcbb 
51 $5 53 b 
582 $5 51 С 
53 59 53 b 
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Output is 0001101 


11. a. Q i С) b. 010100 
OO 


13. a. 00, 10, 01 b. 010010 


b. The length of time required to remember a given input grows without bound and eventually would 
exceed the number of states. 


ш; Ргор. 


not 
running 
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31. Once a state is revisited, behavior will be periodic because the input is always 0 and there is no choice of paths from 
a state. The maximum number of inputs that can occur before this happens is n — 1 (visiting all n states before 
repeating). The maximum length of a period is n (output from all n states, with the last state returning to sọ). 


33. 0* 

35. 01* V (110)* 

зт. (1 V 01)(01)* 

39. 10*1 

41. 1* V (010)* 

43.2. (0 V D)*1 — b. 1*01*(01*0)*1* е. 100*1 


45. a. Yes b. No c. No 

47. dd*(+ V —)dd* where d stands for any digit 

49. a. (00)* b. 111*0 c. 1*001* 

51. a. 0*10* b. 000(1 V 0)* с. (1 0)0(1 V 0)1(1 V 0)* 


53. a. Proof is by induction on the length of the regular expression. For the base step, if A = ©, A, or i, then 
A^ = Ø, A, or i. Assume that for all expressions of length = k, A regular > A” regular. Let A be a reg- 
ular expression of length k + 1. If A = BC, where В and C are regular, then B^ and С“ are regular by 
inductive hypothesis and A? = C^B^, so А“ is regular. Similarly, if А = B V C, then А = B^ V C? 
(regular), and if А = B*, then А = (B*)* (regular). 

b. No—no regular expression describes this set. 

55. beer, beter 

57. beter, better, bettter 

59. bit, but, beet 

61. bt 

63. s; 

65. А = {0}, В = {1,2,5}, С = (3,4, D = {6} 


Present state Next state Output 
Present input 
0 1 
А C D 1 
В C B 0 
C B A 1 
р C B 1 


67. A = {0}, B = {5}, C = {2}, D = {7, 8}, E = {1,3}, F = {4,6} 


Present state Next state Output 


Present input 
0 1 


yor о су ox 
w су © їч ah 
єч = ы у б Су 
m = © © O c 
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69. A = {0}, В = {2}, C = {1,4}, D = {3}, Е = {5} 


Present state Next state Output 
Present input 
0 1 
А C D 0 
B E С 0 
C B С 1 
р C B 2 
E C A 2 


71. A = {0,2},В = {1,3}, C = {4} 


Present state Next state Output 
Present input 
a b c 
A B C A 1 
В C A B 0 
C B A 


73. A = {0}, B = {2, 4}, C = {1,5}, D = (3) 


Present state Next state Output 
Present input 
0 1 

A D A 0 

B С: В 0 

C B D 1 

D А В 1 

75. Possible answer: | da, d, x(t) d,(t) d,(t) y(t)  dı(t+1) dt) 

So 0 0 0 0 0 0 1 0 
S4 0 1 1 0 0 0 1 1 
So 1 0 0 0 1 1 0 0 
S5 1 1 1 0 1 1 0 1 
0 1 0 0 0 1 
1 1 0 0 1 1 
0 1 1 1 0 1 
1 1 1 1 1 0 


4|@ + 1) = ха + xdid, + ха, 
4(@ + 1) = xd,d, + хаа + x'd,d> 4 


y(t) = did, + did, = d, 


хаа, == did, + xd, 
- xd,d; + х'а\й› = x(d, + Ф) + x'd, 
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d(t + 1) 


d(t + 1) 


dt) [—4 — dy(t-* 1) 


EXERCISES 9.4 


1. а. halts with final tape ·.. |р |о [о lo |0|0|Ь 
b. does not change the tape and moves forever to the left 


3. One answer: State 2 is a final state. 


(0, b, b, 2, R) blank tape or no more 1’s, go to final state 
(0, 1, 1, 1, А) has read odd number of 1’s 

(1, 1, 1,0, А) has read even number of 1’s 

5. One answer: State 3 is a final state. 

(0, 0, 0, 0, R) 
(0, 1, 1, 1, А) 
(1,0,0, 1, R) 
(1, 1, 1, 2, R) 


} pass over Os to first 1 


\ pass over 05 to second 1 


(2, b, b, 3, R) end of string, halt and accept 
7. One answer: State 9 is a final state 
(0, b, b, 9, К) accepts blank tape 


finds first 1, marks with X 


searches right for 2s 


pair of 2s, marks with Y's 


| searches left for Os 


11. 


13. 


eo 
NN 
v 


эме 
NI 

DDD 
Ry BPH 


Bd hd Dd Hd et ра 
AX 


EE 
м n 
pde 


У 
У 


У 


n 
У 


Xx 


оо чые OO oO WS 
Бы 
оо 

t 0 mm 


LON NN ON ODN ODN ODN m уле, м 


„ 
oo оо ~ 
ws 

wa МУ 


n 


(8, 
(8, b, b, 9, L) 


N 
N 
go 
2 


(0, 0, b, 1, R) 
(1,0,0, 1, R) 
(1, 1, 1, 1, R) 
(1, *, *, 2, R) 
(2, X, X, 2, В) 
(2, 1, 1, 8, А) 


la: 
d 
Nach v 


У 


—— 
NW 
SS 

> 
pe 
n 


* Хх 
Xx 
Sad 
S 
МУ 


BO 

-onoo же; 
- ооо = кюк 

OQ tA tan tA CO SH HY 


n 


У 
v 
У 
n 


У 
У 
m 
n 


У 
У 
У 


У 
У 
nm 
nm 


m 
У 
У 
m 


bg by by by ам 


ar 

~ 
SS SS хы чы ы 
— 


У 


©з» 


к= 
ххк 
“ow 
a 


У 


good 
2у ж 
DOO 


У 


У 
nm 
nm 
nm 


У 
v 
У 
nm 


У 
У 
У 
nm 


m 
m 
У 
m 


m 
M 
m 
m 


m 
У 
У 


У 
У 
У 
У 


v 
У 
У 
У 


v 
У 
У 
У 


v 
m 
У 
У 


DONO UM ss enn и и и и и и и и и и им им 
SONNNRF но оо © а а га © су су су сул щл о PHF 
A 
-* 
"9 
— 


отоо о о 
ooreseoor or ses 
= сю юг юк е SS ооо“ 
BROSDRR Be RRR: 


m 
m 
n 
m 


— aa 


| 


— 


—_—_— —„— 
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pair of 0з, marks with Z’s 


passes right to next 1 


no more ls 


no more 2s 


no more Os, halts and accepts 


. One answer: State 8 15 a final state 


0 read on left of w, 
moves right to * 


passes over X's 
nonzero on left of w,, halts and accepts 
left symbols match 


moves left to * 


finds leftmost symbol 


1 read on left of w, 


moves right to * 


passes over X^s 
non-one on left of w,, halts and accepts 
left symbols match 


word left of * is empty 


word right of * nonempty, halts and accepts 


w; initially empty 


changes 0 to 1 
changes 1 to 0 


passes center point 


adds 0 on right end 


adds 0 on left end 


returns to center point 
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15. The general idea is to decrement the binary number by 1; with each decrement, add a 1 to the string of 1s 


21. 


being built.! 

(0, 1, 1, 0, R) 
(0, 0, 0, 0, R) 
(0, 


m 
m 


== коз оге 
© кекҥе о со со С 
— & Б оо оо омм 
КЭККЭ 


У 


У 
m 
v 


У 
У 
У 


У 
У 
У 


У 
У 
nm 


n 
У 
У 


v os 
У 
m 

ur ee a ee SS SS SNS 


m 
У 


У 
nm 
У 
У 


n 
У 
У 
У 


У 
n 
m 
У 


У 
У 
У 
m 


m 
У 


m 
m 
m 
У 


У 
m 
nm 
У 


wm OS М7 "Ж oN SS М a 


У 


ро о О н = 
оо ммм су оол tA 
У у о ро ру У ро У У 


У 


К м и 
ою N AADU = 
— © © © - — © с © 


nm 
У 


fm, m, пз) = { 


One answer: 
(0, 1, 1, 1, А) 


> 


шты унны ee area у 
х Су Un л & > > ошо мо о 


| 
| 


| 
| 
| 
| 
| 
| 


n+ 1 
undefined if n, = 0 


finds least significant digit 


if least significant digit = 1, then decrement 


finds end of original string 


finds right end of new string 
writes 1 at end of new string 


goes back to original string 


if least significant digit = 0, change to 1 and look 
left for another 1 to decrement 

increment Os to the left 

just incremented the final remaining 0, clean up 
found a 1 to decrement 


prepare to move to end of original string 


the 1 just decremented was the most significant digit 
blank out the leading 0 
clean up and halt 


ifn, > 0 


0, add 1 and halt 


n 
n = 1, add additional 1 and halt 


n > 2, erase extra 15 and halt 


n > 0, finds end of n 


changes 1 to X, adds 1 at right end of string 


goes left to next 1 of n 


Му thanks to Alicia Kime of Fairmont State College and her student Tim Holmes for this solution. 
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n is doubled, changes X’s to 1’s 


1, 1, 7, R) 
(7, b, b, 8, L) | finds right end, erases extra 1, halts 
) 


23. One answer: 


(0, 1, b, 1, R) erases one extra 1 
(1, *, b, 3, R) п = 0 
(1, 1-5. 2, R) 
(2, 1, 1, 2, R) 
(2, *, 1, 3, R) 
25. One answer: 
(0, 1, 1, 0, R) 
(0, *, *, 0, А) 
(0, b, b, 1, L) 
(0, X, X, 1, L) 
(1, 1, X, 
(2, 1, 
(2, * 
(2, b, 
(2, X, 
(3, 1, 
(3, * 


n, > 0, replaces * with leftmost 1 of 7,, halts 


move to right end of 15 for n, 


2; X’s rightmost 1 of n, 
2, 
2, 
3; 


1, 
> *, 
b, 
X, 
X, 
*, X, n, < п 
X 


> 


write 0 on tape and halt 


1 move to left end of 15 for n,, X leftmost 1 


(1, 


all of n, used, now write n, — n; on tape 


X 
, b, 
X 
vil: E R) 
(5, b, b, 9, R) 
*, ж,б, i 
‚Х,Х,6 » 
, b, b, 7,1, | erase n 
, X, b, a 
(7, *, l, F 
8, L) 


clean up п, — n; and halt 


27. invoke 7, invoke T 


EXERCISES 9.5 


. L(G) = {A, a}; the grammar is type 0. 

. L(G) = 0(10)*; the grammar is regular. 

С = (V, Vr, S, P) where V = (a, S}, V; = {a}, and P = {S > А, S > a} 

. The grammar of Exercise 3 is already regular. 

. L(G) = (ab)* 

11. L(G) — aa*bb*. G is context-sensitive. An example of a regular grammar that generates L(G) is 
С' = (У,Ут,5,Р) where V = {а, Б, А, В, 5}, Уз = {a,b} and P= {S —aA,S — aB, А > aA, 
А — aB, В —> bB, B > b} 


on a © = 
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13.a. <S> ::= 0<A>|1<A>, <A> ::= 1<B><B>, <B> = 01] 11 
b. <S> ::= 0|0<4>, <A> = 1<B>, <B> ::= 0<4> |0 
c. <S> ::= 0 <S> | П<А4А>, <A> 2: = 1<A>]1 


15. а. 1*1(00)* 
р. S—> 1, S — 18,S— 04,4 —> 0, A — OB, B —^ 0A 


17. For example, С = (V, Vr, S, P) where V = (6G), S, V; = {(, )}, and P = {S > A, S > (S)S} 

19. For example, С = (V, Vr, S, P) where V = (a,b, A, B, S, Ут = (a, b}, and P = (S — A, S — аба, 
S bSb, S > a, S > b} 

21. For example, С = (V, Vr, S, P) where V = {0, 1, A, В, S}, V; = {0,1}, and P = {S > 14,4 1, 
A—1B8,B—50,B—51,B—0B,B 1B} 

23. For example, С = (V, Vr, S, P) where V = (1, S, V; = {1}, and P = {S > A, $ 115} 

25. For example, С = (V, Vr, S, P) where V = {0,1,5}, V; = {0,1}, and P = {S > A, $— А, А > 01, 
A — 081] 

27. For example, С = (V, Vr, S, P) where V = (0,1, S}, V; = {0, 1}, and P = {S > SS, S — 01, S 10, 
S — 051,5 > 150} 

29. For example, С = (V, V;, S, P) where V = (0, 5, A, B, X), V, = {0}, and P = (S А08, A0  400Х, 
ХО > 00Х, ХВ > B,A > А, В > A} 


31. For example, С = (V, Vn $, Р) where V = 10, 1, 5, 5, А, В, М}, V; = {0,1}, and P= {S А, 
S — 08,4, S > 15/3, S, > 054, S, > 1S,B, S, > М, MA — MO, MB > MI, M > A, 0A — A0, 
0B > ВО, 14 > 41,18 > B1} 


33. a. S 


"AA 
ALIA 
| | | | 


37. The set of productions for G' is formed from the set of productions for G as follows: For A and B nonter- 


minals, whenever А = B in Gand B > а isa production in С with |a| = 2, add the production A — a to 
the set, then eliminate all productions of the form A — B. For any productions of the form A — a, а € Vr, 
add to the set of productions those obtained by replacing any A on the right of an existing production by a, 
then eliminate all productions of the form A — a. Eliminate S — A. The remaining productions all have 
right sides with length > 2 and L(G') C L(G). Only A anda finite number of one-length words may have 
been eliminated, so L(G) — L(G’) is a finite set. 


Answers to Self-Tests 


CHAPTER 1 
SECTION 1.1 
1. Е Ifa statement is not a tautology, it does not have values that are all true, but that does not make them 
all false. 
2. T Because of the truth table for disjunction, ( ) V T is T. 
3. F The statement must have an implication as its main connective. 
4. T 
5. Е Тһе negation of a disjunction is the conjunction of the negations. 
SECTION 1.2 
т 
2. Е 
3. T 
4. Е Itis one in which hypothesis — conclusion is always true. 
Б. T 
SECTION 1.3 
1 Е Infact, (Vx)(P(x) ^ [P(x)]') would be false in all interpretations. 
2.T 
3. T 
4. F There is no one predicate wff defined on an interpretation, nor is the domain at all determined 
by truth values. 
5. T 
SECTION 1.4 
1T 
2. Е Existential instantiation should be used early. 
3. F Universal instantiation would only strip off the leading universal quantifier. 
4. Е М in propositional logic are not even wffs in predicate logic. 
5. T Predicate logic is correct—only valid wffs are provable. 
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SECTION 1.5 
1T 
2. Е A single negated predicate is only one kind of Horn clause. 
3. T 
4. Е AProlog recursive rule is not a rule of inference. 
5. T 
SECTION 1.6 


1. Е It guarantees only that the output satisfies certain conditions, given that the input satisfies certain 
conditions. 


2. F Nothing much can be said about the precondition without knowing the assignment, but at any rate the 
strict inequality will not go away. 


F Program testing involves test data sets. 


Gr pe" n 
— 


T 


CHAPTER 2 


SECTION 2.1 


1. Е А conjecture that only asserts something about a finite number of cases can be proved by proving all 
cases. 


3. Е А universal quantifier is understood, because the formal statement of the theorem 15 
(Vx)(x odd — 2 * x is even). 


4. F The second statement is the converse of the first, not the contrapositive. 


SECTION 2.2 

T 

Е Тһе basis step need not be n = 1. 

T 

T 

F This omits the first k — 1 terms of the series. 


Е O us 


SECTION 2.3 


1. Е А loop invariant remains true after loop termination. 
2. F It means that correctness has been proved only given that the loop terminates. 
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3. F The first principle of induction is used because the values at pass k + 1 through the loop depend only 
on the values at pass К. 
4. F But Q should give the desired result when the condition B’ is true. 
5. T (12 is the remainder when 42 is divided by 30). 
SECTION 2.4 
1T 
2. Е Тһе linear combination must equal 1. = 
3. Е For example, 36 = 2?-37, so the prime factors are 2 and 3, neither of which is > 6 = V36. 
4. Е For example, Ф(5) = 4, which is not a prime. 
5. T 
CHAPTER 3 
SECTION 3.1 
1. Т 
2. Е 18, 20, 22, 24 сап be generated, but пої, for example, 26. 
3. F They are valuable because they represent natural ways of thinking about certain problems, but they 
typically use more storage and perform more operations than a corresponding iterative program. 
4. T 
5. T 
SECTION 3.2 
1 Е Induction may be used to verify a proposed closed-form solution but not to determine such a solution 
from the recurrence relation. 
2. Е Itis not linear because of the presence of the S(n — 2) term. 
3. Е Itis a pattern for the general solution, but it is not a closed-form solution because of the summation; 
in any specific case, there must be a closed-form expression to evaluate the summation. 
4. Т 
5. Т 
SECTION 3.3 
1T 
2. F It requires m(n — m + 1) comparisons. 
3. T 
4. F The recursive version looks at the first item in the list, and if that is not the target, it searches the rest 
of the list. The “input size" goes down only by 1 each time the algorithm is invoked. 
5. T 
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CHAPTER 4 
SECTION 4.1 
1 F Itis not a proper subset of itself. 
2. T 
3. T 
4. F This is the closure property. 
5. Е It is a way to prove that certain sets are uncountable. 
SECTION 4.2 
ЕТ 
2. Е 
3. Т 
4. Т 
5. Е 
SECTION 4.3 
1. F 


2. Е Тһе number of elements in the union plus ће number of elements in ће intersection is the sum of the 
number of elements in each set. 


3. F All must be finite. 
4. F 
5. T 


SECTION 4.4 


T 

T 

.F Use C(n,r). 

Е Itisn!/n,!nj!n,!. 
T 


v RON 


SECTION 4.5 


F Combinations, not arrangements. 
T 

Е All terms are found in row и. 

T 

T 


плом гә 
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SECTION 4.6 
1T 
2.T 
3. Е This will work only if the events are independent. 
4. Е Arandom variable is not a variable; it is a function that assigns numerical values to the members of a 
sample space, and these values are generally not randomly chosen. 
5. T 
CHAPTER 5 
SECTION 5.1 
LT 
2. Е (х, х) can belong. 
3. T 
4. F The relation of equality is both a partial ordering and an equivalence relation. 
5. Е An equivalence relation does this. 
SECTION 5.2 
1. F 
2. 
3. F The converse is true. 
4. F The maximum value is used. 
5. F See Example 18 and Practice 19. 
SECTION 5.3 
1T 
2.T 
3. F 
4. T 
5. Е If the data satisfy data integrity to begin with, then data integrity will still be true after a delete. It is 


referential integrity that may be lost. 


SECTION 5.4 
1 Е Е тау not have an image for each member of the domain. 
2. Е Every element of the range has a preimage; begin with an element of the codomain. 
3, T 


4. 


T 
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5. F The original function must be a bijection. 


SECTION 5.5 
1. Е Other constants may work where these do not. 
2. F Either f= O(g) or f= o(g) 
3. T 
4.F 
5. T 
SECTION 5.6 
1T 
2. Е Ifthe hash table entry does not match the target value, then the collision resolution algorithm must be 
followed before the search can be deemed a failure. 
3. T 
4. F It derives from the difficulty of finding the prime factors of n. 
5. T 
SECTION 5.7 
t. T 
2. Е Тһе two products are not necessarily equal. 
3. T 
4. F 
5. Е See Practice 58. 
CHAPTER 6 
SECTION 6.1 
1. Е А complete graph has an arc between any two nodes. 
2. T 
3. F Aplanar graph could still be drawn with arcs that cross. 
4. F It means that nodes 2, 3, and 4 are all adjacent to some one node. 
5. Е It could be symmetric; it just doesn’t have to be. 
SECTION 6.2 
1T 
2. Е Acomplete (binary) tree has nothing to do with a complete graph. 
3. T 
4. T 
5. T 
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SECTION 6.3 
1T 
2. Е This is the worst case; other cases could require fewer comparisons. 
3. T 
4. F The binary search tree depends on the order in which data elements are inserted. 
5. F It must have at least n! leaves. 
SECTION 6.4 
1T 
2. F Ina prefix code, no code word is the prefix of another code word. 
3. F Characters that occur most frequently have the shortest strings, and frequency does not affect the 
number of Os versus the number of 15. 
>T 
5. F The code table file (giving the code for each character) must be stored along with the encoded file. 
CHAPTER 7 
SECTION 7.1 
1T 
2. T 
з.Е R-AVA v... vAO 
4. Е That's what В = A V AO V --- V A? does; Warshall's algorithm expands the set of nodes available 
for use on a path. 
5. Е  Warshall's algorithm computes transitive closure. 
SECTION 7.2 
1T 
2. F The graph can have at most two odd nodes. 
з. T 
4. F Some arcs may go unused. 
5. Е No efficient algorithm is known, but trial and error solves the problem. 
SECTION 7.3 
1 Е This is how Prim’s algorithm works. 
2. Е A greedy algorithm does not necessarily divide a problem into smaller subproblems, but it takes the 
"best" action based on limited knowledge at each step. 
3. T 
4. Т 
. F It will generally not form a tree at all. 
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SECTION 7.4 
1. Е The starting node for a depth-first search can be any node in the graph. 
2. T 
3. T 
4. Е It is the equivalent of depth-first search, assuming that sibling nodes are labeled in order from left to 


right. 


5. Е Usea succession of depth-first searches. 


SECTION 7.5 
1 Е Any path between some set of two nodes must pass through it. 
2. T 
3. T 
4. T 
5. Е The root is a special case. 


CHAPTER 8 


SECTION 8.1 
1F х+х'=1 
2. T 
3. T x+t(yt+tx: z = х + (xez + y) 
= x1 bees y) 
=: l+ xz) by 


= ee ap y 
= х.(+1)+у 
=х:1+у 
=х+у 
4. Т 
5. Т 
SECTION 8.2 
iT (кту) =x -y’ 
2. Е It has as many terms as the function has 1-values (or one term if the function has all 0-values). 
3. Е Only one half-adder is needed (see Figure 8.14a). 
4. Т Becausex +x =x 
5. F This usually results in an unnecessarily large number of devices; start again from the truth 


function. 
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SECTION 8.3 
1T 
2.T 
3. Е Look for two-square blocks first and work up from there. 
4. T 
5. F The term is essential if the check for that row is the only one in some column. 
CHAPTER 9 
SECTION 9.1 
1 F This describes a commutative operation. 
2. T Even though this is not the definition of the identity 
3. T 
4. Т By Lagrange's theorem. 
5. Е fmust also be a bijection. 
SECTION 9.2 
1T 
2.T 
3.F Н maps Z} toZ;. 
4. Т Ifthe syndrome of X equals 0, then X is in the kernel of ће homomorphism generated by Н and is а 
code word; it will be assumed that no errors occurred. 
5. Е No coding scheme can guarantee zero bit errors. 
SECTION 9.3 
1T 
2. T (0 V 1)*00 
3. Е See Figure 9.8a. 
4. Т 
5. Е 
ЅЕСТІОМ 9.4 
1. Е Entering a final state causes a Turing machine to halt, but it can also halt in a nonfinal state if there is 
no instruction for the current state—input pair. 
2. F There will be n + 2 1s on the tape. 
3. F 
4. Е This is the version with the quantifiers reversed that is trivially true. 
5. T 
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SECTION 9.5 


ат pe oe ass 


T 


F 
F 
T 
T 


Depending on the productions selected, nonterminal symbols may never be removed. 
There may be an equivalent grammar for the language that is context-free. 


Index 


A 
ABA routing number, 437 
Absolute value, 107 
Absorption property, 633 
Ackermann’s function, 407 
Acquaintanceship graph, 499 
Acyclic graph, 482 
Addition inference rule, 29 
Addition modulo n, 343, 438, 692 
Addition of matrices, 449 
Addition principle, 254-55 
definition of, 254 
Adjacency list, 494 
Adjacency matrix, 492 
Adjacency relation, 555 
Adjacent nodes, 481 
Adleman, Len, 431 
AES (Advanced Encryption 
Standard), 431 
Algebraic structure, 686—708. See also 
Group 
associative property, 687 
commutative property, 687 
free monoid, 695 
group of permutations, 693 
identity element, 687 
inverse element, 687 
monoid, 689 
semigroup, 689 
semigroup of transformations, 693 
ALGOL, 164, 788 
Algorithm 
definition of, 12 
divide-and-conquer, 468 
graph traversal, 596—604 
greedy, 586 
optimal, 533 
recursive, 166-71 
Algorithms 
ArtPoint, 610 
Bellman-Ford, 592 


BinarySearch, 169, 208 
BreadthFirst, 599 
BubbleSort, 214 
CombGenerator, 285 
DepthFirst, 597 
Dijkstra 5, 582 
Euclidean, 133 
EulerPath, 574 
Floyd s, 593 
HeapSort, 527 
HuffmanTree, 542 
Inorder, 515 
Kruskal s, 594 
MatrixMultiplication, 452 
MergeSort, 215 
Minimize, 742 
PermGenerator, 282, 283 
Postorder, 515 
Preorder, 514 
QuickSort, 215 
SelectionSort, 169, 215 
SequentialSearch, 204, 207 
Sum-of-Products, 643 
TautologyTest, 13 
TopologicalSort, 360 
Warshall, 563 
Alphabet, 694, 782 
Alternating group, 701 
Ambiguous grammar, 798 
Analysis of algorithms, 203-10, 314, 
415 
average case, 205, 314 
binary search, 209 
breadth-first search, 601 
computational complexity, 416 
definition of, 203 
depth-first search, 601 
Dijkstra's algorithm, 586 
Euclidean algorithm, 210 
Euler path, 565 
lower bound, 511 


sequential search, 205, 207 
upper bound, 210 
using recurrence relations, 206-10 
Warshall’s algorithm, 565 
worst case, 205 
AND connective, 11, 75 
AND gate, 12, 639 
Antecedent, 3 
Antisymmetric relation, 332 
Appel, Kenneth, 507 
Arc, 477, 478 
Archimedes, 286 
Aristotle, 59, 71 
Arithmetic progression (arithmetic 
sequence), 124 
Array, 448 
Articulation point., 607 
Assertion, 85-87 
definition of, 86 
Assignment rule of inference, 87—89 
definition of, 87 
Associative equivalence rule, 28 
Associative property, 9, 234, 687 
Asymmetric encryption, 431 
Asymmetric relation, 349 
Atias,Nir, 497 
Augmented matrix, 453 
Automorphism, 714 
Axiom, 98, 773 


B 
Back arc, 608 
Backus-Naur form (BNF), 163, 787 
Ballot problem, 175 
BASIC, 259 
Basis (basis step), 111 
Bayes' theorem, 308-9 
statement of, 309 
Behm, Patrick, 136 
Bell numbers, 354 
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Index 


Bellman-Ford algorithm, 592 
Benford’s law, 325 
Benoit, Paul, 136 
Bernoulli experiment, 313 
Bernoulli trial, 313 
Bernoulli, Jacob, 313 
Biconnected components, 608 
Big oh, 417 
Big theta, 413 
Bijection, 390 
Binary adder, 731—33 
Binary connective, 3 
Binary GCD algorithm, 142 
Binary operation, 228-30 
associative, 687 
commutative, 687 
definition of, 229 
well-defined, 229, 686 
Binary predicate, 40 
Binary relation, 328-36 
antisymmetric, 332 
asymmetric, 349 
closure of, 334-36 
complement of, 331 
definition of, 329 
and directed graph, 555-57 
equivalence relation, 339-44 
intersection of, 331 
inverse of, 350 
irreflexive, 349 
many-to-many, 330 
many-to-one, 330 
one-to-many, 330 
one-to-one, 330 
partial ordering, 336-38 
properties of, 332-34 
reflexive, 332 
reflexive closure of, 334 
from S to T, 330 
symmetric, 332 
symmetric closure of, 334 
transitive, 332 
transitive closure of, 334, 561 
union of, 331 
Binary search algorithm, 169, 208 
Binary search analysis, 209 
Binary search tree, 352, 511, 534 
Binary string, 163 
Binary tree, 510 
Binary tree search, 352, 534 
Binomial coefficient, 297 
Binomial distribution, 313-14 
definition of, 313 
Binomial theorem, 294—98 
statement of, 296 


Bipartite complete graph, 483 

Birthday problem, 318 

Blind key, 368 

Block cipher, 429 

Blocks of a partition, 339 

BNF. See Backus-Naur form 

Boole, George, 458, 617 

Boolean algebra, 618-31 
absorption property, 633 
complement of an element, 625 
definition of, 620 
De Morgan’s laws, 624 
double negation property, 624 
dual of a property, 623 
idempotent property, 622 
isomorphism, 629 
modular property, 633 
universal bound property, 624 

Boolean AND, 458 

Boolean expression, 639 

Boolean matrix, 458-59 
Boolean AND, 458 
Boolean OR, 458 
multiplication, 458 
and reachability, 558 

Boolean OR, 458 

Bottom up parsing, 794 

Bourg, David M., 749 

Breadth-first search, 598—601 
analysis of, 601 

B-tree, 526 

Bubble sort algorithm, 214 

Byte, 257 


с 
С, 224 
Caesar, Julius, 428 
Caesar cipher, 428 
Cancellation laws, 696 
Canonical parity-check matrix, 721 
Canonical product-of-sums form, 660 
Canonical sum-of-products form, 643 
Cantor, Georg, 237, 401 
Cantor’s diagonalization method, 237 
Cantor’s theorem, 401 
Carbin, Michael, 654 
Cardinality 

of a relation, 366 

of a set, 236, 401 
Cartesian product (cross product), 233 
Castile, Tracy, 262 
Catalan numbers, 175, 263, 293, 526 
Cayley, Arthur, 708 
Cayley’s theorem, 708 


Ceiling function, 386 
Center of a group, 713 
Chain, 338 
Chaining, 426 
Characteristic equation, 190 
Characteristic function, 406 
Chinese remainder theorem, 432, 445 
Chomsky, Noam, 786 
Chomsky hierarchy, 790 
Chromatic number of a graph, 507 
Chung, Fan, 287 
Church, Alonzo, 770, 771, 773 
Church-Turing thesis, 770 
Ciphertext, 428 
Circular left shift, 430, 439 
Closed form solution, 180 
Closure of binary relations, 334-36, 
556 
definition of, 334 
Coding theory, 714-26 
canonical parity-check matrix, 721 
check bits, 721 
coset leader, 723 
double-error detecting code, 715 
group code, 719 
Hamming distance, 718 
information bits, 721 
maximum likelihood decoding, 715 
minimum distance of a code, 718 
perfect code, 722 
single-error correcting code, 715 
syndrome, 724 
weight of a code word, 719 
Codomain, 383 
Coefficient, 690 
Collision, 425 
Collision resolution, 425 
Combinational network, 638-48, 744 
definition of, 642 
Combinations, 274—77 
definition of, 274 
eliminating duplicates, 279 
generating, 284 
with repetitions, 279 
Combinatorial proof, 296 
Combinatorics, 252-87 
addition principle, 254 
combinations, 277 
decision tree, 257-58 
definition of, 252 
multiplication principle, 253 
permutations, 272-74 
pigeonhole principle, 269 
Common difference, 124 
Common ratio, 124 
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Commutative diagram, 391, 479, 628 
Commutative equivalence rule, 28 
Commutative group, 688 
Commutative property, 9, 234, 687 
Complement 
of a binary relation, 331 
of a boolean algebra element, 625 
of a graph, 506 
of a set, 232 
Complement property, 9, 234 
Complete binary tree, 510 
Complete formal system, 27, 58 
Complete graph, 482 
Composite number, 107 
Composition function, 391 
Composition of functions, 3990—92 
definition of, 391 
Compound statement, 2 
Computational complexity, 416, 
776-78 
Computational procedure, 769 
Computer security, 143, 427-35 
ciphertext, 428 
cryptographic hash functiom, 434 
cryptography, 427-33 
one-way encryption, 434 
password encryption, 433-35 
plaintext, 428 
Concatenation, 163, 694 
Conclusion of an argument, 25 
Conditional probability, 306—7 
definition of, 307 
independent events, 307 
Conditional rule of inference, 90-92 
definition of, 90 
Conditional statement, 90 
Congruence modulo n, 343 
Conjunct, 2 
Conjunction, 2 
Conjunction inference rule, 29 
Conjunctive normal form, 660 
Connected component of a graph, 603 
Connected graph, 482 
Connective. See Logical connective 
Consequent, 3 
Constant symbol, 41 
Context free (type 2) grammar, 790, 
793-95 
Context free language, 790 
Context sensitive (type 1) grammar, 
790 
Context sensitive language, 790 
Contradiction, 8, 104—6 
Contraposition, 103—4 
Contraposition inference rule, 37 


Contrapositive, 21, 103 
Converse, 21, 103 
Correct formal system, 27, 58 
Correct program, 85 
Coset, 717 
Countable set, 236 
Counterexample, 99 
Critical path, 359 
Cryptographic hash function, 434 
Cryptography, 427-33 
AES, 431 
asymmetric encryption, 431 
block cipher, 429 
Caesar cipher, 428 
definition of, 428 
DES, 429, 656 
diffusion, 429 
private key encryption, 431 
public key encryption, 431 
RSA public key encryption 
algorithm, 431 
simple substitution cipher, 429 
symmetric encryption, 431 
Cryptology, 428 
Cutler, William, 286 
Cycle in a graph, 482 
Cycle notation, 395 
Cyclic group, 713 


D 

Darwin, Charles, 211 

Data compaction, 540 

Data compression, 540 

Data flow diagram, 480 

Data integrity, 375 

Database, 365—76 
add operation, 375 
blind key, 368 
composite primary key, 367 
conceptual model of, 365 
data integrity, 375 
definition of, 365 
delete operation, 375 
entity, 365 
entity attributes, 365 
entity integrity, 367, 375 
entity-relationship diagram, 365 
entity-relationship model, 365—66 
foreign key, 368 
integrity, 375-76 
join operation, 370 
metadata, 366 
modify operation, 375 
operations, 369—73 


outer join, 369 
primary key, 367 
project operation, 369 
in Prolog, 73 
referential integrity, 375 
relational, 366 
relational algebra, 372 
relational calculus, 372 
relational model, 366—69 
restrict operation, 369 
SQL, 372 
surrogate key, 368 
tuple, 366 
Decision problem, 771—76 
definition of, 772 
negative solution for, 772 
positive solution for, 772 
uncomputable, 772 
undecidable, 772 
unsolvable, 772 
Decision tree, 257—58, 511, 529—36 
definition of, 529 
for searching, 529-35 
for sorting, 535—36 
Declarative language, 73 
Deduction method, 32 
Deductive reasoning, 99 
Degree of a node, 481 
Degree of a polynomial, 691 
Degree of a relation, 366 
Degree of separation, 499 
Delay element, 744 
DeMoivre's theorem, 126 
De Morgan, Augustus, 9, 507 
De Morgan's laws, 10, 28, 178, 248, 
250, 624, 633 
Denumerable set, 236, 401 
Depth of a node, 510 
Depth-first search, 81, 596—98 
analysis of, 601 
Dequeue, 599 
Derangement, 396 
Derivation in a grammar, 784 
Derivation rules, 803-04 
definition of, 27 
for predicate logic, 58—62 
for propositional logic, 28—33 
DES (Data Encryption Standard), 429, 
656 
Descriptive language, 73 
Determinant of a matrix, 473 
Diagonal matrix, 463 
Difference of sets, 233 
Diffusion, 429 
Digraph. See Directed graph 
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Dijkstra, Edsger W., 581 
Dijkstra’s algorithm, 581 
analysis of, 586 
Dimensions of a matrix, 446 
Direct proof, 101—3 
Directed graph 
connected, 568 
definition of, 478 
initial point, 478 
path in, 483 
reachability in, 557-62, 602 
reachability matrix, 559 
reachable node in, 483, 557 
terminal point, 478 
Warshall’s algorithm, 562-65 
Disjoint cycles, 395 
Disjoint sets, 232 
Disjunct, 3 
Disjunction, 3 
Disjunctive normal form, 643 
Disjunctive syllogism, 37 
Distributive inference rule, 37 
Distributive property, 9, 234, 250 
Divide-and-conquer algorithm, 208, 
468 
Divide-and-conquer recurrence 
relation, 193-97 
Divides, 107 
DNA, 205 
Document Type Definition, 788 
Domain 
of a function, 383 
of an interpretation, 41 
Don't-care condition, 653 
Double negation equivalence rule, 28 
Double negation property, 624, 633 
DTD, 788 
Dual 
of Boolean algebra property, 623 
of set identity, 235 
of a tautological equivalence, 9 
Dummy variable, 41 
Dwyer function, 407 


E 

Edge, 477, 478 

Elementary row operations, 454 
Empty set, 224 

Empty string, 163, 694 
Enqueue, 599 

Entity integrity, 367, 375 

Entity relationship diagram, 365 
Equal functions, 387 

Equal matrices, 447 

Equal sets, 223 


Equivalence class, 339 
Equivalence connective, 3 
Equivalence relation, 339-44 
definition of, 339 
and equivalence class, 339 
and partitions, 340 
Equivalence rules 
associative, 28 
commutative, 28 
definition of, 28 
De Morgan’s laws, 28 
double negation, 28 
implication, 28 
Equivalent Boolean expressions, 645 
Equivalent grammars, 791 
Equivalent sets, 401—2 
Cantor’s theorem, 401 
definition of, 401 
Equivalent states, 739 
Equivalent wffs, 8 
Erasing convention, 789 
Euclid, 98, 133, 148 
Euclidean algorithm, 133-35, 144, 
145, 210, 217, 432 
binary GCD algorithm, 142 
greatest common divisor, 133 
Euler, Leonhard, 488, 571 
Euler path, 572, 573 
Euler phi function, 149-51, 431 
definition of, 150 
Euler's formula, 488 
Even node, 572 
Even number, 101 
Even permutation, 700 
Event, 302 
Exclusive OR, 22, 430, 634, 656 
Exhaustive proof, 100—101 
Existential generalization, 59, 62 
Existential instantiation, 59, 60 
Existential quantifier, 40 
Expected value, 310-12 
definition of, 310 
linearity of, 312 
Expert system, 81 
Exportation inference rule, 37 
Extensible Markup Language, 788 


F 

Faivre, Alain, 136 

Fermat, Pierre de, 143 

Fermat's last theorem, 143 

Fermat's little theorem, 432, 445 

Fibonacci sequence, 159, 188, 301 

Field-programmable gate array 
(FPGA), 647 


Figurate numbers, 200 
Final state, 734, 765 
Finite set, 223 
Finite state machine, 479, 728-48 
binary adder, 731-33 
definition of, 729 
equivalent states in, 739 
final state, 734 
input alphabet, 729 
k-equivalent states in, 739 
Kleene’s theorem, 737 
minimization, 737-44 
minimization algorithm, 742 
next-state function, 729 
output alphabet, 729 
output function, 729 
recognition by, 734 
and regular sets, 737 
and sequential networks, 744-48 
state graph, 730 
state table, 730 
unreachable state of, 737 
First principle of induction, 110-12, 
250 
definition of, 111 
First order recurrence relation, 182 
Five-color theorem, 507 
Flip-flop, 744 
Floating point operations, 460 
Floor function, 386 
Floyd’s algorithm, 593 
Foreign key, 368 
Forest, 510 
Formal language, 782-95. See also 
Language 
and computational devices,792—93 
definition of, 785 
generated by a grammar, 784 
hierarchy of, 793 
Formal logic, 1—96 
predicate logic, 58—69, 773 
propositional logic, 25—35 
Formal system 
completeness of, 27, 58 
correctness of, 27, 58 
Four-color problem, 507 
Free monoid, 695 
Free tree, 509 
Free variable, 42 
Full binary tree, 510 
Full-adder, 649 
Function, 381—402 
Ackermann’s function, 407 
bijection, 390 
as a binary relation, 383 
ceiling function, 386 
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characteristic, 406 
codomain, 383 
composition, 391 
composition of, 390—92 
definition of, 383 
domain, 383 
Dwyer function, 407 
equality of, 387 
floor function, 386 
hash function, 441 
identity function, 392 
image under, 383 
injective, 389 
inverse, 392-94 
left inverse, 408 
modulo, 386 
of more than one variable, 385 
next-state, 729 
nondeterministic, 411 
number of, 397—400 
number-theoretic, 767 
one-to-one, 389 
onto, 388 
order of magnitude of, 412-21 
output, 729 
partial, 767 
permutation, 394—96 
preimage under, 383 
properties of, 388-90 
range of, 388 
right inverse, 408 
Smorynski function, 407 
surjective, 388 
total, 767 
truth, 640 
Turing-computable, 768 
Fundamental Theorem of Arithmetic, 
144-48 
definition of, 144 
Fuzzy logic, 23, 250 
Fuzzy set, 250 


G 

Gates, William, 211 

Gauss, Karl Friedrich, 110, 454 

Gaussian elimination, 453—57 

Generating permutations, 280-85 

Generator of a cyclic group, 713 

Geometric progression (geometric 
sequence), 124 

Global clustering coefficient, 500 

Gödel, Kurt, 98, 773 

Goldbach conjecture, 149, 154 

Golden ratio, 160, 219 

Goodman, Jacob, 211 


Graham, Ronald, 287 

Grammar 
ambiguous, 798 
Backus-Naur form, 787 
bottom-up parsing, 794 
Chomsky hierarchy, 790 
classes of, 789—92 
context-free (type 2), 790, 793—95 
context-sensitive (type 1), 790 
direct generation (direct derivation) 

in, 784 
equivalent, 791 
erasing convention, 789 
generation (derivation) in, 784 
language generated by, 784 
parse tree, 793 
phrase-structure (type 0) 
grammar, 783 

production, 783 
regular (type 3), 790 
start symbol, 783 
terminal, 783 
top-down parsing, 794 

Graph, 476-96. See also Tree 
acquaintanceship graph, 499 
acyclic, 482 
adjacency list, 494 
adjacency matrix, 492 
adjacency relation, 555 
adjacent nodes in, 481 
algorithms, 553—604 
applications, 479-81 
arc, 477, 478 
articulation point, 607 
Bellman-Ford algorithm, 592 
biconnected, 608 
biconnected components, 608 
bipartite complete, 483 
breadth-first search of, 598—601 
chromatic number of, 507 
colorability, 506 
coloring of, 507 
complement of, 506 
complete, 482 
connected, 482 
connected component of, 603 
cycle in, 482 
definition (formal), 478 
definition (informal), 477 
degree of a node, 481 
degree of separation, 499 
depth-first search of, 596—98 
Dijkstra's algorithm, 581 
directed. See Directed graph 
dual graph for a map, 507 
edge, 477, 478 


endpoints of an arc, 478 
Euler path in, 572 
Euler's formula, 488 
even node, 572 
five-color theorem, 507 
Floyd's algorithm, 593 
four-color problem, 507 
global clustering coefficient, 500 
Hamiltonian circuit in, 576 
homeomorphic, 491 
in-degree of a node, 579 
isolated node, 481 
isomorphic, 484—87 
isomorphism, 486 
Kruskal’s algorithm, 594 
Kuratowski theorem, 491 
labeled, 479 
length of a path, 482 
loop in, 481 
loop-free, 481 
map-coloring problem, 506 
minimal spanning tree, 587 
node, 477, 478 
odd node, 572 
out-degree of a node, 579 
parallel arcs in, 481 
path in, 482 
Petersen, 492 
planar, 487-92 
Prim’s algorithm, 587 
self-complementary, 506 
shortest path problem, 581-87 
simple, 481 
six-color theorem, 508 
spanning tree in, 587 
sparse adjacency matrix, 494 
subgraph of, 482 
theorem on Euler paths, 573 
traversal, 596 
traversal algorithms, 596—604 
trees, 509-21 
vertex, 477, 478 
weighted, 479 
Graph coloring problem, 778 
Greatest common divisor, 133, 144 
Greatest element, 338 
Greatest lower bound, 637 
Greedy algorithm, 586 
Group 
alternating, 701 
automorphism, 714 
Cayley's theorem, 708 
center of a, 713 
commutative, 688 
coset, 717 
cyclic, 713 
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Group (cont.) 
definition of, 688 
generator of, 713 
homomorphism, 702 
idempotent element, 713 
improper subgroup of, 700 
isomorphic, 702—8 
isomorphism, 702 
kernel of a homomorphism, 716 
Lagrange’s theorem, 701 
left cancellation law, 696 
left identity element, 712 
left inverse element, 713 
order of, 698 
permutation group, 700 
of permutations, 693 
proper subgroup of, 700 
right cancellation law, 696 
right identity element, 712 
right inverse element, 713 
subgroup of, 699 
symmetric group of degree n, 693 
of symmetries of an equilateral 

triangle, 710 
Group code, 719 


H 
Haken, Wolfgang, 507 
Half-adder, 649 
Halting problem, 774 
Hamilton, William Rowan, 576 
Hamiltonian circuit, 576, 777 
Hamming distance, 718 
Hamming, Richard W., 718 
Hash function, 424, 441 
Hash table, 425 
Hashing, 424—27 
chaining, 426 
collision, 425 
collision resolution, 425 
cryptographic hash functiom, 434 
hash function, 424 
hash table, 425 
linear probing, 425 
load factor, 427 
Hasse diagram, 336, 479 
Heap, 527 
Heapsort algorithm, 527 
Height of a tree, 510 
Highway inspector problem, 554, 571 
Hilbert, David, 772 
Hilbert’s tenth problem, 772 
Hoare, Anthony, 86 
Hoare triple, 86 
Homeomorphic graphs, 491 


Homogeneous recurrence relation, 182 
Homomorphism, 702 
Hopkins, Brian, 571 
Horn clause, 76 
Horner’s method, 213 
HTML, 788 
Huffman code, 539-46 
encoding algorithm, 542 
JPEG compression, 547 
prefix code, 541 
Hypertext Markup Language, 788 
Hypothesis of an argument, 25 
Hypothetical syllogism, 33, 37 
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Idempotent property, 622 
Identity element, 687 
Identity function, 392 
Identity matrix, 451 
Identity permutation, 396 
Identity property, 9, 234 
Image under a function, 383 
Immediate predecessor, 336 
Implication, 3 
Implication equivalence rule, 28 
Improper subgroup, 700 
Inclusive OR, 22 
Inconsistency inference rule, 37 
Independent events, 307 
Index of summation, 182 
Index set, 250 
Induction 
basis step, 111 
equivalence to well-ordering, 119 
first principle of, 110-12 
inductive assumption, 112 
inductive hypothesis, 112 
inductive step, 111 
proofs by, 112-18 
second principle of, 118—22 
structural induction, 164, 520 
Inductive assumption, 112 
Inductive definition, 158 
Inductive hypothesis, 112 
Inductive reasoning, 99 
Inductive step, 111 
Inference rules 
addition, 29 
conjunction, 29 
contraposition, 37 
definition of, 29 
disjunctive syllogism, 37 
distributive, 37 
existential generalization, 59, 62 


existential instantiation, 59, 60 
exportation, 37 
hypothetical syllogism, 33, 37 
inconsistency, 37 
modus ponens, 29 
modus tollens, 29 
resolution, 76 
self-reference, 37 
simplification, 29 
universal generalization, 59, 61 
universal instantiation, 59 
Infinite sequence. See Sequence 
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Ingerman, P.Z, 788 
Initial point, 478 
Injective function, 389 
Inorder tree traversal, 515 
Input alphabet, 729 
Integer overflow, 344 
Intermediate statement, 43 
Internal node in a tree, 510 
Interpretation, 41 
domain of, 41 
Intersection of binary relations, 331 
Intersection of sets, 231 
Intractable problem, 417, 776 
Inverse 
of a binary relation, 350 
of a function, 392-94 
of an implication, 21 
of a matrix, 452 
Inverse element, 687 
Inverse function, 393 
Inverter, 12, 639 
Invertible matrix, 452 
IP address, 257 
Irreflexive relation, 349 
ISBN, 435 
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Isomorphic Boolean algebras, 629 
Isomorphic graphs, 484—87 
Isomorphic groups, 702-8 
Isomorphic partially ordered sets, 634 
Isomorphic structures, 626 
Isomorphic trees, 527 
Isomorphism, 626 
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Kuratowski theorem, 491 
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Lagrange, Joseph-Louis, 701 
Lagrange’s theorem, 701 
Lamé, Gabriel, 217 
Language. See also Formal language 
alphabet, 782 
context-free, 790 
context-sensitive, 790 
generated by a grammar, 784 
over an alphabet, 782 
palindrome, 797 
pumping lemma, 798 
regular, 790 
type 0, 790 
vocabulary, 782 
word, 782 
Lattice, 637 
complemented, 637 
distributive, 637 
Leaf of a tree, 510 
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Least element, 338 
Least upper bound, 637 
Left cancellation law, 696 
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of a function, 408 
Length of a string, 694 
Level-order tree traversal, 606 
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Lexicographical ordering, 281, 351 
L'Hópital's rule, 422 
Linear bounded automaton (Iba), 792 
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definition of, 447 
Gaussian elimination, 453-57 
solving systems of, 453-57 
Linear first-order recurrence relations, 
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Linear recurrence relation, 182 
Linear second-order recurrence 
relations, 188—193 
Linked list, 494 
Little oh, 417 
Load factor, 427 
Logarithm function, 809-12 
Logic. See also Formal logic 
fuzzy, 23 
many-valued, 23 
three-valued, 23, 374 
two-valued, 23 
Logic network, 479, 638-53 
AND gate, 639 
canonical product-of-sums form, 
660 
canonical sum-of-products form, 
643 
combinational network, 638-48, 
744 
conjunctive normal form, 660 
delay element, 744 
disjunctive normal form, 643 
don’t-care condition, 653, 672 
field-programmable gate array, 
647 
flip-flop, 744 
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inverter, 639 
Karnaugh map, 665-71 
minimization, 663-77 
multiplexor, 663 
NAND gate, 650 
NOR gate, 651 
OR gate, 639 
programmable logic device, 647 
Quine—McCluskey procedure, 
673-77 
sequential, 744—801 
Logic programming, 73—82 
Logical connective, 2—8 
AND, 11, 75 
conjunction, 2 
disjunction, 3 
equivalence, 3 
implication, 3 
negation, 4 
NOT, 11, 75 
OR, 11, 75 
order of precedence, 6 
and programming, 11 
truth table, 3 
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definition of, 131 
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Machine. See Finite-state machine 
Maclaurin series, 411 
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Main diagonal of a matrix, 447 
Many-valued logic, 23 
Map coloring problem, 506 
Mapping. See Function 
Markov, A., 771 
Marley, Scott, 24 
Master theorem, 417—21 
proof of, 419-21 
statement of, 418 
Mathematical induction. See Induction 
Mathematical structure, 619 
Matrix, 446-59 
addition, 449 
augmented, 453 
Boolean, 458-59 
of coefficients, 447 
cofactor of an element, 473 
definition of, 446 
determinant of, 473 
diagonal, 463 
dimensions of, 446 
elementary row operations on, 454 
equal, 447 
Gaussian elimination, 453—57 
identity, 451 
inverse of, 452 
invertible, 452 
main diagonal, 447 
minor of an element, 473 
multiplication, 450 
operations on, 448—53 
scalar multiplication, 448 
Strassen's algorithm, 467 
subtraction, 449 
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transpose of, 463 
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zero, 449 
Maximal element, 338 
Mergesort algorithm, 215 
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of Boolean expressions, 645—47 
of combinational network, 663-77 
of finite-state machine, 737-44 
Misailovic, Sasa, 654 
Modular arithmetic, 692 
Modular multiplicative inverse, 356 
Modular property, 633 
Modulo function, 386, 423-40 
ABA routing number, 437 
in cryptographic hashing, 434 
to decompose integers, 438 
to generate integer values, 437 
for hashing, 427 
ISBN, 435 
modular arithmetic designs, 438 
residue of x modulo n, 423 
in RSA, 431 
UPC-A, 436 
Modus ponens, 27, 29 
Modus tollens, 29 
Monoid, 689 
Morse code, 541 
Multiplexor, 663 
Multiplication modulo n, 692 
Multiplication of matrices, 450 
Multiplication principle, 252-53 
definition of, 253 
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n factorial, 99, 272 

NAND gate, 22, 650 

n-ary predicate, 40 

n-ary relation, 330 

n-ary relation on a set, 356 
Necessary condition, 4 
Negation connective, 4 
Negation of a statement, 5 
Network. See Logic network 
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Next-state function, 729 
Node, 337, 477, 478 
Nondeterministic function, 411 
Nondeterministic Turing machine, 777 
Nonrooted tree, 509 

NOR gate, 23, 651 
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МР, 777 

NP-complete problem, 778 
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Odd permutation, 700 
One-to-one function, 389 
One-way encryption, 434 
Only, 43 
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Onto function, 388 
Operation 
binary, 228-30 
unary, 230 
Optimal algorithm, 533 
OR connective, 11, 75 
OR gate, 12, 639 
Order of a group, 698 
Order of magnitude, 412-21 
big oh, 417 
big theta, 413 
definition of, 413 
little oh, 417 
master theorem, 418 
Order of precedence, 6 
Ordered pair, 228 
Ore, Oystein, 580 
Outer join, 369 
Output alphabet, 729 
Output function, 729 
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definition of, 777 
and NP, 77—78 
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Palindrome language, 797 
Papadimitriou, Christos, 211 
Paradox, 69, 251 
Parallel arcs, 481 
Parent node in a tree, 509 
Parity bit, 126, 733 
Parse tree, 511, 793 
Parsing 
bottom-up, 794 
top-down, 794 
Partial correctness, 132 
Partial function, 767 
Partial ordering, 336—38 
chain, 338 
definition of, 336 
greatest element, 338 


greatest lower bound, 637 
Hasse diagram, 336 
immediate predecessor in, 336 
lattice, 637 
least element, 338 
least upper bound, 637 
maximal element, 338 
minimal element, 338 
predecessor in, 336 
restriction of, 336 
successor in, 336 
total ordering, 338 

Partially ordered set, 336 
dual of, 350 
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Pascal, Blaise, 294 

Pascal's formula, 295 
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in a graph, 482 

Pattern matching, 205 

Peirce arrow, 23 

Peng, Richard, 460 
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Perfect square, 107 

Perl, 757 

Permutation function, 394—96 
cycle notation, 395 
definition of, 394 
derangement, 396 
disjoint cycles, 395 
identity permutation, 396 

Permutation group, 700 

Permutations, 272—74 
definition of, 272 
eliminating duplicates, 279 
generating, 282 
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Phrase structure (type 0) grammar, 783 

Pig latin, 797 

Pigeonhole principle, 269 
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Postcondition, 86 
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Postorder tree traversal, 515 
Power set, 227 
Precondition, 86 
Predecessor, 336 
Predicate, 39—42 
binary, 40 
definition of, 39 
n-ary, 40 
ternary, 40 
unary, 40, 223 
Predicate logic, 58—69 
completeness of, 63, 773 
correctness of, 63, 773 
definition of, 58 
derivation rules for, 58—62 
valid argument in, 58 
well-formed formula, 41 
Predicate well-formed formula, 35, 41 
validity of, 48 
Prefix code, 541 
Prefix notation, 518 
Preimage, 383 
Prenex normal form, 57 
Preorder tree traversal, 514 
Primary key, 367 
Prime number, 107 
Mersenne primes, 153 
Prim’s algorithm, 587 
Principle of inclusion and exclusion, 
150, 264-68, 300, 304 
definition for n sets, 267 
definition for three sets, 265 
Principle of well-ordering, 119, 250, 702 
Private key encryption, 431 
Probability, 301-14 
axioms, 304 
Bayes' theorem, 308-9, 321 
Bernoulli trial, 313 
binomial distribution, 313-14 
birthday problem, 318 
conditional, 306—7 
equally likely outcomes, 302 
event, 302 
expected value, 310-12 
independent events, 307 
linearity of expected value, 312 
probability distribution, 305 
random variable, 310 
sample space, 302 
weighted average, 310 
Probability distribution, 305 
Procedural language, 73 
Product notation, 807—08 
Production, 783 
Program testing, 85 
Program validation, 85 


Program verification, 84 
Programmable logic device (PLD), 647 
Programming language 
declarative, 73 
descriptive, 73 
procedural, 73 
Project operation, 369 
Prolog, 73-81 
database, 73 
fact, 73 
program, 73 
query, 73 
recursive rule, 80 
rule, 75 
rule of inference, 76 
Proof by cases, 104 
Proof of correctness, 84—92, 129—35 
assertions, 85-87 
assignment rule, 87—89 
conditional rule, 90-92 
definition of, 85 
loop rule, 129-33 
Proof sequence, 27 
Proof techniques, 98—107 
contradiction, 104—6 
contraposition, 103—4 
direct proof, 101—3 
exhaustive proof, 100—101 
indirect proof, 104 
induction, 110—22 
serendipity, 107 
Proper subgroup, 700 
Proper subset, 225 
Proposition, 2 
Propositional calculus, 25 
Propositional logic, 25—35 
completeness of, 32 
correctness of, 32 
definition of, 25 
derivation rules for, 28—33 
valid argument in, 25—28 
well-formed formula, 25 
Propositional well-formed formula, 
25 
Pseudocode, 12 
Public key encryption, 431 
Pumping lemma, 798 
Pushdown automaton (pda), 792 
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enqueue, 599 
Quicksort algorithm, 215 
Quine—McCluskey procedure, 673-77 
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R, 224 
Random variable, 310 
Range of a function, 388 
Rational number, 105 
Reachability, 557-62, 602 
Reachability matrix, 559 
Reachable node, 483, 557 
Recaman’s sequence, 180 
Recognition 
by finite-state machine, 734 
by Turing machine, 765 
Recurrence relation, 159, 180—97 
characteristic equation of, 190 
closed-form solution, 180 
constant coefficients in, 182 
definition of, 159 
divide-and-conquer, 193-97 
first-order, 182 
general solution (divide-and- 
conquer), 196 
general solution (first-order), 183 
general solution (second-order), 
193, 196 
homogeneous, 182 
linear, 182 
linear first-order, 182 
second-order, 188 
solving, 180 
Recursion, 79-81, 158—71 
recursive algorithm, 166—71 
recursive definition, 79, 158 
recursive operation, 165-66 
recursive sequence, 158-62 
recursive set, 162—64 
Referential integrity, 375 
Refinement of a partition, 354, 741 
Reflexive relation, 332 
Regular expression, 735 
Regular (type 3) grammar, 790 
Regular language, 790 
Regular set, 736 
Relation. See Binary relation 
Relational algebra, 372 
Relational calculus, 372 
Relational database. See Database 
Relatively prime, 146 
Residue of x modulo л, 423 
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Restrict operation, 369 
Restriction of a partial ordering, 336 
Reverse Polish notation (RPN), 518 
Right cancellation law, 696 
Right child node, 510 
Right identity element, 712 
Right inverse 
of an element, 713 
of a function, 408 
Rinard, Martin, 654 
Rivest, Ron, 431 
Root of a tree, 509 
RSA public-key encryption 
algorithm, 431 
Rule-based system, 81 
Russell, Bertrand, 251 
Russell’s paradox, 251 
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Sample space, 302 
Satisfiability problem, 778 
Scalar, 448 
Scalar multiplication, 448 
Scanner, 794 
Scope of a quantifier, 41 
Searching 
binary search, 169 
binary tree search, 352, 534 
decision tree for, 529-35 
by hash function, 425 
lower bound for, 532—33 
sequential search, 204, 415 
Second principle of induction, 118-22, 
144, 160, 251 
definition of, 118 
Second-order recurrence relation, 188 
Seemann, Glenn, 749 
Selection sort algorithm, 169, 215 
Self-complementary graph, 506 
Self-reference inference rule, 37 
Semigroup, 689 
Semigroup of transformations, 693 
Sequence 
definition of, 158 
Fibonacci, 159 
recursive, 158-62 
Sequential network, 744-48 
definition of, 745 
Sequential search algorithm, 204 
Sequential search analysis, 205, 
207, 216 
Serendipity, 106—7 
Sets, 222-39 
binary operation on, 229 
binary relation on, 329 
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method, 237 
cardinality of, 236—39, 401 
Cartesian product (cross product) 
of, 233 
characteristic function of, 406 
closure of, 229, 686 
complement of, 232 
countable, 236 
denumerable, 236, 401 
difference of, 232 
disjoint, 232 
empty, 224 
equality of, 223 
equivalent, 401—402 
finite, 223 
fuzzy, 250 
identities, 233—36 
intersection of, 231 
membership, 222 
n-ary relation on, 330 
notation, 22224 
null, 224 
operations on, 230—33 
partial ordering on, 336 
partially ordered, 336, 634 
partition of, 339 
permutations of, 394 
power set of, 227 
proper subset of, 225 
recognition by finite-state 
machine, 734 
recognition by Turing machine, 765 
subset of, 224 
symmetric difference of, 248 
unary operation on, 230 
uncountable, 236 
union, 231 
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Shamir, Adi, 431 
Shannon, Claude, 638 
Sharan, Roded, 497 
Sheffer stroke, 22 
Sieve of Eratosthenes, 154 
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triangulation of, 263 
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Simple statement, 2 
Simple substitution cipher, 429 
Simplex method, 417 
Simplification inference rule, 29 
Six-color theorem, 508 
Sloane, Neil, 179 
Smorynski function, 407 
Smullyan, Raymond, 25, 771 
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bubble sort, 214 
decision tree for, 535—36 
heapsort, 527 
lower bound for, 536 
mergesort, 215 
quicksort, 215 
selection sort, 169, 215 
Spanning tree, 587, 601 
Sparse adjacency 
matrix, 494 
SQL, 372 
Square numbers, 200 
Stack, 167, 409 
pop instruction, 409 
push instruction, 409 
stack overflow, 167 
Standard sets, 224 
Start symbol, 783 
State graph, 730 
State table, 730 
Statement, 2 
Statement letter, 2 
Statement logic, 25 
Stewart, Ian, 174 
Stirling numbers, 355 
Stirling's triangle, 355 
Strassen's algorithm, 467 
String, 694 
binary, 163 
concatenation, 163 
empty, 163 
palindrome, 163 
pattern matching, 205 
Structural induction, 164, 520 
Subgraph, 482 
Subgroup, 699 
Subset, 224 
Subtraction of matrices, 449 
Successor, 336 
Sudoku, 154 
Sufficient condition, 4 
Summation notation, 182, 805—07 
Sun-Tsu, 445 
Surrogate key, 368 
Swift, Jonathan, 550 
Symbol, 694 
Symbol table, 441 
Symbolic logic, 68 
Symmetric difference of sets, 248 
Symmetric encryption, 431 
Symmetric matrix, 447 
Symmetric relation, 332 
Syntax analyzer, 794 
Syntax rule, 6 
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T 
Tape alphabet, 762 
Tautological equivalence, 9 
associative property, 9 
commutative property, 9 
complement property, 9 
distributive property, 9 
identity property, 9 
Tautology, 8—10 
definition of, 8 
Temporary hypothesis, 64 
Terminal, 783 
Terminal point, 478 
Ternary predicate, 40 
Ternary tree, 537 
Theorem, 98, 773 
Three-valued logic, 23, 374 
Tiling problem, 117 
Token, 163 
Top down parsing, 794 
Topological sorting, 356-61, 602 
definition of, 359 
Total function, 767 
Total ordering, 338 
Towers of Hanoi, 179, 199 
Transitive relation, 332 
Transpose of a matrix, 463 
Transposition, 700 
Traveling salesman problem, 554, 577 
Tree, 353, 509-21 
applications of, 511-12 
binary, 510 
binary search, 534 
B-tree, 526 
child node, 509 
complete binary, 510 
decision tree, 257—58, 529—36 
definition of, 509 
depth of, 510 
depth of a node, 510 
forest, 510 
free, 509 
full binary, 510 
height of, 510 
inorder traversal, 515 
internal node in, 510 
isomorphic, 527 
leaf of, 510 
left child in, 510 
left child-right child 
representation, 513 
level-order traversal, 606 
nonrooted, 509 
parent node, 509 


postorder traversal, 515 
preorder traversal, 514 
right child in, 510 
root, 509 
ternary, 537 
traversal, 514 
Tree arc, 608 
Tree traversal, 514 
Triangular numbers, 200 
Triangulation of a convex polygon, 263 
Truth function, 640 
Truth table, 3 
Tuple, 366 
Turing, Alan M., 428, 760, 771 
Turing computable function, 768 
Turing machine, 759—78 
acceptance by, 765 
Church- Turing thesis, 770 
decision problem, 771—76 
definition of, 762 
final state, 765 
as function computer, 767—69 
halting problem, 774 
nondeterministic, 777 
МР, 777 
NP-complete problem, 778 
P,777 
recognition by, 765 
as set recognizer, 764-67 
tape alphabet, 762 
unsolvability of halting problem, 
774 
Two’s complement, 660 
Two-valued logic, 23 
Type 0 language, 790 
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nary operation, 230 

nary predicate, 40, 223 
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ncomputable problem, 772 
ncountable set, 236 
ndecidable problem, 772 
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niversal bound property, 624 
niversal generalization, 59, 61 
niversal instantiation, 59 
niversal quantifier, 39 
niversal set, 231 
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Unsolvability of the halting 
problem, 774 
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UPC-A (Universal Product Code), 436 
Upper bound, 210 
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in predicate logic, 58 
in propositional logic, 25—28 
Valid predicate wff, 48 
Vandermonde's identity, 293 
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dot product of, 463 
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Venn, John, 231 
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Vertex, 337, 477, 478 
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Warshall’s algorithm, 562-65 
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Well-defined binary operation, 229 
Well-formed formula (wff), 6 
comparison of propositional and 
predicate, 48 
equivalent, 8 
predicate, 35, 41 
propositional, 25 
Well-ordering principle, 119, 250, 702 
Wff. See Well-formed formula 
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